Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1.4.3 #183

Merged
merged 3 commits into from
Nov 11, 2019
Merged

1.4.3 #183

merged 3 commits into from
Nov 11, 2019

Conversation

xitongsys
Copy link
Owner

No description provided.

@xitongsys xitongsys merged commit 197c897 into master Nov 11, 2019
zolstein pushed a commit to zolstein/parquet-go that referenced this pull request Jun 23, 2023
* refactor byteArrayPage: remove offsets array

* rename Int8 encodings to Levels encoding

* change type of repetition and definition levels from []int8 to []byte

* refactor DOUBLE encoding

* refactor FLOAT encoding

* refactor INT96 encoding

* refactor INT64 encoding

* refactor INT32 encoding

* refactor BOOLEAN encoding

* add Encode/Decode methods to Type interface

* refactor encoding/fuzz package

* refactor encoding tests

* only write pages to filters if they don't have a dictionary

* remove Encode method from Page interface

* move page reader types to page_reader.go

* reformat code

* remove internal generic types (xitongsys#176)

* remove internal generic types

* refactor: add parquet.Page.Type (xitongsys#177)

* refactor: add parquet.Page.Type

* refactor indexed page type (xitongsys#178)

* refactor page index type

* Update dictionary.go

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

* fix bit-packed encoding of booleans in memory (xitongsys#179)

* fix bit-packed encoding of booleans in memory

* fix panic on Go 1.17

* fuzz RLE encoding + fix

* Update dictionary.go

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

* Update encoding/rle/rle.go

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

* rename: values => bits

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

* simplify page decoding

* remove sizeOf* functions

* Optimize file reader (xitongsys#182)

* add Close methods on page and value readers

* pool large memory buffers + document resource management on Pages instances

* optimize reader + close rows where needed + simplify page value readers

* fix typos

* show example closing reader

* Update reader.go

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

* add deprecated BIT_PACKED encoding (xitongsys#181)

* Refactor parquet.RowReader and parquet.RowWriter (xitongsys#183)

* add Close methods on page and value readers

* pool large memory buffers + document resource management on Pages instances

* optimize reader + close rows where needed + simplify page value readers

* fix typos

* show example closing reader

* add buffer benchmarks

* report row throughput in buffer benchmarks

* refactor parquet.RowWriter

* refactor parquet.RowReader

* mirror writer implementation on buffer

* Update row.go

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

* Optimize parquet row reader (xitongsys#184)

* simplify rowGroupRows: reduce memory footprint + more immutable state + use booleans to explicitly represent the state

* optimize parquet row reader

* rename: columnReadRows => readRows

Co-authored-by: Kevin Burke <kevin.burke@segment.com>

* fix bugs that were detected after merging latest changes

Co-authored-by: Kevin Burke <kevin.burke@segment.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant