Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add ParquetMetaDataReader #6431

Open
wants to merge 4 commits into
base: master
Choose a base branch
from
Open

Conversation

etseidl
Copy link
Contributor

@etseidl etseidl commented Sep 20, 2024

Which issue does this PR close?

Relates to #6002.

Rationale for this change

Consolidate Parquet metadata parsing into a single API. See discussion in #6392 for additional context.

What changes are included in this PR?

Adds the ParquetMetaDataReader struct.

Are there any user-facing changes?

No

@github-actions github-actions bot added the parquet Changes to the parquet crate label Sep 20, 2024
Comment on lines +240 to +243
// TODO(ets): what is the correct behavior for missing page indexes? MetadataLoader would
// leave them as `None`, while the parser in `index_reader::read_columns_indexes` returns a
// vector of empty vectors.
// I think it's best to leave them as `None`.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the remaining outstanding issue from #6392. My preference is to leave the page indexes as None if they are not present, but this differs from current behavior.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
parquet Changes to the parquet crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant