Add support for presigned-url reading to the default client #180

Merged (4 commits) on Apr 25, 2024

@nicklan (Collaborator) commented Apr 24, 2024

This is most of the non-testing code from the hackathon. Most of the code is taken from #144. It supports:

  1. Reading parquet files from http/https
  2. Reading via the filesystem interface from http/https
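
As a rough illustration of the two read paths listed above, the sketch below routes a file location by its URL scheme. This is a minimal sketch, not the PR's code: the names `ReadPath` and `read_path_for` are hypothetical, and the assumption that the client decides purely on the scheme is mine.

```rust
/// Hypothetical classification of how a file location should be read.
/// Not taken from the PR; for illustration only.
#[derive(Debug, PartialEq)]
enum ReadPath {
    /// Plain HTTP(S) GET, e.g. against a presigned URL.
    Http,
    /// Anything else falls through to the regular storage path.
    Storage,
}

/// Pick a read path from the URL scheme (compared case-insensitively).
/// Assumption: the scheme alone is enough to decide.
fn read_path_for(location: &str) -> ReadPath {
    // Everything before "://" is treated as the scheme, if present.
    let scheme = location
        .split_once("://")
        .map(|(scheme, _rest)| scheme.to_ascii_lowercase());
    match scheme.as_deref() {
        Some("http") | Some("https") => ReadPath::Http,
        _ => ReadPath::Storage,
    }
}
```

With this sketch, a call such as `read_path_for("https://example.com/part-0.parquet?sig=abc")` selects the HTTP path, while an `s3://` or `file://` location falls through to the storage path.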

Currently working on how best to test this, since we can't have stable URLs to read from.

Resolved review threads: kernel/src/client/default/filesystem.rs (2), kernel/src/client/default/parquet.rs (3)
        rbr.map_err(Error::Arrow)
            .and_then(|rb| reorder_record_batch(rb, &requested_ordering))
    });
    Ok(stream.boxed())
Collaborator

For the code above, don't we do this elsewhere? Can we reuse it? Maybe I'm missing something, but I had the feeling this exists elsewhere.

Collaborator Author

Yeah, it's similar to the code in the ParquetOpener, but that uses a ParquetRecordBatchStreamBuilder and this uses a ParquetRecordBatchReaderBuilder. They have a common trait, but you'll notice that the build method is defined separately for the sync and async cases, and since we use both, the types get really gross if you try to make this code shared.
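
To make the point about the separately defined build method concrete, here is a minimal std-only sketch. It assumes only what the comment describes: configuration is shared between the two builders, but `build` exists separately for the sync and async flavors and returns a different type in each. All names (`ReaderBuilder`, `SyncMode`, `AsyncMode`, `SyncReader`, `AsyncStream`, `configure`) are hypothetical stand-ins, not the parquet crate's API.

```rust
use std::marker::PhantomData;

// Marker types standing in for the sync and async reader flavors (hypothetical).
struct SyncMode;
struct AsyncMode;

/// Hypothetical builder: configuration is shared across modes, `build` is not.
struct ReaderBuilder<M> {
    batch_size: usize,
    _mode: PhantomData<M>,
}

impl<M> ReaderBuilder<M> {
    fn new() -> Self {
        Self { batch_size: 1024, _mode: PhantomData }
    }

    /// Shared configuration is available for any mode.
    fn with_batch_size(mut self, batch_size: usize) -> Self {
        self.batch_size = batch_size;
        self
    }
}

// Stand-ins for the two different products of `build`.
struct SyncReader {
    batch_size: usize,
}
struct AsyncStream {
    batch_size: usize,
}

// `build` is defined once per concrete mode, with a different return type each time.
impl ReaderBuilder<SyncMode> {
    fn build(self) -> SyncReader {
        SyncReader { batch_size: self.batch_size }
    }
}

impl ReaderBuilder<AsyncMode> {
    fn build(self) -> AsyncStream {
        AsyncStream { batch_size: self.batch_size }
    }
}

/// Code that only touches shared configuration can stay generic over the mode.
fn configure<M>(builder: ReaderBuilder<M>) -> ReaderBuilder<M> {
    builder.with_batch_size(64)
}

// A generic `fn finish<M>(b: ReaderBuilder<M>) -> ... { b.build() }` would not
// compile: `build` is not defined for an arbitrary `M`, and there is no single
// return type to name. Sharing the post-build code therefore needs an extra
// trait or duplicated logic, which is where the types become unwieldy.
```

In this sketch `configure(ReaderBuilder::<SyncMode>::new()).build()` yields a `SyncReader` and the `AsyncMode` equivalent yields an `AsyncStream`, so only the configuration step is reusable as written.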

@hntd187 (Collaborator) left a comment

LGTM

@nicklan merged commit 6761b9d into delta-incubator:main on Apr 25, 2024
8 checks passed
3 participants