
Fix S3InputStream's handling of large skips #24521

Merged: 1 commit into trinodb:master on Dec 19, 2024

Conversation

alexjo2144
Member

alexjo2144 commented Dec 18, 2024

Description

When the skip(n) method is called, the MAX_SKIP_BYTES check is bypassed, so the call can potentially block for a long time.

Instead of delegating to the underlying stream, set the nextReadPosition value. This allows the next read to decide whether it is best to keep the existing S3 object stream or open a new one.

This behavior matches the implementations for Azure and GCS.
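The lazy-seek approach described above can be sketched as follows. This is a hypothetical illustration of the technique (the name nextReadPosition comes from the PR description), not the actual Trino S3InputStream code, and it uses an in-memory byte array where the real class wraps a remote S3 object stream:

```java
import java.io.InputStream;

// Sketch: skip() only advances a logical position; the next read() is
// responsible for reconciling that position with the underlying stream.
class LazySeekInputStream extends InputStream {
    private final byte[] data; // stands in for the remote S3 object
    private long nextReadPosition;

    LazySeekInputStream(byte[] data) {
        this.data = data;
    }

    @Override
    public long skip(long n) {
        // Instead of draining the underlying stream (which can block for a
        // long time on large n), just advance the logical position. The next
        // read decides whether to reuse the current object stream or open a
        // new one at nextReadPosition.
        long skipped = Math.min(n, data.length - nextReadPosition);
        nextReadPosition += skipped;
        return skipped;
    }

    @Override
    public int read() {
        if (nextReadPosition >= data.length) {
            return -1;
        }
        // A real implementation would compare nextReadPosition with the
        // current stream position here and reopen or drain as appropriate.
        return data[(int) nextReadPosition++] & 0xFF;
    }
}
```

With this shape, a skip far past the current position costs nothing until the next read actually happens.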

Additional context and related issues

Release notes

( ) This is not user-visible or is docs only, and no release notes are required.
( ) Release notes are required. Please propose a release note for me.
( ) Release notes are required, with the following suggested text:

## Section
* Fix some things. ({issue}`issuenumber`)

@alexjo2144
Member Author

alexjo2144 commented Dec 18, 2024

I was able to reproduce this issue, but not in a way that makes it easy to write a regression test. What I did was upload a 10GB uncompressed JSON file to S3 and set up the TextLineReader to read a small split. Before this change, that reader took much longer to load a split at the end of the file than at the start. After this change, load time is constant regardless of the split's start offset.
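The asymmetry described above follows from the default InputStream.skip, which skips by reading and discarding bytes, so skipping to a late offset transfers (and throws away) everything before it. A small sketch that counts how many bytes the default skip actually pulls from the source (the class and field names here are illustrative, not from the PR):

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Counts every byte the default InputStream.skip drains from the source,
// illustrating why a large skip over a remote stream is O(offset) work.
class CountingStream extends InputStream {
    long bytesServed;
    private final long length;
    private long pos;

    CountingStream(long length) {
        this.length = length;
    }

    @Override
    public int read() {
        if (pos >= length) {
            return -1;
        }
        pos++;
        bytesServed++; // every skipped byte still passes through here
        return 0;
    }

    @Override
    public long skip(long n) {
        try {
            // Deliberately use the inherited read-and-discard implementation.
            return super.skip(n);
        }
        catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Against a network-backed stream, those discarded bytes are real transfer time, which matches the slow splits observed at the end of the 10GB file.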

alexjo2144 force-pushed the ajo/s3-input-stream-skip branch from 11fba15 to 5fe42db on December 18, 2024 22:02
@wendigo
Contributor

wendigo commented Dec 19, 2024

I'm testing this with secrets now

@findinpath
Contributor

resulting in the call potentially blocking for a long time.

What is the further consequence of this?
Please enhance the original description of the issue.

@wendigo
Contributor

wendigo commented Dec 19, 2024

@findinpath I think the existing description is exhaustive enough. For the S3 filesystem, any delayed request makes planning/execution take longer than necessary.

@wendigo
Contributor

wendigo commented Dec 19, 2024

@alexjo2144 thanks, merging

wendigo merged commit 6c52253 into trinodb:master on Dec 19, 2024
128 checks passed
github-actions bot added this to the 469 milestone on Dec 19, 2024