Incorrect start offset when range reads are interrupted #446

wilson1yan · 2024-07-31T04:04:45Z

Environment details

GCP A100 machines
(unclear about exact software, as this bug was encountered internally on Google infra when running on XManager Cloud)

Steps to reproduce

Given a range read from start to end bytes, if the read is interrupted, the code here makes another GET request at the wrong start offset: the offset should be self.start + self._bytes_downloaded when self.start is not None. Currently, after resuming from an interruption, the downloader returns end + 1 bytes instead of end - start + 1 bytes, even when start is specified.

The same issue also probably happens here in Downloader.
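The offset arithmetic described above can be sketched roughly as follows. This is only an illustration, not the library's code: the function names (buggy_resume_offset, fixed_resume_offset, total_bytes_returned) are hypothetical, and it assumes an inclusive byte range and a single interruption after bytes_downloaded bytes.

```python
from typing import Optional


def buggy_resume_offset(start: Optional[int], bytes_downloaded: int) -> int:
    """Hypothetical model of the bug: the retry ignores `start`."""
    return bytes_downloaded


def fixed_resume_offset(start: Optional[int], bytes_downloaded: int) -> int:
    """Hypothetical model of the fix: the retry resumes relative to `start`."""
    return (start or 0) + bytes_downloaded


def total_bytes_returned(resume_offset: int, end: int, bytes_downloaded: int) -> int:
    """Bytes received before the interruption plus bytes fetched by the retry.

    The retry requests the inclusive range resume_offset..end.
    """
    return bytes_downloaded + (end - resume_offset + 1)


# Example: read bytes 100..199 (100 bytes), interrupted after 40 bytes.
start, end, downloaded = 100, 199, 40

buggy_total = total_bytes_returned(buggy_resume_offset(start, downloaded), end, downloaded)
fixed_total = total_bytes_returned(fixed_resume_offset(start, downloaded), end, downloaded)

print(buggy_total)  # end + 1
print(fixed_total)  # end - start + 1
```

With these numbers the buggy retry re-requests from byte 40 instead of byte 140, so the caller receives more bytes than it asked for, and bytes from the wrong region of the file.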

We encountered this issue when using the Google Python storage library in a setting where thousands of workers were doing repeated range reads on the same GCS file. The code would run and then return the wrong bytes somewhere 20-60 min in. Adding a fix (see pull request) seemed to resolve the issue.


andrewsg · 2024-08-06T21:53:50Z

Thanks very much! Looking at this now. I need to add tests so I may work on a separate PR for this issue but I appreciate your fix as well.

wilson1yan · 2024-08-06T22:05:51Z

Sounds good, feel free to work on a separate PR! Just thought I'd let you all know about the bug.

Please let me know when you end up applying the fix and/or push a new pip release. Right now my project is pip installing from this pull request directly, but it would be good to just install an official PyPI version that's updated.

andrewsg · 2024-08-07T22:18:04Z

@wilson1yan This is now being released as v2.7.2. Thanks again.

wilson1yan · 2024-08-07T22:26:20Z

Awesome, thanks!

product-auto-label bot added the api: storage label Jul 31, 2024

blunderbuss-gcf bot assigned andrewsg Jul 31, 2024

wilson1yan mentioned this issue Jul 31, 2024

Fix range read when interrupted for self.start is not None #447

Closed

andrewsg mentioned this issue Aug 7, 2024

Fix: Correctly calculate starting offset for retries of ranged reads #450

Merged

andrewsg closed this as completed in #450 Aug 7, 2024

shunping mentioned this issue Aug 13, 2024

[Bug]: [Python SDK] Data Corruption on GCS read in 2.53.0 - 2.58.0 SDKs. apache/beam#32169

Closed

17 tasks
