Fix panic: cannot consume from pending buffer #303

NobodyXu · 2024-10-15T13:30:14Z

Fixed #298

This PR changes the decoder do_poll_read impl, to not advance the buffer on first flush.

The panic is because the decoder try to advance the buffer before polling the underlying buf reader.

In this PR I tried a different fix, by making sure buf.consume is always called, even on error.
I suspect that previously we didn't consume the buffer on error, and that might have caused the same data to be decompressed again.

~~I can't think of anywhere else that could fail, the decoder implementation looks alright.~~

Fixed #298 Signed-off-by: Jiahao XU <Jiahao_XU@outlook.com>

NobodyXu · 2024-10-15T13:33:50Z

@Turbo87 Can you try this PR please?

Turbo87 · 2024-10-15T15:01:23Z

yep, I'll give it a try.

Turbo87 · 2024-10-16T09:34:19Z

unfortunately still failing 😢

NobodyXu · 2024-10-16T11:12:29Z

Thanks.

Is the software open-source?

Can I have a look at the code and the test?

Turbo87 · 2024-10-16T12:00:19Z

Can I have a look at the code and the test?

the code yes, the test no. unfortunately we don't have a test that reproduces it. I can only run it on our staging environment where I can reproduce it. our test suite runs with an in-memory object_store instance instead of the S3 implementation we use on staging and production.

Signed-off-by: Jiahao XU <30436523+NobodyXu@users.noreply.github.com>

NobodyXu · 2024-10-16T13:52:37Z

I've found the cause of the panic.

It is because the decoder try to advance the buffer before polling the underlying buf reader.

cc @Turbo87 I've updated the PR, can you try again please?

Thank you!

Turbo87 · 2024-10-16T16:29:20Z

the panic appears to be gone, but we're now seeing an "interval out of range" error result without a stacktrace. I will have to improve our logging a bit to figure out where exactly that is coming from.

NobodyXu · 2024-10-18T21:25:12Z

cc @robjtede Shall we merge and publish this for now, since it at least fixes the panic for @Turbo87

Turbo87 · 2024-10-19T08:39:26Z

we're now seeing an "interval out of range" error

it turns out that this was a bug on our side, related to how we calculate exponential backoff for failed jobs.

I can confirm that #303 appears to fix the issue for us! 🎉

thanks again! :)

NobodyXu · 2024-10-19T08:42:04Z

Thank you!

NobodyXu · 2024-10-20T02:10:19Z

cc @robjtede let's get this merged and cut a new release, as it is confirmed to fix the panic

NobodyXu · 2024-10-20T06:33:29Z

I will get this merged and ask for review in the release PR.

robjtede · 2024-10-20T11:53:53Z

Ahh wonderful, yes, lets get this out today.

Turbo87 · 2024-10-20T21:02:56Z

thanks again for the investigation, fix and release! I just merged the latest update into crates.io :)

Fix panic: cannot consume from pending buffer

cb122d4

Fixed #298 Signed-off-by: Jiahao XU <Jiahao_XU@outlook.com>

robjtede added the A-semver-patch bug fixes label Oct 15, 2024

NobodyXu added 2 commits October 17, 2024 00:43

Do not consume buf on first flush tokio bufread decoder impl

eb07612

Signed-off-by: Jiahao XU <30436523+NobodyXu@users.noreply.github.com>

Do not advance buffer in first flush in futures bufread decoder impl

241bee6

Signed-off-by: Jiahao XU <30436523+NobodyXu@users.noreply.github.com>

NobodyXu requested a review from robjtede October 19, 2024 08:41

This comment has been minimized.

Sign in to view

NobodyXu added this pull request to the merge queue Oct 20, 2024

Merged via the queue into main with commit 014e6e4 Oct 20, 2024
16 checks passed

NobodyXu deleted the fix/panic branch October 20, 2024 06:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix panic: cannot consume from pending buffer #303

Fix panic: cannot consume from pending buffer #303

NobodyXu commented Oct 15, 2024 •

edited

Loading

NobodyXu commented Oct 15, 2024

Turbo87 commented Oct 15, 2024

Turbo87 commented Oct 16, 2024

NobodyXu commented Oct 16, 2024

Turbo87 commented Oct 16, 2024

NobodyXu commented Oct 16, 2024

Turbo87 commented Oct 16, 2024

NobodyXu commented Oct 18, 2024

Turbo87 commented Oct 19, 2024

NobodyXu commented Oct 19, 2024

This comment has been minimized.

NobodyXu commented Oct 20, 2024

NobodyXu commented Oct 20, 2024

robjtede commented Oct 20, 2024

Turbo87 commented Oct 20, 2024

Fix panic: cannot consume from pending buffer #303

Fix panic: cannot consume from pending buffer #303

Conversation

NobodyXu commented Oct 15, 2024 • edited Loading

NobodyXu commented Oct 15, 2024

Turbo87 commented Oct 15, 2024

Turbo87 commented Oct 16, 2024

NobodyXu commented Oct 16, 2024

Turbo87 commented Oct 16, 2024

NobodyXu commented Oct 16, 2024

Turbo87 commented Oct 16, 2024

NobodyXu commented Oct 18, 2024

Turbo87 commented Oct 19, 2024

NobodyXu commented Oct 19, 2024

This comment has been minimized.

NobodyXu commented Oct 20, 2024

NobodyXu commented Oct 20, 2024

robjtede commented Oct 20, 2024

Turbo87 commented Oct 20, 2024

NobodyXu commented Oct 15, 2024 •

edited

Loading