Enforce Logging of Errors in GCS Rest RetriesTests #50761

original-brownbear · 2020-01-08T19:27:13Z

It's impossible to tell why #50754 fails without this change.
We're failing to close the exchange somewhere and there is no
write timeout in the GCS SDK (something to look into separately)
only a read timeout on the socket so if we're failing on an assertion without
reading the full request body (at least into the read-buffer ... this test works
in some other spots that intentionally don't fully drain the requestBody because the body is small enough to fit into the read-buffer I think ... that's why it's only failing for the large write ... I left the small write spots as is for that reason for now) we're locking up waiting forever on write0.

This change ensure the exchange is closed in the tests where we could lock up
on a write and logs the failure so we can find out what broke #50754.

It's impossible to tell why elastic#50754 fails without this change. We're failing to close the `exchange` somewhere and there is no write timeout in the GCS SDK (something to look into separately) only a read timeout on the socket so if we're failing on an assertion without reading the full request body (at least into the read-buffer) we're locking up waiting forever on `write0`. This change ensure the `exchange` is closed in the tests where we could lock up on a write and logs the failure so we can find out what broke elastic#50754.

elasticmachine · 2020-01-08T19:27:16Z

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

original-brownbear · 2020-01-08T19:27:36Z

@tlrx it doesn't end :( but getting closer :)

tlrx

LGTM

@tlrx it doesn't end :( but getting closer :)

I know - so sorry about that. Once you got bored I'll take the relay :)

original-brownbear · 2020-01-09T08:43:41Z

Thanks Tanguy! :)

It's impossible to tell why elastic#50754 fails without this change. We're failing to close the `exchange` somewhere and there is no write timeout in the GCS SDK (something to look into separately) only a read timeout on the socket so if we're failing on an assertion without reading the full request body (at least into the read-buffer) we're locking up waiting forever on `write0`. This change ensure the `exchange` is closed in the tests where we could lock up on a write and logs the failure so we can find out what broke elastic#50754.

It's impossible to tell why #50754 fails without this change. We're failing to close the `exchange` somewhere and there is no write timeout in the GCS SDK (something to look into separately) only a read timeout on the socket so if we're failing on an assertion without reading the full request body (at least into the read-buffer) we're locking up waiting forever on `write0`. This change ensure the `exchange` is closed in the tests where we could lock up on a write and logs the failure so we can find out what broke #50754.

It's impossible to tell why elastic#50754 fails without this change. We're failing to close the `exchange` somewhere and there is no write timeout in the GCS SDK (something to look into separately) only a read timeout on the socket so if we're failing on an assertion without reading the full request body (at least into the read-buffer) we're locking up waiting forever on `write0`. This change ensure the `exchange` is closed in the tests where we could lock up on a write and logs the failure so we can find out what broke elastic#50754.

original-brownbear added >test Issues or PRs that are addressing/adding tests :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs v8.0.0 v7.6.0 labels Jan 8, 2020

original-brownbear requested a review from tlrx January 8, 2020 19:27

tlrx approved these changes Jan 9, 2020

View reviewed changes

original-brownbear merged commit 95fcee2 into elastic:master Jan 9, 2020

original-brownbear deleted the 50754-logging branch January 9, 2020 08:43

original-brownbear mentioned this pull request Jan 9, 2020

Enforce Logging of Errors in GCS Rest RetriesTests (#50761) #50783

Merged

original-brownbear restored the 50754-logging branch August 6, 2020 18:26

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enforce Logging of Errors in GCS Rest RetriesTests #50761

Enforce Logging of Errors in GCS Rest RetriesTests #50761

original-brownbear commented Jan 8, 2020 •

edited

Loading

elasticmachine commented Jan 8, 2020

original-brownbear commented Jan 8, 2020 •

edited

Loading

tlrx left a comment

original-brownbear commented Jan 9, 2020

Enforce Logging of Errors in GCS Rest RetriesTests #50761

Enforce Logging of Errors in GCS Rest RetriesTests #50761

Conversation

original-brownbear commented Jan 8, 2020 • edited Loading

elasticmachine commented Jan 8, 2020

original-brownbear commented Jan 8, 2020 • edited Loading

tlrx left a comment

Choose a reason for hiding this comment

original-brownbear commented Jan 9, 2020

original-brownbear commented Jan 8, 2020 •

edited

Loading

original-brownbear commented Jan 8, 2020 •

edited

Loading