GH-149: Improve error messages on write thread failure #150

C0urante · 2021-09-21T21:28:44Z

Addresses #149.

The KCBQThreadPoolExecutor is modified to only track one write thread exception at a time, which allows us to include a complete stack trace of that exception when failing the task.
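As a rough illustration of tracking a single write thread exception, here is a minimal sketch. It is not the connector's actual code: the class `FirstErrorExecutor`, the method `maybeThrowEncounteredError`, and the error message are hypothetical. It assumes tasks are handed over with `execute()`, since `afterExecute` does not receive the exception directly for tasks passed to `submit()`.

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical stand-in for KCBQThreadPoolExecutor; names and message are assumptions.
class FirstErrorExecutor extends ThreadPoolExecutor {
  // Holds at most one exception: the first one any write thread reports.
  private final AtomicReference<Throwable> encounteredError = new AtomicReference<>();

  FirstErrorExecutor(int threads) {
    super(threads, threads, 0L, TimeUnit.MILLISECONDS, new LinkedBlockingQueue<>());
  }

  @Override
  protected void afterExecute(Runnable task, Throwable thrown) {
    super.afterExecute(task, thrown);
    if (thrown != null) {
      // Only succeeds while the reference is still empty, so later failures are ignored.
      encounteredError.compareAndSet(null, thrown);
    }
  }

  // Re-throws the stored failure as the cause, so its full stack trace is preserved.
  void maybeThrowEncounteredError() {
    Throwable error = encounteredError.get();
    if (error != null) {
      throw new RuntimeException("A write thread has failed with an unrecoverable error", error);
    }
  }
}
```

Because the original exception is attached as the cause rather than flattened into a message string, whoever catches the wrapper sees the complete stack trace of the write thread failure.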

The "Attempted to reduce batch size below 1." error is rewritten to include more information on a potential root cause and make the attached cause of the exception clearer.

Some unnecessary exception wrapping when instantiating various AbstractConfig subclasses is also cleaned up.

...ector/src/main/java/com/wepay/kafka/connect/bigquery/write/batch/KCBQThreadPoolExecutor.java

ddasarathan

From what I understand, all threads can throw exceptions but only the first gets logged and that bubbles up and kills the task. Since the exception will be thrown, we do not want to reset the encounteredError atomic ref?

ddasarathan

LGTM

C0urante · 2021-11-17T15:13:00Z

> From what I understand, all threads can throw exceptions but only the first gets logged and that bubbles up and kills the task. Since the exception will be thrown, we do not want to reset the encounteredError atomic ref?

We do not reset the reference because the exception may be used multiple times. We throw exceptions encountered from write threads in both BigQuerySinkTask::flush (as part of the call to KCBQThreadPoolExecutor::awaitCurrentTasks) and at the very beginning of BigQuerySinkTask::put; the former prevents us from committing offsets for data that we weren't able to write to BigQuery (see #68), and the latter causes the task to fail (since throwing an error from SinkTask::flush doesn't actually cause a task to fail).
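To make the reasoning above concrete, here is a minimal sketch of why the reference is read without being cleared. It does not use the Kafka Connect API: `SketchSinkTask`, `recordWriteFailure`, and the error message are hypothetical, and `flush()` and `put()` only stand in for BigQuerySinkTask::flush and BigQuerySinkTask::put.

```java
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical sketch; not the connector's actual classes or signatures.
class SketchSinkTask {
  private final AtomicReference<Throwable> encounteredError = new AtomicReference<>();

  // Called when a write thread fails; only the first failure is kept.
  void recordWriteFailure(Throwable error) {
    encounteredError.compareAndSet(null, error);
  }

  // Uses get() rather than getAndSet(null), so the same error can be thrown more than once.
  private void maybeThrowEncounteredError() {
    Throwable error = encounteredError.get();
    if (error != null) {
      throw new RuntimeException("A write thread has failed", error);
    }
  }

  // Stand-in for flush: throwing here prevents offsets from being committed.
  void flush() {
    maybeThrowEncounteredError();
  }

  // Stand-in for put: throwing here is what causes the task to fail.
  void put() {
    maybeThrowEncounteredError();
    // ... records would be handed to the write threads here ...
  }
}
```

If the helper cleared the reference on the first throw, a failure surfaced from `flush()` would block the offset commit but the following `put()` would find nothing to throw, and the task would keep running.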

C0urante · 2021-11-17T15:15:21Z

@ddasarathan thanks for taking a look. Given the LGTM, I'll merge, but if you are unsatisfied with my explanation of the way we use KCBQThreadPoolExecutor.encounteredError, please let me know; I'm happy to revert if something isn't right here, or to file a follow-up if there's room for improvement.

C0urante requested a review from a team September 21, 2021 22:00

C0urante mentioned this pull request Sep 23, 2021

Adding message metadata in logs in case of errors #128

Closed

C0urante force-pushed the gh-149 branch from 92a8fff to 85299c3 Compare September 27, 2021 17:43

ddasarathan reviewed Nov 17, 2021

ddasarathan reviewed Nov 17, 2021

ddasarathan approved these changes Nov 17, 2021

C0urante added 2 commits November 17, 2021 10:07

GH-149: Improve error messages on write thread failure

adde608

GH-149: Add more detail to error message for batch reduction error

ba36e59

C0urante force-pushed the gh-149 branch from 85299c3 to ba36e59 Compare November 17, 2021 15:07

C0urante merged commit d8fc535 into 1.6.x Nov 17, 2021

C0urante deleted the gh-149 branch November 17, 2021 15:15

C0urante mentioned this pull request Nov 17, 2021

Improved error messages on write thread failure #149

Closed
