fixed potential deadlock when a heartbeat request fails #1286

aksdb · 2019-02-18T10:20:07Z

When using a ConsumerGroup, I managed to block all but one of the consumers after Kafka went down (while Zookeeper kept running).

Test setup:

zookeeper + kafka
1 partition
run a consumer multiple times, restarting as desired so that the group gets rebalanced (just in case)
kill kafka: one consumer properly leaves the "Consume" method while the others keep waiting

Debugging revealed that "retries" went into negative numbers, simply because the remaining retry count was not checked at that point in the code when a heartbeat request failed. My PR adds this missing check and therefore allows a clean shutdown when the Kafka connection fails.

aksdb · 2019-02-18T10:24:06Z

I signed the CLA but fail to see how I can rerun the check.

bai · 2019-02-19T06:01:32Z

Thanks for contributing! I re-ran the CLA check and all is good.

fixed potential deadlock when a heartbeat request fails

6c949c0

ghost added the cla-needed label Feb 18, 2019

ghost removed the cla-needed label Feb 19, 2019

bai merged commit dca1ba6 into IBM:master Feb 19, 2019
