Use rdkafka event API instead of the callback API #617
Conversation
Force-pushed from 94da831 to d25c7d9
Hey, could you explain to a fellow maintainer (rdkafka-ruby here) what benefits it yields for the consumer? I clearly see benefits for the admin and producer (where the delivery reports materialization is easier to handle), but I wonder about the consumer. I would appreciate some explanation, as it has been a while since I started thinking about this and I have problems wrapping my head around the consumer part. Is it not possible to yield from Rust in the context of the rebalance callback? How will it operate on rebalances where the revocation happens first and, prior to it, an action like offset storage could take place? Thanks 🙏
Force-pushed from d25c7d9 to 113af85
Hello @mensfeld! The rebalance handler does not really change here for rust-rdkafka. Is there anything in particular that gives that impression? Could you elaborate a little bit more on this?
LGTM in general!
src/consumer/base_consumer.rs (outdated)
```rust
            }
        }

        if op_timeout >= timeout {
```
We set `op_timeout` to `std::cmp::min(timeout, min_poll_interval)`, so it's never going to be larger than `timeout`. Shouldn't we do something like:
```rust
let now = Instant::now();
let op_timeout = std::cmp::min(timeout, min_poll_interval);
// ...
let timeout = timeout.saturating_sub(now.elapsed());
if timeout.is_zero() {
    return None;
}
```
It seems to work fine for the following common cases. Did I miss something?
- If the user provides a `timeout` set to 100ms, then the `op_timeout` will be 100ms (as `min_poll_interval` is 1 sec by default). If no msgs or errors have been returned, then on this check `op_timeout` is equal to `timeout` and we return `None`.
- If the user provides a `timeout` set to 2 seconds (over the default for `min_poll_interval`), then `op_timeout` will be 1 second. Let's say we poll for 1 second and it does not return a msg/error; then `op_timeout` is less than `timeout` and we do `timeout -= op_timeout`, which sets `timeout` to 1 second. If we get no messages/errors again, then `op_timeout` is equal to `timeout` and we return `None`.
- If the user provides `timeout` set to `Never`, then `op_timeout` is set to the default of 1 sec and we keep polling every second until we get a message or error.
I was concerned about something like the following:

- The user sets the `timeout` to 100ms.
- After 1ms, we get an `RD_KAFKA_EVENT_OFFSET_COMMIT` event.
- We do `timeout -= op_timeout`, which puts `timeout` at 0.
- We exit.
Ah, that's a good catch. OK, I'll fix this.
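For reference, a minimal sketch of the elapsed-time accounting discussed above; `poll_once` and `Message` are hypothetical stand-ins, not rust-rdkafka's actual internals:

```rust
use std::time::{Duration, Instant};

struct Message; // hypothetical placeholder for a polled message

fn poll_once(_op_timeout: Duration) -> Option<Message> {
    None // hypothetical: serve at most one event from the queue
}

// Deduct the time *actually* spent on each iteration, so an event that
// returns early (e.g. RD_KAFKA_EVENT_OFFSET_COMMIT after 1ms) doesn't
// charge the whole op_timeout against the user's budget.
fn poll_with_budget(mut remaining: Duration, min_poll_interval: Duration) -> Option<Message> {
    loop {
        let op_timeout = std::cmp::min(remaining, min_poll_interval);
        let start = Instant::now();
        if let Some(msg) = poll_once(op_timeout) {
            return Some(msg);
        }
        remaining = remaining.saturating_sub(start.elapsed());
        if remaining.is_zero() {
            return None;
        }
    }
}
```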
```rust
} else {
    while !self.closed() {
        self.poll(Duration::from_millis(100));
    }
```
Is this safe? Could we drop messages here and have their offsets committed to Kafka?
That's a good question. My understanding is that messages won't be forwarded here anymore. In this particular case we had to replace `rd_kafka_consumer_close` with `rd_kafka_consumer_close_queue`, because `rd_kafka_consumer_close` blocks until the consumer has revoked its assignment by calling `Unassign` or `IncrementalUnassign` in the rebalance handler. That would leave the consumer blocked forever, since there's no rebalance callback registered anymore. Hence we had to move to `rd_kafka_consumer_close_queue`, which allows us to close the consumer asynchronously. In this particular case, by calling `Poll` we can forward the rebalance event to the `rebalance` method on the `ConsumerContext`, following the API doc:

> The application must poll/serve this queue until rd_kafka_consumer_closed() returns true.
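A rough sketch of that close sequence; `close_queue` and `closed` are assumed wrappers around `rd_kafka_consumer_close_queue` and `rd_kafka_consumer_closed`, not necessarily the exact methods this PR introduces:

```rust
use std::time::Duration;

// Assumed wrappers around librdkafka's consumer-close API; the real
// methods in this PR may differ in name and signature.
trait AsyncClose {
    fn close_queue(&self);             // rd_kafka_consumer_close_queue
    fn closed(&self) -> bool;          // rd_kafka_consumer_closed
    fn poll(&self, timeout: Duration); // serves events to the ConsumerContext
}

// Start the close, then keep serving the consumer queue so the final
// rebalance (revoke) event reaches the context instead of blocking
// inside librdkafka.
fn close_consumer<C: AsyncClose>(consumer: &C) {
    consumer.close_queue();
    while !consumer.closed() {
        consumer.poll(Duration::from_millis(100));
    }
}
```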
Alright, sounds good to me.
Force-pushed from 113af85 to f3a0a53
Passing the `Arc<BaseConsumer<C>>` to `MessageStream` isn't necessary anymore, and the changes to `split_partition_queue` can be reverted as well, I think.
This is required as multiple write acks are tied to a single event.
One poll call might not be enough to serve the delivery report callbacks of the purged messages. The current flush impl will call poll multiple times until the queue is empty or the timeout is reached (see the sketch below).
If `Timeout::Never` is used, poll should eventually return a `Message` or `Error` rather than `None` when handling other events like stats, rebalances, etc.
rdkafka reports consumer errors via `RD_KAFKA_EVENT_ERROR`, but producer errors get embedded in the ack returned via `RD_KAFKA_EVENT_DR`. Hence we need to return this event in the consumer case in order to surface the error to the user.
Currently, if a group.id is not specified, we allow the use of the consumer for fetching metadata and watermarks. We keep this behaviour.
If you have a consumer wrapping this one (FFI cases), the outer consumer must close the queue and serve the events via `Poll`. Otherwise it will hang forever, as prior to calling close there's a rebalance, and rdkafka awaits a response before continuing.
With the Event API we propagate generic client instance-level errors, such as broker connection failures, authentication issues, etc. However, fatal errors are also propagated via the Event API. These indicate that the particular instance of the client (producer/consumer) has become non-functional.
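A hedged sketch of the flush loop mentioned above; the `Flushable` trait and `pending_count` helper are assumptions for illustration, not the PR's actual implementation:

```rust
use std::time::{Duration, Instant};

// Assumed minimal producer surface for this illustration.
trait Flushable {
    fn pending_count(&self) -> usize;  // messages awaiting delivery reports
    fn poll(&self, timeout: Duration); // serves delivery-report events
}

// One poll may not serve all delivery reports (e.g. for purged messages),
// so flush keeps polling until the queue drains or the deadline passes.
fn flush<P: Flushable>(producer: &P, timeout: Duration) -> Result<(), &'static str> {
    let deadline = Instant::now() + timeout;
    while producer.pending_count() > 0 {
        if Instant::now() >= deadline {
            return Err("flush timed out with messages still in flight");
        }
        producer.poll(Duration::from_millis(100));
    }
    Ok(())
}
```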
Force-pushed from f3a0a53 to 978c964
Looks great!
This: "it allows handling the rebalance events outside of the scope of the registered callback. This is especially useful for services using rust-rdkafka to build FFI libraries on top of it." Since rebalance callbacks are triggered from the thread that is polling the consumer queue (one or both with main), a closure in the callback should allow running code "outside" (in a way). So I wondered how this is done via the Event API. I am myself looking to move to the IO event API for the producer and admin, but I'm not sure what benefits it yields for regular high-level consumers.
Let me give some more context. Let's assume you provide some FFI layer on top of rust-rdkafka that allows you to create bindings for other languages (like Go, Python, etc). Using the callback approach, we register a rebalance callback with librdkafka, and that callback has to run to completion (including the *assign calls) on the thread that polls the consumer, which makes it hard to hand control to an external caller across the FFI boundary. With the Event API approach, on the other hand, there are no such limitations. You receive the assign/revoke event and can delegate the *assign calls to an external caller. For rust-rdkafka, this translates to overriding the rebalance method of the `ConsumerContext` trait (a sketch follows below).
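To illustrate, a minimal sketch of a custom context; it uses the `pre_rebalance`/`post_rebalance` hooks (signatures vary across rust-rdkafka versions) rather than the lower-level `rebalance` override, and the "notify the FFI caller" parts are placeholders:

```rust
use rdkafka::ClientContext;
use rdkafka::consumer::{ConsumerContext, Rebalance};

// Context that surfaces rebalance events to the embedding application
// instead of handling everything inside the polling thread's callback.
struct FfiContext;

impl ClientContext for FfiContext {}

impl ConsumerContext for FfiContext {
    fn pre_rebalance(&self, rebalance: &Rebalance) {
        match rebalance {
            // Placeholder: forward the new assignment to the FFI caller.
            Rebalance::Assign(tpl) => println!("assigning: {:?}", tpl),
            // Placeholder: let the caller store offsets before revocation.
            Rebalance::Revoke(tpl) => println!("revoking: {:?}", tpl),
            Rebalance::Error(e) => eprintln!("rebalance error: {}", e),
        }
    }

    fn post_rebalance(&self, rebalance: &Rebalance) {
        let _ = rebalance; // placeholder: notify the caller it completed
    }
}
```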
@scanterog thank you for your explanation. That simplified things for me a lot 🙏
Is there any chance that the consumer is never closed in the `BaseConsumer` drop method? I ran into a situation where my program never exits the while loop that checks whether the consumer is closed.
@Millione are you using the cooperative-sticky assignment strategy?
No, will it make a difference?
@Millione I thought you had ended up with an infinite-loop error that can happen with a certain combination of cooperative-sticky and some specific partition counts. Please ignore this if you're not using it, because then that's not the case.
@mensfeld Even in the case you describe, I still think it's not reasonable to loop forever here. Shouldn't the implementation prevent this?
@Millione I cannot tell you much about the implementation in Rust because I maintain the implementation in Ruby. The issue here is that polling delegates the responsibility to the C layer, and it is the C layer that may hang. If that happens, it happens. Maybe there are tools in Rust to exit out of such hanging code; at least in Ruby, they are of low quality and do not fix anything. I do not know what is happening in your particular case, but I hold the Rust bindings in high regard in terms of stability and quality.
@mensfeld Thanks for your patient explanation. I finally found that I am stuck in the poll stage for rebalancing while closing; the last frame in the stack is pthread_cond_wait.
This PR changes how rust-rdkafka interacts with librdkafka: instead of using the callback API, it moves to the event API. This PR does not change any of the public rdkafka API, but it gives us the chance to introduce some optimizations. Some of them might break the current API, though, and hence this PR does not go down that route yet. For example, consider #273.
Some other benefits:

- The `FutureProducer`, where we need to detach the `BorrowedMessage` on errors.

Performance-wise, I've run kafka-benchmark and got a pretty similar outcome for both the callback and the event based approach; I didn't find many differences here. I've also run long-running services on our staging infra and we didn't find noticeable changes in CPU/RSS on the producer/consumer side.
There's one main difference here, though. Operational errors not related to consumption (like transient broker transport failures) are passed back to the user with the Event API; this is not the case with the Callback API. Details can be found in confluentinc/librdkafka#4493. These errors should generally be considered informational, as the underlying client will automatically try to recover from any errors encountered; the application does not need to take action on them. However, fatal errors are also propagated via the Event API. These indicate that the particular instance of the client (producer/consumer) has become non-functional.
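By way of illustration, a hedged sketch of observing these errors through rust-rdkafka's `ClientContext::error` hook; the transient/fatal comments restate the paragraph above, and the logging is a placeholder for whatever the application would do:

```rust
use rdkafka::{error::KafkaError, ClientContext};

struct ObservingContext;

impl ClientContext for ObservingContext {
    fn error(&self, error: KafkaError, reason: &str) {
        // Transient operational errors (e.g. broker transport failures) are
        // informational: librdkafka retries internally, so the application
        // usually doesn't need to act on them.
        // A fatal error, by contrast, means this client instance has become
        // non-functional and should be torn down and recreated.
        eprintln!("client error: {error}: {reason}");
    }
}
```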