Supports reading multiple spans per Kafka message #995

codefromthecrypt · 2016-02-23T08:12:39Z

Kafka messages have binary payloads and no key. The binary contents are
serialized TBinaryProtocol thrift messages. This change peeks at the
first bytes to see whether the payload is a List of structs or not, and
reads accordingly.

This approach would need a revision if we ever add a Struct field to
Span. However, that is unlikely. At the point we change the structure of
Span, we'd likely change other aspects which would make it a different
struct completely (see #939). In such case, we'd add a key to the kafka
message of the span version, and not hit the code affected in this
change.

Fixes #979
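
A minimal sketch of the peek described above, not the code in this change. It assumes the TBinaryProtocol layout in which a list begins with a one-byte element type followed by an i32 size, and a struct begins with its first field header (one-byte field type, then an i16 field id). It also assumes Span's first field is an i64, so a single span starts with the I64 type byte rather than the STRUCT type byte. The function name and sample payloads are hypothetical.

```python
import struct

# Thrift type ids used by TBinaryProtocol.
TTYPE_I64 = 10
TTYPE_STRUCT = 12


def is_span_list(payload: bytes) -> bool:
    """Guess whether a Kafka message payload is a list of structs.

    A list header starts with the element type, so a list of structs
    starts with TTYPE_STRUCT. A single struct starts with the type of
    its first field, assumed here to be an i64.
    """
    if not payload:
        raise ValueError("empty Kafka message payload")
    return payload[0] == TTYPE_STRUCT


# Hypothetical payload prefixes, hand-encoded for illustration only.
# List header: element type STRUCT, then a big-endian i32 size of 2.
list_prefix = struct.pack(">bi", TTYPE_STRUCT, 2)
# Struct prefix: field type I64, field id 1, then the i64 value.
single_prefix = struct.pack(">bhq", TTYPE_I64, 1, 1234)

print(is_span_list(list_prefix))    # list of spans
print(is_span_list(single_prefix))  # single span
```

This also shows why a Struct as the first field of Span would break the check: a single span would then start with the same byte as a list of structs.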

codefromthecrypt · 2016-02-23T08:13:17Z

cc @clehene @kristofa @prat0318

so after this.. instrumentation can choose to either TBinaryProtocol encode a single span, or a list of spans.

eirslett · 2016-02-24T15:53:29Z

I must admit this feels very hacky and brittle to me... but it could work as a temporary migration path. In the future, I suggest we should only support collecting a list of spans. For Zipkin v2, we could provide a deprecation warning - add warning log messages whenever the collector receives single spans, and not a list of spans?

yurishkuro · 2016-02-24T16:06:58Z

Did anyone do any benchmarks to show that writing/reading Kafka messages with a single span is slower than messages with a batch of spans?

I do agree with @eirslett that APIs that accept just a single span instead of a collection are suboptimal. However, logging a warning would just spam the collector's log, since the collector can't do much about what clients submit.

codefromthecrypt · 2016-02-24T16:37:42Z

I doubt there is an existing benchmark for any Kafka zipkin reporter, much less single span vs list. I think the key here is that list gives more flexibility for instrumentation, including a means to benchmark and report back.

Personally, I am cool chasing instrumentation to have them switch to list, as a list of one is only several bytes of overhead. Kafka is still newish (except Ruby). Also happy to add a "log once" message that notes the span id and endpoint of single-span Kafka messages. That could prevent logs from cluttering.

I agree this is hacky, I just don't know a cheaper way to help folks move off single span without breaking them. Adding a key to the message would seem less hacky, but would take more discussion, for example. I was hoping to save that energy for v2 (ack list-only).

I also think benchmarks would be helpful, but there's a chicken-and-egg problem. Ideally, I would like Prat to respond back when he can try list. I'm unsurprised about meh reactions. Does anyone feel we shouldn't go down the peek path with the above context in mind?
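
A rough sketch of the "log once" idea, assuming nothing about the collector's real logging setup. The class name, logger name, and message wording are all hypothetical; the point is only that the first single-span message is reported with its span id and endpoint, and later ones are silent.

```python
import logging
import threading

# Hypothetical logger name, chosen for this sketch only.
logger = logging.getLogger("kafka_collector_sketch")


class SingleSpanWarning:
    """Logs a warning for the first single-span message only."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._warned = False

    def warn_once(self, span_id: int, endpoint: str) -> bool:
        """Return True if this call logged, False if already warned."""
        with self._lock:
            if self._warned:
                return False
            self._warned = True
        logger.warning(
            "Received a Kafka message with a single span "
            "(span id %016x, endpoint %s); please send a list of spans",
            span_id & 0xFFFFFFFFFFFFFFFF,  # show the i64 id as unsigned hex
            endpoint,
        )
        return True


warning = SingleSpanWarning()
first = warning.warn_once(1234, "service-a")
second = warning.warn_once(5678, "service-b")
```

The lock keeps the check-and-set atomic when several consumer threads see single-span messages at the same time.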

eirslett · 2016-02-24T17:00:24Z

Don't get me wrong, I'm +1 for this change. I don't like the solution, but I still think it's the best solution - there's no smooth migration path. One possible alternative would be to consume span collections on a separate kafka topic, but then we get lots of additional complexity from handling two topics.

prat0318 · 2016-02-24T17:41:50Z

I will try running the collector with this patch to check the perf

yurishkuro · 2016-02-24T18:49:13Z

is there indeed a lot of setup overhead to use a different topic for different message format?

prat0318 · 2016-02-24T19:08:17Z

From a usability perspective, maintaining multiple topics would be hard imo. For testing perf and maintaining backwards compat, it would be helpful though.

prat0318 · 2016-03-01T20:39:08Z

In my experiment, each Trace has 9 spans, which means without the patch there are 9 kafka messages per trace.

| | Traces/s/partition | Spans/s/partition | Messages/s/partition |
|---|---|---|---|
| Production | 174.5 | 1570 | 1570 |
| Consumption | 40.5 | 364 | 364 |

With bundling, all 9 spans are batched up in 2 kafka messages.

| | Traces/s/partition | Spans/s/partition | Messages/s/partition |
|---|---|---|---|
| Production | 200 | 1800 | 400 |
| Consumption | 175 | 1580 | 351 |

As we can see, we straight away get a perf improvement of ~4.5 times. This should
be ~9 times if I batch up all spans in a single kafka message (it just needed some
more code changes at my end, so I just went ahead with 2 messages).

So, what I see is that the message consumption rate stays roughly constant at ~350, but we
can increase our trace consumption rate by bundling spans together. So, I am super excited
to have this patch merged to master. 👍

cc @adriancole @yurishkuro
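
For reference, the ratios behind the consumption figures reported above can be worked out as follows. This is only arithmetic on the reported numbers; the variable names are illustrative.

```python
# Consumption rates per partition, copied from the figures reported above.
before = {"traces": 40.5, "spans": 364, "messages": 364}
after = {"traces": 175, "spans": 1580, "messages": 351}

# How much faster traces are consumed once spans are bundled.
trace_speedup = after["traces"] / before["traces"]

# Average number of spans carried by each consumed message after bundling.
spans_per_message = after["spans"] / after["messages"]

# Message consumption rate after bundling, relative to before.
message_rate_ratio = after["messages"] / before["messages"]

print(f"trace consumption speedup: {trace_speedup:.2f}x")        # 4.32x
print(f"spans per consumed message: {spans_per_message:.2f}")    # 4.50
print(f"message rate after vs before: {message_rate_ratio:.2f}") # 0.96
```

The message rate barely changes (about 0.96 of the original), while each message carries about 4.5 spans, which is where the trace throughput gain comes from.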

codefromthecrypt · 2016-03-02T01:26:22Z

I've not heard any feedback against this from a kafka transport user, and this clearly will help @prat0318 and move us towards lists as the de facto unit-of-transport.

merging

Supports reading multiple spans per Kafka message

codefromthecrypt mentioned this pull request Feb 23, 2016

Zipkin collector version 1.14.1 fails to collect after an invalid span is posted elodina/zipkin-mesos-framework#8

Open

codefromthecrypt mentioned this pull request Mar 1, 2016

Cassandra serializes Dependencies when only dependencies.links is ever read #1008

Open

codefromthecrypt pushed a commit that referenced this pull request Mar 2, 2016

Merge pull request #995 from openzipkin/multi-kafka

fe6cf88

Supports reading multiple spans per Kafka message

codefromthecrypt merged commit fe6cf88 into master Mar 2, 2016

codefromthecrypt deleted the multi-kafka branch March 2, 2016 01:26

codefromthecrypt mentioned this pull request Mar 4, 2016

Refactors such that Kafka bundles spans and shares more code openzipkin/brave#143

Merged

mjbryant mentioned this pull request Sep 21, 2016

Support sending spans in batches Yelp/py_zipkin#6

Closed
