Zipkin to Application Insights module #33

SergeyKanzhelev · 2017-06-21T20:38:32Z

Issue to discuss ongoing work in PR: #27

The main idea is to allow to report and store Zipkin traces in the format Application Insights recognize so all the power of Application Insights Analytics engine, alerting and proactive detection can be used for those traces.

Also allows to view data collected by Application Insights SDK in Zipkin UI.

aliostad · 2017-06-28T08:09:15Z

@adriancole, @SergeyKanzhelev but I think we need some discussion over it.

I have some concerns over storing zipkin as ApplicationInsight. It would be unlikely that these format are like-for-like and need to understand of possible trade-offs that we might have to make as a result.

My view is that, doing the opposite ie making AI agent to send Zipkin traces, is a better option - I believe considering AI is a late-joiner to the game and people already use many different tools, it is ideal for AI to allow extension points to publish to other systems (zipkin, ELK, graphite, etc)

codefromthecrypt · 2017-06-28T08:25:52Z

@aliostad sorry my note was about sergey being active on many things tracing, and thanking for that engagement. that's what I meant by "apologies to watchers for using github comment as messaging I will delete this". Me thanking him for collaborating in general is independent of what direction is taken on this issue.

aliostad · 2017-06-28T10:09:44Z

@adriancole sure, sorry I think my comment probably came across differently :/ I thought you were about to delete this thread so asked if you do not and keep it open so we can discuss.

Thanks for all the hard work and sorry for the confusion.

aliostad · 2017-06-28T10:13:34Z

Just another note from my experience, we use a modified ELK stack (use conveyorbelt instead of logstash) and we will be independently writing to zipkin.

I think it is ambitious to try to bring AI and zipkin closely together and even if it is possible now, their independence might mean they will part ways further on so I prefer attacking the problem as an integration rather than composition.

codefromthecrypt · 2017-06-28T10:41:41Z

@adriancole <https://github.com/adriancole> sure, sorry I think my comment probably came across differently :/ I thought you were about to delete this thread so asked if you do not and keep it open so we can discuss. Thanks for all the hard work and sorry for the confusion.

heh yeah ok. glad we're on-track again (sorry again for derailing)

SergeyKanzhelev · 2017-06-28T16:30:41Z

@aliostad, you are correct. For me one of the reason for this work is to understand how far apart Application Insights and Zipkin are with regards to data model. And we thinking of creating adapter that works backwards as well. Zipkin may be a perfect tool for the customers who cannot send data to the cloud or just want to use existing tools while using automatic data collection provided by Application Insights .NET SDK. However we still need to overcome the same gap between data models.

Once we will know the difference we can plan accordingly what changes can be done on our side and what we can contribute back. As you said we may never come to the exactly same schemas. And will always maintain the bridge between tools. However I only have the best intentions in mind and want to make it easy for developers to do distributed tracing.

@aliostad I believe Zipkin to Application Insights will be helpful for many people. If you have more concerns - please let me know.

In any case - it will be helpful if you can help with understanding of the data model gap, I was planning to post the list of questions we stumble across with @praveenbarli later today.

SergeyKanzhelev · 2017-06-30T06:47:01Z

So for the most part it's easy to match from Zipkin to Application Insights and back.

There is no concept of base span class. Application Insights defines request as incoming span and dependency as a span leaving process.

Every telemetry item can also have a set of context properties as well as generic name value dictionary.

Both request and dependency defines:

id - matches to span.id
name - matches to span.name with the same semantic
duration - matches to span.duration
Also request defines url and dependency has data - matches to well-known binary annotation http.url and sql.query. If not defined - can fall back to http.path

There are few pieces of information Application Insights typically collects that we cannot find a match for:

Success boolean field. This fields used to indicate that the span execution resulted in error. It can be an exception, bad result or http status code. Does it match semantically to error annotation inside the span?
Source and target strings. This is the name or identifier of a callee or caller service. It's typically used when collaborating components report telemetry to the different "telemetry buckets". So you know where to look for correlated spans. It is also used to indicate the source component on the application map. For SQL it may be sql server and database name. For http communication - host name, etc. I did not find matching concept in Zipkin schemas. Is Endpoint of SA annotation has this meaning?

There are also a few context properties we typically collect. I didn't find matching concepts yet.

cloud.roleInstance - typically a computer name where the component executing the span is running
cloud.roleName - role of the service. Like frontend.
location.ip - the IP address of a client called the service.

There are few more properties defined in Application Insights that we'd love to match, but they don't seems to exist as a strongly typed or well know concepts. I'm thinking there should be a configuration file to match binary annotations to well defined properties and back.

I keep trying different examples I can find to see how Zipkin is used. @adriancole do you know any examples of a good end-to-end demo app instrumented with Zipkin highlighting the best practices and possible scenarios?

codefromthecrypt · 2017-06-30T08:49:28Z

@adriancole <https://github.com/adriancole> do you know any examples of a good end-to-end demo app instrumented with Zipkin highlighting the best practices and possible scenarios?

There's no demo that shows all tracing use cases in the same app. There are a number of example apps, though: https://github.com/criteo/zipkin4net/tree/master/zipkin4net-example https://github.com/openzipkin/brave-webmvc-example https://github.com/openzipkin/zipkin-js-example https://github.com/openzipkin/zipkin-go-opentracing/tree/master/examples Also, at least in Brave, we have integration tests that show common instrumentation use cases: https://github.com/openzipkin/brave/blob/master/instrumentation/http-tests/src/main/java/brave/http/ITHttpClient.java https://github.com/openzipkin/brave/blob/master/instrumentation/http-tests/src/main/java/brave/http/ITHttpServer.java https://github.com/openzipkin/brave/blob/master/brave/src/test/java/brave/features/async/OneWaySpanTest.java Finally, the thrift docs have comments defining a lot of the semantics in greater detail than things on http://zipkin.io https://github.com/openzipkin/zipkin-api/blob/master/thrift/zipkinCore.thrift I suspect that if we needed to, we could make a super demo that included all integration patterns, just that hasn't been requested before.

SergeyKanzhelev · 2017-06-30T19:56:56Z

Thrift docs are very helpful! There are a lot of assumed semantic in annotations. How strict Zipkin collectors typically are? How many special cases there are in Zipkin UI for cases when one span has two CS annotations (or something like this)? If Application Insights storage implementation will reject bad formed spans - will it still cover majority of implementations or you'd recommend to not enforce anything?

codefromthecrypt · 2017-06-30T21:02:36Z

Thrift docs are very helpful! There are a lot of assumed semantic in annotations. How strict Zipkin collectors typically are? How many special cases there are in Zipkin UI for cases when one span has two CS annotations (or something like this)? If Application Insights storage implementation will reject bad formed spans - will it still cover majority of implementations or you'd recommend to not enforce anything?

Collectors will drop malformed (badly encoded data like required fields absent), but errors such as multiple "cs" are not dropped at the moment. Luckily these are indeed errors and instrumentation can be asked to fix them. Seeing that there were multiple "cs" sent help folks fix their instrumentation. Not sure if that answers your question.

SergeyKanzhelev · 2017-07-03T16:08:14Z

Zipkin 2 schema is very similar to Application Insights request and dependency data types. Interesting discussion there as well. It also refers to census spans converter: https://github.com/GoogleCloudPlatform/stackdriver-zipkin/blob/master/translation/src/main/java/com/google/cloud/trace/zipkin/translation/SpanTranslator.java Ideally Application Insights converter should use the same logic to detect the span kind as census uses

SergeyKanzhelev · 2017-07-03T20:43:09Z

@praveenbarli looking at how to map Application Insights data back to Zipkin span - came up with this query so far... Still thinking how to map the rest of possible properties:

//https://github.com/openzipkin/zipkin-api/blob/master/thrift/zipkinCore.thrift
requests 
| project 
    span_trace_id = operation_Id,
    span_name = name,
    span_id = id,
    span_parentId = operation_ParentId,
    span_timestamp = timestamp,
    span_duration = duration,
    annotations_sr_host_servcie_name = cloud_RoleName,
    annotations_ss_host_servcie_name = cloud_RoleName,
    binary_annotations_http_url = url,
    binary_annotations_http_status_code = resultCode,
    binary_annotations_ca_host_ip = client_IP,
    binary_annotations_ca_host_service_name = iff(isnull(source), operation_SyntheticSource, source),
    binary_annotations_extraStringValues = customDimensions,
    binary_annotations_extraDoubleValues = customMeasurements
    //success,
    //session_Id ,
    //user_Id ,
    //user_AuthenticatedId ,
    //user_AccountId ,
    //application_Version , 
    //client_Model ,
    //client_OS ,
    //client_Browser ,
    //cloud_RoleInstance ,
    //appName ,
    //sdkVersion ,
    //itemCount  
    // no intention to expose in Zipkin:
    //itemId ,
    //itemType ,
    //performanceBucket ,
    //operation_Name ,
    //client_Type ,
    //client_City ,
    //client_StateOrProvince ,
    //client_CountryOrRegion ,
    //appId ,
    //iKey
| take 10

You can use a similar query instead of ones you are using now.

praveenbarli · 2017-07-04T00:00:11Z

@SergeyKanzhelev sure. Once, I am able to write all these props I will start using this read query.
One more question - I see in test cases writing Spans with neither cs/cr (dependencies) or sr/ss(requests). Which table would those go?

SergeyKanzhelev · 2017-07-04T04:51:11Z

@praveenbarli use dependency for the local ones.

SergeyKanzhelev mentioned this issue Jun 21, 2017

Add storage support (ex table service or docdb) #8

Open

codefromthecrypt mentioned this issue Jul 4, 2017

Implementation of span2 format openzipkin/zipkin#1644

Closed

22 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zipkin to Application Insights module #33

Zipkin to Application Insights module #33

SergeyKanzhelev commented Jun 21, 2017

aliostad commented Jun 28, 2017 •

edited

Loading

codefromthecrypt commented Jun 28, 2017

aliostad commented Jun 28, 2017

aliostad commented Jun 28, 2017

codefromthecrypt commented Jun 28, 2017 via email

SergeyKanzhelev commented Jun 28, 2017

SergeyKanzhelev commented Jun 30, 2017

codefromthecrypt commented Jun 30, 2017 via email

SergeyKanzhelev commented Jun 30, 2017

codefromthecrypt commented Jun 30, 2017 via email

SergeyKanzhelev commented Jul 3, 2017

SergeyKanzhelev commented Jul 3, 2017

praveenbarli commented Jul 4, 2017 •

edited

Loading

SergeyKanzhelev commented Jul 4, 2017

Zipkin to Application Insights module #33

Zipkin to Application Insights module #33

Comments

SergeyKanzhelev commented Jun 21, 2017

aliostad commented Jun 28, 2017 • edited Loading

codefromthecrypt commented Jun 28, 2017

aliostad commented Jun 28, 2017

aliostad commented Jun 28, 2017

codefromthecrypt commented Jun 28, 2017 via email

SergeyKanzhelev commented Jun 28, 2017

SergeyKanzhelev commented Jun 30, 2017

codefromthecrypt commented Jun 30, 2017 via email

SergeyKanzhelev commented Jun 30, 2017

codefromthecrypt commented Jun 30, 2017 via email

SergeyKanzhelev commented Jul 3, 2017

SergeyKanzhelev commented Jul 3, 2017

praveenbarli commented Jul 4, 2017 • edited Loading

SergeyKanzhelev commented Jul 4, 2017

aliostad commented Jun 28, 2017 •

edited

Loading

praveenbarli commented Jul 4, 2017 •

edited

Loading