-
Notifications
You must be signed in to change notification settings - Fork 29.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tracing working group #671
tracing working group #671
Comments
quick question, is tracing part of the solution to a larger problem or is it totally separate? could tracing and dtrace support and systemtap support be classified as part of a larger effort the WG would be solving? |
also @thlorenz who is working on some related stuff. |
I think they are all related. At the time of our previous discussion, AsyncListener was being removed from core. Now we have async_wrap, so I think we should reconvene to discuss what we can do with that to make tracing more pleasant. DTrace and SystemTap are just other destinations for trace data, so I think that could be part of the discussion. The topic of the working group could potentially be a bit broader, reaching into other debugging issues, like what to do with domains. |
@Qard IMO the working group should discuss what tracing functionality is more generally wanted, and then the minimalist hooks to allow that functionality to live in user-land should be implemented here. There is a never ending list of features people want, and adding/maintaining all those is the wrong decision. Extending the hooks that currently live in async_wrap sounds great. I'd love to get more feedback on what more is needed. |
I'm working on outreach right now to get wide input in to the roadmap. @trevnorris can you think of a good question to ask individuals and companies that could inform what they need from tracings? Maybe something like "what do you wish you knew about a running Node application that you don't know now?" |
In our previous meeting, we came to a rough consensus on what our problems were as APM providers. Knowing more about what the customers want to see would certainly be valuable too. I agree that a lot of the applications of this would likely be userland stuff, but we can figure out what that is when we get to it. |
@mikeal Would it be possible for devs to provide pseudo examples of what they want? Meaning, they provide some example code and can explain what it is they want traced. Seeing usage cases, imho, would be the most beneficial. |
@Qard do you have a link to the write up for that last meeting? I'm having trouble finding it, and its pretty relevant here. |
@sam-github Yep, here it is. https://gist.github.com/groundwater/942dad5c0c4cfae21af9 These are the trace meeting notes compiled by @groundwater after our previous meeting, for anyone else that wants to have a look. |
@Qard yes! Definitely still interested |
I am definitely interested as well |
I am most definitely interested, both in following up on the conversation we started last year and in whatever broader working-group discussion that might result from Mikeal's canvassing. |
I feel there are two goals here which are solved differently:
I created an issue to collect info about existing user land tools, some of which are addons that interface with v8 in order to pull out profiling info. I'm mostly focusing on 2) ATM, which can be converters, parsers to consume output of perf, dtrace, system tap, etc. and plugins into tools like debuggers, At the moment we have the following (some rather new): I'm most likely missing lots, so please add some you know of
For those interested, some of us are gathering in So a wide scope to cover here, not sure if it makes sense to have this all in one group or if we should split it into one group for each section as outlined in 1) and 2). |
I see the working group mostly focusing on goal 1, but goal 2 ties into it in many ways. Being able to smoothly correlate trace data across JS and C++ boundaries is one thing that comes to mind. Visibility beyond the nebulous "it went into libuv somewhere, it'll probably call back at some point" would be great. Buffer usage in native modules also seems like something that'd be good to get some visibility into. I'm sure there's plenty of areas you all can think of where integrating with the native side could provide some very valuable data. |
I'm very interested in helping this WG get off the ground. This topic is at the core of the "completeness" story for Node IMO. Whenever we interact with companies shifting from more mature platforms, the lack of insight into what their programs are doing is one of the key areas lacking from Node. It could be part of a larger "debugging" topic, but given that this particular group is so focused on tracing and have all been working on this area I think a tracing-focused WG would be a good start. How about y'all make sure that you've collected all of the relevant people with an interest in this area and a history of actually tackling this problem and then we'll try and find a time for you to have a kick-off Hangout to discuss how best to proceed. |
I would like to see the greater debugging issue getting tackled, but I think we should focus on one thing at a time. Given most of us have a background in APM, I think tracing is the obvious first topic. Also, do we have any connections to V8 core people working on the profiling tools? It would be good to coordinate with them. Perhaps @domenic can help with that? |
I believe most of the debugger and profiler work was done by non-Googlers. If you restrict it to just people from the V8 team, Yang is probably the most active in that area. Apropos the tracing WG, sign me on. StrongLoop would be most interested in (async) tracing and getting more metrics out of io.js and libuv (and V8, but that's a separate story.) |
I think a WG that doesn't have a strong...StrongLoop presence would be incomplete, because @sam-github, @rmg, and @piscisaureus have all done significant work in this area, in addition to you, @bnoordhuis. I also think focusing on APM / production performance analysis is a sensible move. "Debugging" is a huge and broad area. |
The consensus from our meeting was roughly "we want to collect arbitrary data, at arbitrary places" which is probably too vague to anyone who did not attend. I would suggest we come up with a dozen well-defined user stories (sorry for the product manager speak) before we all jump into solution land. At a high level, I think the stories should cover at least the following
Other than that, we probably want to discuss constraints the solution must meet
|
So we have clear interest in being involved in a WG expressed by myself, along with: @brycebaril If anyone else commenting here wants in, please say so. Thanks @rvagg for the offer to help get this going. We should figure out how this fits into the Hangouts calendar and get a doodle started. |
@Qard please add me to this list as I'm also interested in being involved. |
This may be of interest to @AndreasMadsen as well, given the existence of @AndreasMadsen/trace and the work he's doing with |
If you guys have specific things you're interested in I can try to reach out. @paulirish might also be a good contact as he's working on dev tools in specific. One thing I'd personally be interested in seeing out of this group is some idea of what hooks into the VM or event loop or runtime environment are necessary for this kind of work. Then maybe we can standardize those and put them in V8 and in browsers after io.js proves them out in the real world. Cf. node-forward/discussions#28. So basically when defining the solution and "polyfilling" it in io.js, give some thought to how generalizable it would be. |
We in @nearform have been dabbling quite a lot with LTTNG and loving it. We have a fork of io.js on the go which adds LTTNG tracepoints in a similar manner to dtrace and ETW: https://github.com/nearform/io.js/tree/tracing. Can we be included in this working group? |
@dberesford I don't see why not. I suspect the focus will be more on dynamic tracepoints rather than static ones, though. If you want to, the LTTNG support can (with a bit of rework) land upstream. |
@bnoordhuis that would be great, we'll get a PR ready for review |
Preliminary feedback from companies I've conducted in the past surfaced a strong need for "Linux debugging/tracing" so I'll volunteer to do any grunt work here to get the group off the ground (schedule first meeting, arrange agenda, write the charter) but in the first meeting I'll probably call out someone to take on the role of facilitator moving forward. Here's a doodle for the first meeting, scheduled for this Wednesday/Thursday. http://doodle.com/x53auvrtffeia2in For building the initial agenda, please propose topics here and I'll put together a list. |
@natduca, re:
I can't find any sign of trace_event.h in https://chromium.googlesource.com/v8/v8 ... am I looking in the wrong place, or are you describing speculative future work? |
Found it linked from https://codereview.chromium.org/827993003 |
Yeah, this patch is stalled because the author has been busy. But its still coming. :) |
@natduca can you comment on the differences and similarities between the Timeline view in Dev Tools, and the |
@sam-github Timeline in DevTools shows sampled data |
@sam-github @thlorenz as in all big software, there's a lot of nuance because we're in between the old thing that is deprecated and the new thing that does't work quite right. :) Timeline view in devtools is almost completely based on chrome://tracing data these days. The corner cases are cpuprofile but we're working on moving that over completely. Same for network panel. The future we're working toward is that all performance data you see in devtools came through the tracing data stream. But, the UI are different: devtools ui is focused on ease of use for a web developer, whereas chrome://tracing is just the raw "good enough for chrome hackers" view. Devtools' timeline won't move to that chrome://tracing ui, though it may pick up ideas from that UI, or vice versa. |
Pretty close, thorsten.
There's at least there UIs here:
Once the above commit lands, all of these will be powered by the trace event data stream (v8 sampling and network being the remaining parts). And after that all 3 UIs will allow import/export of |
@bnoordhuis @sam-github @othiym23 @wraithan @groundwater @brycebaril @trevnorris
We should revisit the tracing situation. Perhaps a working group is in order? Some of you have moved on from APM since our last discussion, some were not present, but your input is valuable.
The text was updated successfully, but these errors were encountered: