[dotnet-trace] Add option to stop tracing on trigger #3125

steveisok · 2022-06-15T15:59:46Z

In order to better support tracing in mobile scenarios, we would like the ability to stop a trace based on a trigger. For example, we are using dotnet-trace to profile an Android application at startup. It would be helpful if we could stop a trace based on a common method name.

An example:

dotnet-trace /trigger:MethodName=<method-name:>

steveisok · 2022-06-15T16:00:30Z

@tommcdon Can you add the enhancement label? I don't appear to be able to.

tommcdon · 2022-06-15T17:02:11Z

> @tommcdon Can you add the enhancement label? I don't appear to be able to.

Added

samsp-msft · 2023-02-27T23:15:59Z

Q: Should there be an API in System.Diagnostics that lets the app control tracing? Go has you call profiling APIs explicitly as part of the app code to start and stop tracing.

We could have an API to:

- Start tracing if a tracer is attached, and block if none is attached
- Start tracing and not block
- Stop tracing
- Write a signpost event into the trace. When debugging an issue, or trying to understand performance, it can be useful to write simple events that act as signposts during analysis, without having to go to the effort of declaring a custom event and making sure it's collected. It would be the tracing equivalent of System.Diagnostics.Debug.WriteLine.

steveisok · 2023-02-28T21:37:04Z

/cc @lateralusX

lateralusX · 2023-08-29T09:53:57Z

The first step will be to add support similar to the stop trigger added to dotnet-monitor. The original idea when implementing that was to share/port the code into dotnet-trace as well, so we end up with similar functionality. An alternative is to do something along the lines of PerfView's ability to start/stop a trace based on custom event triggers, but since we already implemented something similar in dotnet-monitor, I believe it makes more sense to align with that implementation (or to offer multiple options).

One challenge with this implementation is keeping event streaming running with the least possible impact, since degraded read performance could lead to dropped events. A side effect of this is that once the trigger has been detected and the session has been stopped, the .nettrace file still needs to include rundown events, so there will always be an unknown number of events past the stop trigger that might end up in the .nettrace file. This could be an issue if the .nettrace file is used together with tools like dotnet-pgo, since it might lead to more methods getting AOT:ed than anticipated. To mitigate that scenario and add support for a number of new scenarios, dotnet/runtime#89853 was implemented, giving the user the ability to pass the same method used as the stop trigger as the end filter to dotnet-pgo. The new filter capabilities in dotnet-pgo, together with the ability to merge mibc files and the ability of the Mono cross compilers to accept multiple mibc files, make it possible to control in detail which methods end up AOT:ed in the profiled AOT use case.

If we are going to use the same capabilities as dotnet-monitor in dotnet-trace's stop trigger implementation, the following PR references the implementation of the stop trigger in dotnet-monitor: dotnet/dotnet-monitor#2557.

Fixes #3125

This PR provides users another method to stop a dotnet-trace via a stopping event similar to that of dotnet-monitor, originally implemented in dotnet/dotnet-monitor#2557.

Three arguments are added to the `dotnet-trace collect` command to specify a stopping event:

| Argument | Description |
|----------|----------|
| `--stopping-event-provider-name` | A string, parsed as-is, that will stop the trace upon hitting an event with the matching provider name. For a more specific stopping event, additionally provide `--stopping-event-event-name` and/or `--stopping-event-payload-filter`. |
| `--stopping-event-event-name` | A string, parsed as-is, that will stop the trace upon hitting an event with the matching event name. Requires `--stopping-event-provider-name` to be set. For a more specific stopping event, additionally provide `--stopping-event-payload-filter`. |
| `--stopping-event-payload-filter` | A string, parsed as [payload_field_name]:[payload_field_value] pairs separated by commas, that will stop the trace upon hitting an event with a matching payload. Requires `--stopping-event-provider-name` and `--stopping-event-event-name` to be set. |

Note: Though technically `--stopping-event-payload-filter` can be set without needing a `--stopping-event-event-name`, this may lead to mismatched payloads should another `TraceEvent` under the same provider not have that particular payload field name. Until there is a good reason to stop a trace given a payload filter regardless of the event name, we require `--stopping-event-event-name` to be set whenever `--stopping-event-payload-filter` is provided.

To stop a trace at a particular event, dotnet-monitor's [approach](https://github.com/dotnet/dotnet-monitor/blob/0820b6911f3ac47b6b5ec867ac906699e5c15787/src/Tools/dotnet-monitor/Trace/TraceUntilEventOperation.cs#L47) using an [EventMonitor](https://github.com/dotnet/diagnostics/blob/main/src/Microsoft.Diagnostics.Monitoring.EventPipe/EventMonitor.cs) is adopted. Upon hitting a TraceEvent with a matching ProviderName, EventName (if specified), and PayloadFilter (if specified), we trigger dotnet-trace's fallback logic to stop the EventPipeSession before the EventStream ends.

Note: As the EventStream is being parsed asynchronously, there will be some events that pass through between the time a trace event matching the specified stopping event arguments is parsed and the EventPipeSession is stopped.

In addition, this PR modifies `EventMonitor` to use the `ClrTraceEventParser` to parse `TraceEvent` objects under the `Microsoft-Windows-DotNETRuntime` provider, and the `DynamicTraceEventParser` otherwise. The `ClrTraceEventParser` is generated to understand the ETW event manifest for `Microsoft-Windows-DotNETRuntime` events. The previous implementation defaulting to `DynamicTraceEventParser` would not work on non-Windows platforms such as OSX which could not parse the payload to populate `PayloadNames` and `PayloadValue(i)` because there was no manifest available. On the other hand, Windows is able to locate manifest data for known events through the OS.

-------------------

## Testing

With an Android App

``` C#
private static void PrintA()
{
    Console.WriteLine("A");
    Thread.Sleep(1000);
}
...
private static void PrintK()
{
    Console.WriteLine("K");
    Thread.Sleep(1000);
}

public static void Main(string[] args)
{
    Console.WriteLine("Hello, Android!"); // logcat
    PrintA();
    PrintB();
    PrintC();
    PrintD();
    PrintE();
    PrintF();
    PrintG();
    PrintH();
    PrintI();
    PrintJ();
    PrintK();

    while (true)
    {
        Thread.Sleep(100);
    }
}
```

Running dotnet-dsrouter to connect the diagnostic tooling with the android device

`./artifacts/bin/dotnet-dsrouter/Debug/net6.0/dotnet-dsrouter android -v debug`

Tracing with a stopping event args provided

`./artifacts/bin/dotnet-trace/Debug/net6.0/dotnet-trace collect -p 16683 --providers Microsoft-Windows-DotNETRuntime:0x1F000080018:5 --stopping-event-provider-name Microsoft-Windows-DotNETRuntime --stopping-event-event-name Method/JittingStarted --stopping-event-payload-filter MethodName:PrintA`

[dotnet-dsrouter_20231024_165648.nettrace.zip](https://github.com/dotnet/diagnostics/files/13147788/dotnet-dsrouter_20231024_165648.nettrace.zip)

There are no `Method/JittingStarted` events with MethodName `PrintC` through `PrintK` in the `.nettrace`, showing that the trace was indeed stopped after seeing the `PrintA` event. The events after `PrintA` are an artifact of the second note above. Below is the JITStats of the .nettrace opened in perfview, showing that the last method was `PrintB`.

<img width="1128" alt="JITStatsPrintA" src="https://github.com/dotnet/diagnostics/assets/16830051/1742baf4-b528-43c3-aef3-b00a576f2fb8">
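The matching rule in the PR description above (provider name first, then the optional event name, then the optional payload filter) can be illustrated with a small self-contained sketch. This is not the code from the PR or from `EventMonitor`: `SampleEvent`, `StoppingEventSketch`, `ParsePayloadFilter`, and `IsStoppingEvent` are hypothetical names, the event is a plain record instead of a real `TraceEvent`, and both the exact string comparison and the split on the first `:` are assumptions of this sketch.

```csharp
#nullable enable
using System;
using System.Collections.Generic;

// Hypothetical stand-in for a parsed trace event; the real tool works with TraceEvent objects.
public sealed record SampleEvent(
    string ProviderName,
    string EventName,
    IReadOnlyDictionary<string, string> Payload);

public static class StoppingEventSketch
{
    // Parses "name1:value1,name2:value2" into field-name/value pairs.
    // Assumption: each pair is split on its first ':' only, so the value may contain ':'.
    public static Dictionary<string, string> ParsePayloadFilter(string filter)
    {
        var pairs = new Dictionary<string, string>();
        if (string.IsNullOrEmpty(filter))
        {
            return pairs;
        }

        foreach (string pair in filter.Split(','))
        {
            string[] parts = pair.Split(':', 2);
            if (parts.Length != 2)
            {
                throw new ArgumentException($"Expected name:value but got '{pair}'.");
            }
            pairs[parts[0]] = parts[1];
        }
        return pairs;
    }

    // Returns true when the event matches the provider name, the event name (if given),
    // and every payload pair (if a filter is given).
    // Assumption: all comparisons are exact, case-sensitive string matches.
    public static bool IsStoppingEvent(
        SampleEvent evt,
        string providerName,
        string? eventName,
        IReadOnlyDictionary<string, string>? payloadFilter)
    {
        if (evt.ProviderName != providerName)
        {
            return false;
        }
        if (eventName != null && evt.EventName != eventName)
        {
            return false;
        }
        if (payloadFilter != null)
        {
            foreach (KeyValuePair<string, string> expected in payloadFilter)
            {
                // A missing payload field counts as a mismatch.
                if (!evt.Payload.TryGetValue(expected.Key, out string? actual) || actual != expected.Value)
                {
                    return false;
                }
            }
        }
        return true;
    }
}
```

With the arguments from the test run above, `ParsePayloadFilter("MethodName:PrintA")` yields one pair, and `IsStoppingEvent` would accept a `Method/JittingStarted` event from `Microsoft-Windows-DotNETRuntime` whose `MethodName` payload is `PrintA`, while rejecting the same event for `PrintB`. Treating a missing payload field as a mismatch reflects the concern in the first note: an event under the same provider that lacks the field should not stop the trace.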

tommcdon added the enhancement (New feature or request) and dotnet-trace labels Jun 15, 2022

tommcdon added this to the 7.0.0 milestone Jun 15, 2022

tommcdon modified the milestones: 7.0.0, 8.0.0 Sep 12, 2022

mdh1418 mentioned this issue Oct 25, 2023

[tools][trace] Add stopping event options to dotnet-trace #4363

Merged

hoyosjs closed this as completed in #4363 Nov 2, 2023

ghost locked as resolved and limited conversation to collaborators Dec 3, 2023
