Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow dumping of computed metrics per-kernel for further analysis #163

Open
skyreflectedinmirrors opened this issue Aug 17, 2023 · 0 comments
Labels
enhancement New feature or request

Comments

@skyreflectedinmirrors
Copy link
Contributor

Is your feature request related to a problem? Please describe.

This is a proposed extension of the current --save-dfs mechanism.
Essentially, today when using --save-dfs, Omniperf will compute the metrics, apply the min/max/avg, etc. aggregations, and then save that to a file.

In some cases, (e.g., plotting, further data analysis, etc.) it's more useful to be able to get each of the metrics per kernel launch, for instance, so that one could suck in the data-frame and do a kNN (or whatever) to look for correlations of kernel runtime w/ metrics "outliers", or to plot the metrics over multiple invocations, etc.

Describe the solution you'd like

Provide a mode to allow computation of the metrics on each dispatch, and save that result to a file.
This should allow filtering of dispatches, and blocks, as normal, i.e., it only skips the min/max/avg computation steps.

Describe alternatives you've considered

One can walk through each dispatch and use --dispatch <X> to filter the dataframe to just that dispatch, and dump like 8000 different data-frams, but that is... quite slow, particularly with the current metric parsing overheads.

Additional context

None

@skyreflectedinmirrors skyreflectedinmirrors added the enhancement New feature or request label Aug 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant