
Create superpmi-asmdiffs pipeline #61194

Merged
merged 21 commits, Nov 10, 2021

Conversation

Member

@BruceForstall BruceForstall commented Nov 4, 2021

Create a new runtime-coreclr superpmi-asmdiffs pipeline that runs SuperPMI asmdiffs for every change in the JIT directory.

The diffs are run on two platforms: Windows x64 and Windows x86. Linux, Arm64, and Arm32 asm diffs are generated using cross-compilers, as follows:

| Platform    | Asm diffs                                  |
| ----------- | ------------------------------------------ |
| Windows x64 | win-x64, win-arm64, linux-x64, linux-arm64 |
| Windows x86 | win-x86, linux-arm                         |

The resulting summary .md files are uploaded into the pipeline artifacts, one .md file per platform (so, one for the Windows x64 runs and one for the Windows x86 runs). The results are also displayed in the "Extensions" page of the AzDO pipeline.

It looks like the runs take about 50 minutes to complete (assuming not much waiting for machines).

The asm diffs pipeline is similar to the "superpmi-replay" pipeline, except:

  1. It determines an appropriate baseline JIT based on the PR commit and how it merges with the main branch. Given that, it downloads the matching baseline JITs from the JIT rolling build artifacts in Azure Storage (a minimal sketch of this follows the list).
  2. It clones the jitutils repo and builds the jit-analyze tool, needed to generate the summary .md file.
  3. It downloads and adds to the Helix machine payload a "portable" git installation, as git diff is used by jit-analyze for analyzing the generated .dasm files of the diff.
  4. It collects all the various summary.md files into one per platform on which the runs are done, and publishes that to the artifacts and the Extensions page.
  5. It only does one replay (asmdiffs) run, not one for each of a set of multiple stress modes.
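
For context on step 1, the baseline selection is essentially "find the merge point of the PR with main, then pick a nearby commit that has a JIT rolling build." A minimal sketch of that idea, assuming standard git commands; the function and helper names below are illustrative, not the actual pipeline code:

```python
# Illustrative sketch only (not the pipeline's actual implementation): choose a
# baseline commit from the PR head and the main branch.
import subprocess

def rolling_build_exists(commit_hash):
    # Placeholder: the real pipeline checks the JIT rolling build storage in
    # Azure for artifacts built from this commit.
    return True

def choose_baseline_hash(repo_root, main_branch="origin/main", max_candidates=10):
    def git(*args):
        return subprocess.check_output(["git", "-C", repo_root, *args], text=True).strip()

    # The merge point of the PR commit with main approximates "main without this PR".
    merge_base = git("merge-base", "HEAD", main_branch)

    # Walk back from the merge base and take the first commit that has a
    # matching rolling build JIT.
    for commit in git("rev-list", f"--max-count={max_candidates}", merge_base).splitlines():
        if rolling_build_exists(commit):
            return commit
    return None
```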

As part of this implementation:

  a. azdo_pipelines_util.py was renamed to jitutil.py, and a lot of utility functions from superpmi.py were moved over to it. This was mostly to share the code for downloading and uncompressing .zip files (see the sketch after this list). (There might be some slight changes to the output from the superpmi.py download commands that I'll have to look into.) However, I also moved a bunch of simple, more general helpers, for possible future sharing.
  b. jitrollingbuild.py download can now take no arguments and download a baseline JIT (from the JIT rolling build Azure Storage location) for the current enlistment to the default location. Previously, it required a specific git_hash and target directory. There is similar logic in superpmi.py, but not quite the same.
  c. The superpmi.py --no_progress option was made global, and applied in a few more places.
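
As a rough illustration of the kind of helper consolidated into jitutil.py in (a), here is a minimal download-and-extract sketch; the function name and signature are assumptions, not the actual jitutil.py API:

```python
# Minimal sketch: download a .zip and extract it to a target directory.
# Progress reporting (see the --no_progress discussion) is omitted.
import os
import tempfile
import urllib.request
import zipfile

def download_and_extract_zip(url, target_dir):
    os.makedirs(target_dir, exist_ok=True)
    with tempfile.TemporaryDirectory() as temp_dir:
        zip_path = os.path.join(temp_dir, "download.zip")
        urllib.request.urlretrieve(url, zip_path)
        with zipfile.ZipFile(zip_path) as archive:
            archive.extractall(target_dir)
    return target_dir
```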

Fixes #59445

@dotnet-issue-labeler dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Nov 4, 2021
@ghost

ghost commented Nov 4, 2021

Tagging subscribers to this area: @JulieLeeMSFT
See info in area-owners.md if you want to be subscribed.


`asmdiffs` can download coredistools, for example, so don't
output download progress if the user wants to suppress it.

Also, move the `download` log file to the "uploads" area so it gets uploaded
to the AzDO files.

Temporarily, only do diffs with benchmarks, to speed things up
Add downloading of portable git for use on Helix for jit-analyze.
It is added to the correlation payload that is copied to Helix machines
(currently, we only use Windows Helix machines).
Overall summary.md file should be uploaded as well.
@BruceForstall
Member Author

Sample AzDO run results with forced diffs (loop cloning disabled for the diff compiler), with the summary in the "Extensions" tab:

https://dev.azure.com/dnceng/public/_build/results?buildId=1457493&view=ms.vss-build-web.run-extensions-tab

and the published artifacts contain the "overall summary" Markdown files in the SuperPMI_Logs_<arch>_checked folders:

https://dev.azure.com/dnceng/public/_build/results?buildId=1457493&view=artifacts&pathAsName=false&type=publishedArtifacts

Add a few more comments
@BruceForstall BruceForstall marked this pull request as ready for review November 6, 2021 06:58
@BruceForstall
Member Author

@dotnet/jit-contrib This is ready for review. PTAL.

@BruceForstall
Member Author

Some ideas for improvements:

  • If there are no diffs, make that explicit; don't just show empty "overall_....md" files in the Extensions page.
  • Create summary.md files for other metrics, especially PerfScore. Maybe these should be summarized as one-liners in the Extensions page, but not fully expanded, to keep that page from getting too cluttered. Make them available for download.
  • Improve the headers in the overall summary output, e.g., add per-platform headers.
  • Q: can there be multiple levels of "disclosure" / tree in the output, so we can "expand" to view just one platform at a time, to prevent clutter?
  • Make sure that JIT assertions, if any, are shown or at least reported very prominently in the Extensions output.
  • Figure out why the recent runs aren't getting good data when looking for a JIT baseline (e.g., https://dev.azure.com/dnceng/public/_build/results?buildId=1457968&view=logs&j=011e1ec8-6569-5e69-4f06-baf193d1351e&t=bf6cf4cf-6432-59cf-d384-6b3bcf32ede2)

@AndyAyersMS
Member

The extensions thing is pretty cool; I had no idea that existed.

I find the current format requires a lot of scrolling to fully comprehend -- there is a lot of data and the interesting bits can be buried. It would be nice to have everything in one table and perhaps sorted by largest impact or some such. Doesn't work as well for detail/expand collapse but perhaps we can get to this via internal links or something?

Also (assuming interesting/unexpected diffs appear), is it obvious how to repro exactly what the CI did?

Maybe echo out the repro commands somewhere (including exact base jit version used / exact collections used if we ever start versioning beyond the jit guid)

@BruceForstall
Member Author

I find the current format requires a lot of scrolling to fully comprehend

The example I gave (here) has a lot of diffs, because I disabled loop cloning to force diffs. However, we are generating diffs for 6 platforms. And we propose to also generate PerfScore diff results.

Do you have any suggestion on how to summarize this better? Kunal suggested something like "Impact" table here.

Printing out repro commands makes sense. We can probably specify the precise baseline git hash used, assume the diff JIT is the current tree, and then just specify the correct -arch, -target_arch, -target_os options.
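
A hypothetical sketch of echoing such a repro hint at the end of a CI job; only -arch, -target_arch, and -target_os are options named here, so everything else (script path, formatting) is illustrative:

```python
# Hypothetical: print a repro hint so a developer can re-run the same asm diffs
# locally against the same baseline JIT.
def print_repro_hint(baseline_git_hash, arch, target_arch, target_os):
    print(f"To reproduce, use a baseline JIT built from {baseline_git_hash} and run:")
    print(f"  python superpmi.py asmdiffs -arch {arch} -target_arch {target_arch} -target_os {target_os}")

print_repro_hint("<baseline git hash>", "x64", "arm64", "linux")
```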

Note, however, I don't want to gate this change on creating the "perfect" output.

@AndyAyersMS
Member

Do you have any suggestion on how to summarize this better? Kunal suggested something like "Impact" table here.

Yes, something very much like that would be great.

Note, however, I don't want to gate this change on creating the "perfect" output.

Agreed -- no need to hold this up based on my feedback; it is quite useful even as-is (eg trigger on community PRs that we think should be no diff). As we get mileage on it we will figure out enhancements to make it more useful.

@BruceForstall
Member Author

eg trigger on community PRs that we think should be no diff

Currently it's set to trigger on every PR that touches the JIT directory. I think we should keep that level of checking.

@kunalspathak
Member

One more optimization we spoke about is to skip creating .dasm files altogether and just return the metrics numbers, but it is currently blocked by #52877.

This affects `superpmi.py download` output
@BruceForstall
Member Author

@kunalspathak, others: I updated this change for the feedback.

@BruceForstall
Member Author

Here's the Extensions page result we now generate if there are no diffs:

https://dev.azure.com/dnceng/public/_build/results?buildId=1462505&view=ms.vss-build-web.run-extensions-tab

This run completed successfully, except that the aspnet collection replay has MISSING data, which is reported as an error code.

…script

There will be more changes required if we ever run on non-Windows platforms,
so don't keep these partial measures.
@kunalspathak kunalspathak left a comment
Member

LGTM. Thanks for doing this, can't wait to trigger it on one of my PRs.

@kunalspathak
Member

I don't see the logic to upload other metric summary.md, e.g. PerfScore. Are you planning to do that in a follow-up PR?

@AndyAyersMS
Member

Thanks for doing this, can't wait to trigger it on one of my PRs.

Agree -- thanks for working on this.

👍

@BruceForstall
Member Author

I don't see the logic to upload other metric summary.md, e.g. PerfScore. Are you planning to do that in a follow-up PR?

True, I haven't done that.

I can pass -metrics CodeSize -metrics PerfScore to superpmi.py to add PerfScore to the summary.md. Then, PerfScore and CodeSize will be mixed together in the MD files. IMO, that's ok, even if it potentially creates additional "clutter". However, it disables the recent work to use the "actual" code size instead of just the code size for functions with diffs, which is unfortunate. See #61254 (comment).

Generating the PerfScore metrics separately would require iterating over all the respective MCH directories, invoking jit-analyze specifically for each base/diff pair, and then collecting a summary_PerfScore.md file, which would be separately summarized and uploaded. This is more work than I want to do.

So I think for now I'm not going to add additional metrics to the summary.

@kunalspathak
Member

However, it disables the recent work to use the "actual" code size instead of just the code size for functions with diffs, which is unfortunate.

What do you mean? It will print the PerfScore diff at least, right? It won't be accurate with actual PerfScore of everything combined?

@BruceForstall
Member Author

What do you mean? It will print the PerfScore diff at least, right? It won't be accurate with actual PerfScore of everything combined?

The implementation of the accurate CodeSize doesn't kick in when any metrics are specifically requested:

```python
if self.coreclr_args.metrics:
    command += [ "--metrics", ",".join(self.coreclr_args.metrics) ]
elif base_bytes is not None and diff_bytes is not None:
    command += [ "--override-total-base-metric", str(base_bytes), "--override-total-diff-metric", str(diff_bytes) ]
```

There is no "accurate" total PerfScore number currently, just CodeSize.

@kunalspathak
Member

Can we do something like this in superpmi.py itself? I am mostly interested in PerfScore and am trying to see an easy way to get it in along with this PR.

```python
command = [ jit_analyze_path, "--md", md_summary_file, "-r", "--base", base_asm_location, "--diff", diff_asm_location ]

# Run once for code size, using the accurate totals when available.
m_command = command
if base_bytes is not None and diff_bytes is not None:
    m_command = command + [ "--override-total-base-metric", str(base_bytes), "--override-total-diff-metric", str(diff_bytes) ]
run_and_log(m_command, logging.INFO)

# Run again for the non-CodeSize metrics.
non_code_size_metrics = [m for m in self.coreclr_args.metrics if m != 'CodeSize']
if len(non_code_size_metrics) > 0:
    m_command = command + [ "--metrics", ",".join(non_code_size_metrics) ]
    run_and_log(m_command, logging.INFO)
```

@kunalspathak
Member

By the way, I didn't realize until now that we can pass -metrics to the superpmi.py script.

@BruceForstall
Member Author

Something like that would work, but we'd then have two .md files, or have to force the second jit-analyze to append to the first one.

I think a better solution is to pass "override" data on a per-metric basis. That is, teach jit-analyze to understand --override-total-base-metric <metric>,<value> instead of just --override-total-base-metric <value> (and support multiple overrides, one per metric). Then, superpmi.py could invoke jit-analyze exactly once. And for now, just pass the CodeSize override arguments.

A not-mutually-exclusive alternative is to teach superpmi.py to (optionally?) create one summary.md file for each metric, e.g., summary_<metric>.md, by either invoking jit-analyze multiple times, or teaching jit-analyze to split the results (the latter would presumably be much faster).
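
A sketch of how superpmi.py might build a single jit-analyze invocation under that proposed (not yet existing) --override-total-base-metric <metric>,<value> syntax; all names below are illustrative:

```python
# Sketch of the proposed generalization: per-metric override pairs, one
# jit-analyze invocation covering all requested metrics.
def build_jit_analyze_command(jit_analyze_path, md_summary_file,
                              base_asm_location, diff_asm_location,
                              metrics, base_overrides, diff_overrides):
    command = [ jit_analyze_path, "--md", md_summary_file, "-r",
                "--base", base_asm_location, "--diff", diff_asm_location,
                "--metrics", ",".join(metrics) ]
    # base_overrides/diff_overrides: dict of metric -> accurate total, e.g.
    # { "CodeSize": 123456 }. PerfScore has no accurate total today, so it
    # would simply be absent.
    for metric, value in base_overrides.items():
        command += [ "--override-total-base-metric", f"{metric},{value}" ]
    for metric, value in diff_overrides.items():
        command += [ "--override-total-diff-metric", f"{metric},{value}" ]
    return command
```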

@kunalspathak
Member

or teaching jit-analyze to split the results (the latter would presumably be much faster).

I agree. We can just ask jit-analyze to dump the metrics to <md_filename_from_args>.metrics_name.md.

But then, we need to still change the logic around passing --override-total-base-metric to jit-analyze, right?

@BruceForstall
Member Author

But then, we need to still change the logic around passing --override-total-base-metric to jit-analyze, right?

Yes, or generalize it as I suggest.

@kunalspathak
Member

Yes, or generalize it as I suggest.

Ok, whatever is easy and clean is fine then.
