Improve performance reports #81

AlexWaygood · 2023-05-02T15:01:29Z

Differences less than 10 seconds seem to happen quite regularly just due to random variations in GitHub Actions, so don't report performance changes if the difference is less than 10 seconds. We're only really interested in major improvements or regressions here (the variations will be too big for anything more precise), and major improvements or regressions will generally manifest themselves in differences >10 seconds.
Clarify that it's mypy that got faster/slower, not the package we're running mypy on :-)

hauntsaninja · 2023-05-03T06:08:49Z

Thanks, this looks good! Agreed that we're only going to be able to detect really large shifts. That said, I'd like to hold off a week or two before merging, I'm finding it interesting to see how much noise there is.

AlexWaygood · 2023-06-13T09:40:56Z

This has some big merge conflicts now -- are you still interested in this/should I rebase? :)

hauntsaninja

I'm not sure, what do you think?

Biased pro: I liked that all the primer messages on python/mypy#15128 showed a speed up. These were all less than 10s wins (but I think they were real because results were consistent on different commits in the PR).

Con: I've seen some noise specifically on the vision project. I don't remember a lot of other noise, but I've also been a little short of time so haven't been looking too closely.

JelleZijlstra · 2023-06-14T03:53:47Z

I feel like I see noisy output from the performance report fairly regularly and I basically trained myself to ignore it.

AlexWaygood · 2023-06-14T12:00:23Z

Yeah, I agree that there have been PRs where we can be confident that <10s speedups were real, because they were very consistent in the mypy_primer reports. However I feel like I've seen more PRs with random noise from the performance reports, and I think those can be pretty confusing to new contributors (especially at typeshed, where it's pretty unlikely that something will have a significant performance impact unless somebody's adding a massive Union somewhere).

So I guess I can see it both ways? Maybe the ideal would be if the artifact contained a complete performance report, but the comment posted on PRs filtered out any differences <10s. That would be more complex to implement, of course.

Whatever the case, I do think it would be good to clarify that it's the typechecking of these projects that's getting faster/slower — not sure the wording is ideal at the moment 😄

Akuli · 2023-06-14T19:11:53Z

I think those can be pretty confusing to new contributors (especially at typeshed, where it's pretty unlikely that something will have a significant performance impact

IMO we should disable performance reports in typeshed, maybe with a command-line flag to mypy_primer? We don't really care if one package becomes slightly slower, as long as it is reasonably fast overall. This is different in mypy, because changes in mypy affect all/most packages, and also pile up unlike improvements to different packages.

unless somebody's adding a massive Union somewhere).

Mypy handles big unions reasonably fast in most practical cases. There is a fast path that skips the bottleneck, and it has been expanded to cover pretty much anything that comes up in practice.

AlexWaygood · 2023-06-14T19:28:57Z

Mypy handles big unions reasonably fast in most practical cases. There is a fast path that skips the bottleneck, and it has been expanded to cover pretty much anything that comes up in practice.

Hmm, well, every typeshed sync over at mypy at the moment, we religiously cherry-pick a commit to revert a change I made to typeshed a while back that allegedly caused a large performance regression for mypy due to the use of a big union: https://github.com/python/mypy/blob/e14cddbe31c9437502acef022dca376a25d0b50d/misc/sync-typeshed.py#L183

See e.g. the third commit in python/mypy#15165

If mypy really has no performance problems with big unions, we should probably stop doing that :D

hauntsaninja · 2023-06-14T20:09:11Z

Consensus seems it's too noisy, so let's up the threshold to 10s (like this PR does), and see how we feel about noise?

I've definitely caused huge perf regressions with typeshed changes, so I think there's value to it in typeshed if the number can be believable.

I agree that it makes sense to target wording at infrequent contributors, maybe something like "in a single noisy sample, checking was 1.14x slower"

re the sum literal change: Could be worth trying out again? I've lost track of what it was trying to solve though and I haven't yet spotted what is undesirable in mypy's current behaviour

Akuli · 2023-06-14T20:10:33Z

Apparently I was wrong about big unions :)

They used to be really broken. Specifically, there was a nested for loop that did n^2 iterations for a union of n members, so with some AWS related stubs with a union of 200 strings, it took a few minutes to type-check a very short file. But I didn't realize this isn't the only union slowness bug.

hauntsaninja · 2023-06-14T20:12:15Z

Yeah, I improved two more union perf things in mypy recently, so it might be better now

AlexWaygood · 2023-06-14T20:26:34Z

Cool I'll update this PR based on the consensus here!

hauntsaninja reviewed Jun 14, 2023

View reviewed changes

Rebase and address review

93f6b77

AlexWaygood force-pushed the patch-1 branch from 81739a7 to 93f6b77 Compare June 15, 2023 08:27

AlexWaygood requested a review from hauntsaninja June 15, 2023 08:27

hauntsaninja merged commit d99313f into hauntsaninja:master Jun 16, 2023

AlexWaygood deleted the patch-1 branch June 16, 2023 20:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance reports #81

Improve performance reports #81

AlexWaygood commented May 2, 2023

hauntsaninja commented May 3, 2023

AlexWaygood commented Jun 13, 2023 •

edited

Loading

hauntsaninja left a comment

JelleZijlstra commented Jun 14, 2023

AlexWaygood commented Jun 14, 2023

Akuli commented Jun 14, 2023

AlexWaygood commented Jun 14, 2023 •

edited

Loading

hauntsaninja commented Jun 14, 2023

Akuli commented Jun 14, 2023

hauntsaninja commented Jun 14, 2023 •

edited

Loading

AlexWaygood commented Jun 14, 2023

Improve performance reports #81

Improve performance reports #81

Conversation

AlexWaygood commented May 2, 2023

hauntsaninja commented May 3, 2023

AlexWaygood commented Jun 13, 2023 • edited Loading

hauntsaninja left a comment

Choose a reason for hiding this comment

JelleZijlstra commented Jun 14, 2023

AlexWaygood commented Jun 14, 2023

Akuli commented Jun 14, 2023

AlexWaygood commented Jun 14, 2023 • edited Loading

hauntsaninja commented Jun 14, 2023

Akuli commented Jun 14, 2023

hauntsaninja commented Jun 14, 2023 • edited Loading

AlexWaygood commented Jun 14, 2023

AlexWaygood commented Jun 13, 2023 •

edited

Loading

AlexWaygood commented Jun 14, 2023 •

edited

Loading

hauntsaninja commented Jun 14, 2023 •

edited

Loading