
Add Regex perf tests from industry benchmarks #2125

Merged 4 commits on Nov 17, 2021

Conversation

stephentoub
@danmoseley left a comment


LGTM assuming the run time is acceptable to the perf folks.

How long does all this take to run locally?

@stephentoub

> BTW, I tried recompressing with maximum-compression gzip, and it only saves about 10%, FWIW.

I generated these locally with `new GZipStream(..., CompressionLevel.SmallestSize)`.

@stephentoub

> How long does all this take to run locally?

A while. It's ~200 benchmarks per platform target; e.g., when I run them comparing main vs. a PR, that's ~400 benchmarks to run.

@danmoseley

> I generated these locally with `new GZipStream(..., CompressionLevel.SmallestSize)`.

I used 7zip with gzip selected and compression level "Ultra": 3200.txt goes from 6.21 MB to 5.93 MB. The 7z and bz2 formats get it to about 4.9 MB, but of course we can't read those.
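For reference, a minimal sketch of the maximum-effort gzip recompression being compared here. Python's `gzip` module at `compresslevel=9` plays the role of .NET's `GZipStream` with `CompressionLevel.SmallestSize`; the repetitive sample payload is illustrative, not the actual benchmark corpus:

```python
import gzip

# Sketch of maximum-effort gzip compression, analogous to
# new GZipStream(..., CompressionLevel.SmallestSize) in .NET.
# compresslevel=9 is gzip's slowest/smallest setting.
data = b"the quick brown fox jumps over the lazy dog " * 1000

compressed = gzip.compress(data, compresslevel=9)
print(f"{len(data)} bytes -> {len(compressed)} bytes")

# Round-trip to confirm the payload is intact.
assert gzip.decompress(compressed) == data
```

As the numbers above suggest, going from the default level to the maximum typically buys only a modest size reduction on text corpora; the format itself (deflate vs. 7z/bz2) matters more.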

@danmoseley

> A while. It's ~200 benchmarks per platform target; e.g., when I run them comparing main vs. a PR, that's ~400 benchmarks to run.

@DrewScoggins how do you feel about run time here? @kunalspathak if we add 200 more scenarios, does that significantly affect triage (e.g., if we regress 50 of them together)?

@kunalspathak

> @DrewScoggins how do you feel about run time here?

Something @DrewScoggins would know exactly, but we have fewer arm64 machines and are currently backlogged with the existing benchmarks. That said, I don't think we should stop ourselves from adding more benchmarks; I think we should just increase the machine capacity.

> @kunalspathak if we add 200 more scenarios, does that significantly affect triage (e.g., if we regress 50 of them together)?

It depends on how flaky these are. Eventually, when we improve the noise-filtering logic, it shouldn't be a problem.

@DrewScoggins

> @DrewScoggins how do you feel about run time here?
>
> Something @DrewScoggins would know exactly, but we have fewer arm64 machines and are currently backlogged with the existing benchmarks. That said, I don't think we should stop ourselves from adding more benchmarks; I think we should just increase the machine capacity.

Believe me, I would also love to get more machines! In the meantime, I am running the tests locally to get an idea of the total time they will add (~26 minutes). As @kunalspathak said, the only place where we are really resource-constrained is the Arm64 machines, but I don't believe this will be prohibitive, and if it is we can revisit which tests we run on Arm64.

> @kunalspathak if we add 200 more scenarios, does that significantly affect triage (e.g., if we regress 50 of them together)?
>
> It depends on how flaky these are. Eventually, when we improve the noise-filtering logic, it shouldn't be a problem.

Having a bunch regress all at once from a product change will not be a big deal. We can already handle that scenario.

@danmoseley

Thanks @DrewScoggins. Also, after this is in, is it possible to get a 6.0 (and ideally 5.0) baseline number that shows up in the graphs, or does that happen automatically?

@DrewScoggins

If we want baseline numbers for 5.0 and 6.0, we will need to backport these tests. That is easy for 6.0: just make a PR to the release/6.0 branch and it will get picked up. For 5.0 it will require more work, as the release-branch system we have now didn't exist when we forked for 5.0. I have filed #2129 to track that work. In the meantime, maybe we could do a one-off run on some lab machines to get comparison numbers for 5.0 vs. 6.0 vs. today for these tests?


@adamsitnik left a comment


LGTM, thank you @stephentoub

Now let's make sure that .NET is the best for all of the test cases ;)
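As an illustration of what these industry benchmarks measure, the `Perf_Regex_Industry_Leipzig.Count` entries in the logs below count all matches of a pattern over a large text corpus. A minimal sketch using Python's `re` (the real benchmarks use .NET `Regex`, and the tiny corpus here is made up for illustration):

```python
import re

# Sketch of a "count all matches" regex benchmark over a corpus,
# using two of the patterns visible in the CI logs.
corpus = "Tom told Huckleberry that Twain wrote about Tom and the river. Twain, again."

patterns = [
    r"Twain",                             # simple literal
    r"(?i)Tom|Sawyer|Huckleberry|Finn",   # case-insensitive alternation
]

for pattern in patterns:
    count = sum(1 for _ in re.finditer(pattern, corpus))
    print(pattern, count)
```

The real suite runs each pattern against a multi-megabyte corpus (e.g., the Leipzig sentence collection) under several `RegexOptions`, which is why the benchmark count multiplies so quickly.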

@adamsitnik

One of the CI legs failed with a mysterious Python error:

[2021/11/12 02:46:47][INFO] // Found 5 benchmarks:
[2021/11/12 02:46:47][INFO] //   Perf_Regex_Industry_Leipzig.Count: Job-RCHHKQ(PowerPlanMode=00000000-0000-0000-0000-000000000000, Arguments=/p:DebugType=portable,-bl:benchmarkdotnet.binlog, InvocationCount=1, IterationCount=1, IterationTime=250.0000 ms, MaxIterationCount=20, MinIterationCount=15, RunStrategy=ColdStart, UnrollFactor=1, WarmupCount=0) [Pattern=(?i)Tom|Sawyer|Huckleberry|Finn, Options=None]
[2021/11/12 02:46:47][INFO] //   Perf_Regex_Industry_Leipzig.Count: Job-RCHHKQ(PowerPlanMode=00000000-0000-0000-0000-000000000000, Arguments=/p:DebugType=portable,-bl:benchmarkdotnet.binlog, InvocationCount=1, IterationCount=1, IterationTime=250.0000 ms, MaxIterationCount=20, MinIterationCount=15, RunStrategy=ColdStart, UnrollFactor=1, WarmupCount=0) [Pattern=.{2,4}(Tom|Sawyer|Huckleberry|Finn), Options=Compiled]
[2021/11/12 02:46:47][INFO] //   Perf_Regex_Industry_Leipzig.Count: Job-RCHHKQ(PowerPlanMode=00000000-0000-0000-0000-000000000000, Arguments=/p:DebugType=portable,-bl:benchmarkdotnet.binlog, InvocationCount=1, IterationCount=1, IterationTime=250.0000 ms, MaxIterationCount=20, MinIterationCount=15, RunStrategy=ColdStart, UnrollFactor=1, WarmupCount=0) [Pattern=Twain, Options=None]
[2021/11/12 02:46:47][INFO] //   Perf_Regex_Industry_Leipzig.Count: Job-RCHHKQ(PowerPlanMode=00000000-0000-0000-0000-000000000000, Arguments=/p:DebugType=portable,-bl:benchmarkdotnet.binlog, InvocationCount=1, IterationCount=1, IterationTime=250.0000 ms, MaxIterationCount=20, MinIterationCount=15, RunStrategy=ColdStart, UnrollFactor=1, WarmupCount=0) [Pattern=[a-z]shing, Options=Compiled]
[2021/11/12 02:47:34][INFO] $ popd
Traceback (most recent call last):
  File "C:\h\w\A568097A\p\scripts\benchmarks_ci.py", line 250, in <module>
    __main(sys.argv[1:])
  File "C:\h\w\A568097A\p\scripts\benchmarks_ci.py", line 226, in __main
    micro_benchmarks.run(
  File "C:\h\w\A568097A\p\scripts\micro_benchmarks.py", line 310, in run
    BENCHMARKS_CSPROJ.run(
  File "C:\h\w\A568097A\p\scripts\dotnet.py", line 467, in run
    RunCommand(cmdline, verbose=verbose).run(
  File "C:\h\w\A568097A\p\scripts\performance\common.py", line 211, in run
    (returncode, quoted_cmdline) = self.__runinternal(working_directory)
  File "C:\h\w\A568097A\p\scripts\performance\common.py", line 200, in __runinternal
    for line in iter(proc.stdout.readline, ''):
  File "C:\python3.9.1\lib\codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xec in position 334: invalid continuation byte

I'll re-run it.
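The traceback shows the Python harness decoding the benchmark process's stdout as strict UTF-8 and hitting a byte (0xEC) that isn't a valid continuation sequence. A minimal repro, plus a tolerant-decoding workaround (`errors="replace"` here is illustrative; the actual fix that landed in #2108 may differ):

```python
# Minimal repro of the CI failure: a byte stream containing a stray 0xEC
# that is not part of a valid UTF-8 sequence fails strict decoding.
bad_output = b"benchmark line \xec more text"

try:
    bad_output.decode("utf-8")
except UnicodeDecodeError as e:
    print("strict decode failed:", e.reason)

# A tolerant decode keeps the harness alive; invalid bytes become U+FFFD.
print(bad_output.decode("utf-8", errors="replace"))
```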

@adamsitnik

I was unable to repro the CI failure locally:

git clone https://github.com/stephentoub/performance.git && cd performance && git checkout regextests
py .\scripts\benchmarks_ci.py -f net461 --filter *Regex* --bdn-arguments="--iterationCount 1 --warmupCount 0 --invocationCount 1 --unrollFactor 1 --strategy ColdStart --stopOnFirstError true" 

Co-authored-by: Adam Sitnik <adam.sitnik@gmail.com>
@danmoseley

Another failure:

MinIterationCount=15, RunStrategy=ColdStart, UnrollFactor=1, WarmupCount=0) [Pattern=Tom.{10,25}river|river.{10,25}Tom, Options=None]
[2021/11/17 15:39:15][INFO] //   Perf_Regex_Industry_Leipzig.Count: Job-RCHHKQ(PowerPlanMode=00000000-0000-0000-0000-000000000000, Arguments=/p:DebugType=portable,-bl:benchmarkdotnet.binlog, InvocationCount=1, IterationCount=1, IterationTime=250.0000 ms, MaxIterationCount=20, MinIterationCount=15, RunStrategy=ColdStart, UnrollFactor=1, WarmupCount=0) [Pattern=Twain, Options=Compiled]
[2021/11/17 15:40:03][INFO] $ popd
Traceback (most recent call last):
  File "C:\h\w\A6DB097E\p\scripts\benchmarks_ci.py", line 250, in <module>
    __main(sys.argv[1:])
  File "C:\h\w\A6DB097E\p\scripts\benchmarks_ci.py", line 226, in __main
    micro_benchmarks.run(
  File "C:\h\w\A6DB097E\p\scripts\micro_benchmarks.py", line 310, in run
    BENCHMARKS_CSPROJ.run(
  File "C:\h\w\A6DB097E\p\scripts\dotnet.py", line 467, in run
    RunCommand(cmdline, verbose=verbose).run(
  File "C:\h\w\A6DB097E\p\scripts\performance\common.py", line 211, in run
    (returncode, quoted_cmdline) = self.__runinternal(working_directory)
  File "C:\h\w\A6DB097E\p\scripts\performance\common.py", line 200, in __runinternal
    for line in iter(proc.stdout.readline, ''):
  File "C:\python3.9.1\lib\codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xec in position 78: invalid continuation byte

@DrewScoggins have we seen this before?

@danmoseley

BTW, surely if you just run this test on .NET Framework locally and it passes, that's good enough to check in.

@DrewScoggins

Yes, we were seeing failures like this, and it was fixed with #2108.

> BTW, surely if you just run this test on .NET Framework locally and it passes, that's good enough to check in.

Hard to say, because our lab machines are not identical to the CI machines. We have had situations where something works on local machines but fails on the CI VMs.
