
#15979: Switch to google benchmark for pgm dispatch tests #16547

Merged: 1 commit merged into main from jbauman/googlebenchmark2 on Jan 15, 2025

Conversation

@jbaumanTT (Contributor) commented on Jan 9, 2025

Ticket

#15979

Problem description

The current output format works well with a small number of tests, but with a large number of tests it's hard to connect each resulting number to the test that produced it. In particular, this hinders storing test results in a database and comparing them across runs.

What's changed

Switch to using the Google Benchmark framework, which outputs a JSON file containing the results of all tests. By default, running the binary runs all the benchmarks from sweep_pgm_dispatch.sh. The set of tests to run can be filtered using the `--benchmark_filter=<regex>` command-line argument.

One-off test cases can still be run by passing `--custom` followed by the same command-line arguments as before; see the examples below.
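
For example (a sketch, using the binary path given later in this thread; the filter regex is illustrative, and actual benchmark names come from the sweep definitions):

build/test/tt_metal/perf_microbenchmark/dispatch/test_pgm_dispatch
build/test/tt_metal/perf_microbenchmark/dispatch/test_pgm_dispatch --benchmark_filter='.*dispatch.*'
build/test/tt_metal/perf_microbenchmark/dispatch/test_pgm_dispatch --custom <original arguments>

The first command runs every benchmark in the sweep; the second runs only the benchmarks whose names match the regex; the third runs a single case with the pre-existing argument style.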

Checklist

  • Post commit CI passes
  • Blackhole Post commit (if applicable)
  • Model regression CI testing passes (if applicable)
  • Device performance regression CI testing passes (if applicable)
  • (For models and ops writers) Full new models tests pass
  • New/Existing tests provide coverage for changes

@github-actions (bot) left a comment:

⚠️ Clang-Tidy found issue(s) with the introduced code (1/1)

jbaumanTT added a commit that referenced this pull request Jan 9, 2025
@jbaumanTT force-pushed the jbauman/googlebenchmark2 branch from 43eafb6 to c1c334e on January 9, 2025 at 07:05
@jbaumanTT marked this pull request as ready for review on January 9, 2025 at 17:14
@pgkeller (Contributor) left a comment:


I use filt_pgm_dispatch for the bw_and_latency test as well, so don't delete that.
I still want to dump these results to a spreadsheet; what does that flow look like?
Otherwise looks good

jbaumanTT added a commit that referenced this pull request Jan 9, 2025
@jbaumanTT force-pushed the jbauman/googlebenchmark2 branch from c1c334e to a754edf on January 9, 2025 at 19:24
jbaumanTT added a commit that referenced this pull request Jan 9, 2025
@jbaumanTT force-pushed the jbauman/googlebenchmark2 branch from a754edf to 99a4fc7 on January 9, 2025 at 19:29
@jbaumanTT (Contributor, Author) replied, quoting the review:

I use filt_pgm_dispatch for the bw_and_latency test as well, so don't delete that. I still want to dump these results to a spreadsheet; what does that flow look like? Otherwise looks good

OK, I've added back filt_pgm_dispatch.pl. I've also added a new json_to_csv.py that can be used to dump the results to a CSV (one line per test). The workflow is to run:

build/test/tt_metal/perf_microbenchmark/dispatch/test_pgm_dispatch --benchmark_out_format=json --benchmark_out=bench.json
tests/tt_metal/tt_metal/perf_microbenchmark/dispatch/json_to_csv.py bench.json > bench.csv

The Google Benchmark framework also has a native way to output CSVs (`--benchmark_format=csv`), but it's deprecated and requires some extra work.

This patch also uploads the JSON file from the CI bots, so it's easy to download that and convert it.
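
For reference, here is a minimal sketch of what such a conversion can look like. This is not the actual json_to_csv.py; it only assumes Google Benchmark's standard JSON layout, where a top-level "benchmarks" array holds entries with "name", "real_time", and "time_unit" fields:

import csv
import json
import sys

def main(path):
    # Load the file produced by --benchmark_out=... --benchmark_out_format=json
    with open(path) as f:
        data = json.load(f)
    writer = csv.writer(sys.stdout)
    writer.writerow(["name", "real_time", "time_unit"])
    # Emit one CSV row per benchmark entry
    for bench in data.get("benchmarks", []):
        writer.writerow([bench["name"], bench["real_time"], bench["time_unit"]])

if __name__ == "__main__":
    main(sys.argv[1])

Invoked as in the commands above (script.py bench.json > bench.csv), this yields one row per test, which maps cleanly onto a spreadsheet.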

jbaumanTT added a commit that referenced this pull request Jan 10, 2025
@jbaumanTT force-pushed the jbauman/googlebenchmark2 branch from 99a4fc7 to 61c15a4 on January 10, 2025 at 17:40
jbaumanTT added a commit that referenced this pull request Jan 10, 2025
@jbaumanTT force-pushed the jbauman/googlebenchmark2 branch from 61c15a4 to 36ae490 on January 10, 2025 at 18:04
@tt-rkim (Collaborator) commented on Jan 14, 2025

@TT-billteng LOOK SEE LOOK SEE
GOOGLE BENCHMARK!!!

@jbaumanTT force-pushed the jbauman/googlebenchmark2 branch from 36ae490 to 0c88372 on January 14, 2025 at 16:34
@jbaumanTT merged commit 5017de3 into main on Jan 15, 2025
13 checks passed
@jbaumanTT deleted the jbauman/googlebenchmark2 branch on January 15, 2025 at 18:35