EPCC OpenMP micro-benchmark suite

A fork of the EPCC OpenMP micro-benchmark suite with some improvements

If you just want the original, you can download it here: https://www.epcc.ed.ac.uk/research/computing/performance-characterisation-and-benchmarking/epcc-openmp-micro-benchmark-suite

Some notes on parameter tuning for `schedbench`

Exactly what is being measured is a function of the parameters. Bad parameters can result in measuring the wrong thing. If you are unsure whether you have isolated scheduling overheads, verify that your results look similar to the results in [1]. In particular, you should check that the overheads for the dynamic schedule decreases as the chunk size goes from 1, to 2, to 4.

Here are some pointers to achieve this:

Avoid a low iteration count

A low iteration count in the parallel loop can lead to measuring parallel for-loop overhead.

Avoid a long delay time

A long delay time can make the inner loops take a while. The time spent on scheduling is orders of magnitude less. Because the benchmark measures overhead as avg_test_time - avg_reference_time, this leads to the scheduling overheads getting "drowned out" by the time spent waiting.

Use one thread per physical core

If your system supports simultaneous multithreading (e.g. hyper-threading), you risk having logical threads competing for resources. In my experiments, the parallel tests took 40% longer than the reference test when using 8 threads on my quad-core Intel Core i7-2600K.

Example parameters

The following parameters gave good results on my system:

itersperthr = 8192 (must be changed in schedbench.c)
--outer-repetitions 50 (recommended in epcc paper[1])
--delay-time 0.01
--test-time 2000

References

[1] Bull, J. Mark. "Measuring synchronisation and scheduling overheads in OpenMP." Proceedings of First European Workshop on OpenMP. Vol. 8. 1999.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
bin		bin
.gitignore		.gitignore
Licence.txt		Licence.txt
Makefile		Makefile
Makefile.defs		Makefile.defs
Makefile.defs.clang		Makefile.defs.clang
Makefile.defs.default		Makefile.defs.default
Makefile.defs.gcc		Makefile.defs.gcc
Makefile.defs.hector.cray		Makefile.defs.hector.cray
Makefile.defs.hector.pgi		Makefile.defs.hector.pgi
Makefile.defs.magny0.gnu		Makefile.defs.magny0.gnu
Makefile.defs.magny0.sun		Makefile.defs.magny0.sun
Makefile.defs.stokes.gnu		Makefile.defs.stokes.gnu
Makefile.defs.stokes.intel		Makefile.defs.stokes.intel
README.md		README.md
README.txt		README.txt
arraybench.c		arraybench.c
arraybench.h		arraybench.h
common.c		common.c
common.h		common.h
schedbench.c		schedbench.c
schedbench.h		schedbench.h
set_config.sh		set_config.sh
summarize_schedbench_runs.py		summarize_schedbench_runs.py
summarize_taskbench_runs.py		summarize_taskbench_runs.py
syncbench.c		syncbench.c
syncbench.h		syncbench.h
taskbench.c		taskbench.c
taskbench.h		taskbench.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EPCC OpenMP micro-benchmark suite

Some notes on parameter tuning for `schedbench`

Avoid a low iteration count

Avoid a long delay time

Use one thread per physical core

Example parameters

References

About

Releases

Packages

Languages

License

LangdalP/EPCC-OpenMP-micro-benchmarks

Folders and files

Latest commit

History

Repository files navigation

EPCC OpenMP micro-benchmark suite

Some notes on parameter tuning for schedbench

Avoid a low iteration count

Avoid a long delay time

Use one thread per physical core

Example parameters

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Some notes on parameter tuning for `schedbench`

Packages