Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update benchmarks for 1.8 and 1.9 #1933

Merged
merged 5 commits into from
Jun 2, 2023
Merged

Update benchmarks for 1.8 and 1.9 #1933

merged 5 commits into from
Jun 2, 2023

Conversation

maleadt
Copy link
Member

@maleadt maleadt commented Jun 2, 2023

Also includes some micro-optimizations that make kernel launch really efficient on 1.10:

julia> @benchmark cufunction(identity, Tuple{Nothing})
BenchmarkTools.Trial: 10000 samples with 767 evaluations.
 Range (min … max):  166.140 ns …  1.673 μs  ┊ GC (min … max): 0.00% … 88.29%
 Time  (median):     171.237 ns              ┊ GC (median):    0.00%
 Time  (mean ± σ):   173.108 ns ± 30.041 ns  ┊ GC (mean ± σ):  0.37% ±  1.93%

        ▁▃▆███▅▃▁
  ▁▁▂▂▄▆█████████▇▆▅▄▄▃▃▂▂▂▂▂▂▂▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▂▁▁▁▁▁▁▁ ▃
  166 ns          Histogram: frequency by time          193 ns <

 Memory estimate: 32 bytes, allocs estimate: 1.

julia> @benchmark @cuda identity(nothing)
BenchmarkTools.Trial: 10000 samples with 9 evaluations.
 Range (min … max):  2.030 μs …   5.024 μs  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     2.252 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   2.286 μs ± 165.630 ns  ┊ GC (mean ± σ):  0.00% ± 0.00%

            ▃▆███▅▇▁▁▁
  ▂▁▂▂▂▂▂▃▃▆██████████▇▇▆▆▆▅▅▄▃▃▃▃▃▃▃▃▂▂▂▂▂▂▂▂▂▁▂▂▂▂▂▂▂▂▂▂▂▂▂ ▄
  2.03 μs         Histogram: frequency by time        2.86 μs <

 Memory estimate: 96 bytes, allocs estimate: 2.

[skip tests]
@maleadt maleadt merged commit b210f61 into master Jun 2, 2023
@maleadt maleadt deleted the tb/perf branch June 2, 2023 13:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant