Skip to content

v0.1.7

Pre-release
Pre-release
Compare
Choose a tag to compare
@github-actions github-actions released this 20 Dec 21:08
Benchmark Mode Threads Samples Score Score Error (99.9%) Unit Param: k Param: m Param: n Param: trans Param: transa Param: transb
dev.ludovic.blas.benchmarks.DaxpyBenchmark blas thrpt 1 6 10246033.56 324590.30 ops/s 100
dev.ludovic.blas.benchmarks.DaxpyBenchmark f2j thrpt 1 6 17422407.12 159847.18 ops/s 100
dev.ludovic.blas.benchmarks.DaxpyBenchmark vector thrpt 1 6 53372341.22 1533737.07 ops/s 100
dev.ludovic.blas.benchmarks.DaxpyBenchmark blas thrpt 1 6 97.38 5.59 ops/s 10000000
dev.ludovic.blas.benchmarks.DaxpyBenchmark f2j thrpt 1 6 106.23 1.68 ops/s 10000000
dev.ludovic.blas.benchmarks.DaxpyBenchmark vector thrpt 1 6 122.64 0.50 ops/s 10000000
dev.ludovic.blas.benchmarks.DdotBenchmark blas thrpt 1 6 10435249.72 74396.98 ops/s 100
dev.ludovic.blas.benchmarks.DdotBenchmark f2j thrpt 1 6 10834780.28 278495.34 ops/s 100
dev.ludovic.blas.benchmarks.DdotBenchmark vector thrpt 1 6 44321846.77 1263146.16 ops/s 100
dev.ludovic.blas.benchmarks.DdotBenchmark blas thrpt 1 6 120.62 4.65 ops/s 10000000
dev.ludovic.blas.benchmarks.DdotBenchmark f2j thrpt 1 6 87.96 1.84 ops/s 10000000
dev.ludovic.blas.benchmarks.DdotBenchmark vector thrpt 1 6 151.89 1.62 ops/s 10000000
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2021958.63 78348.54 ops/s 10 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 918603.13 7038.17 ops/s 10 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 918298.89 5131.31 ops/s 10 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 76528.69 3239.31 ops/s 1000 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10360.86 705.81 ops/s 1000 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 10193.12 56.41 ops/s 1000 10 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 82407.62 811.14 ops/s 10 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 17616.07 214.34 ops/s 10 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 17628.58 50.29 ops/s 10 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2406.41 158.77 ops/s 1000 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 168.40 2.01 ops/s 1000 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 166.03 2.20 ops/s 1000 1000 10 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 78797.92 1708.58 ops/s 10 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 9643.74 447.37 ops/s 10 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 9643.09 428.93 ops/s 10 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1947.12 643.80 ops/s 1000 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 105.75 1.84 ops/s 1000 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 105.94 0.43 ops/s 1000 10 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1435.52 1370.58 ops/s 10 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 169.56 0.23 ops/s 10 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 169.58 0.41 ops/s 10 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 101.73 6.28 ops/s 1000 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 1.67 0.11 ops/s 1000 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 1.59 0.33 ops/s 1000 1000 1000 N N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2051378.77 29749.48 ops/s 10 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 930267.55 2350.46 ops/s 10 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 1168010.57 198909.60 ops/s 10 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 78719.86 2263.80 ops/s 1000 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10799.45 17.99 ops/s 1000 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 43889.68 383.42 ops/s 1000 10 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 82370.74 2338.18 ops/s 10 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10032.36 100.98 ops/s 10 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 13187.77 2747.76 ops/s 10 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2548.44 61.18 ops/s 1000 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 103.18 0.54 ops/s 1000 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 349.60 12.05 ops/s 1000 1000 10 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 78464.03 2395.29 ops/s 10 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 9698.28 12.84 ops/s 10 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 12438.11 758.77 ops/s 10 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2121.77 75.43 ops/s 1000 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 106.49 1.80 ops/s 1000 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 415.38 3.20 ops/s 1000 10 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1890.07 77.75 ops/s 10 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 98.27 1.14 ops/s 10 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 137.40 2.00 ops/s 10 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 102.58 2.11 ops/s 1000 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 1.03 0.01 ops/s 1000 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 3.47 0.16 ops/s 1000 1000 1000 T N
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1895217.54 101053.21 ops/s 10 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 922431.17 1022.94 ops/s 10 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 916376.97 3629.74 ops/s 10 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 59787.87 7594.98 ops/s 1000 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 9859.37 753.44 ops/s 1000 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 9929.31 503.22 ops/s 1000 10 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 79905.21 7760.42 ops/s 10 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 16800.46 66.67 ops/s 10 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 16796.62 104.18 ops/s 10 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2404.82 88.87 ops/s 1000 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 153.55 2.57 ops/s 1000 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 151.95 2.01 ops/s 1000 1000 10 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 67836.56 2714.85 ops/s 10 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 9102.08 17.28 ops/s 10 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 9353.27 557.99 ops/s 10 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1756.23 496.27 ops/s 1000 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 69.87 3.92 ops/s 1000 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 69.40 2.26 ops/s 1000 10 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1572.90 1133.61 ops/s 10 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 154.35 0.84 ops/s 10 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 154.41 0.88 ops/s 10 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 101.13 1.92 ops/s 1000 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 1.49 0.02 ops/s 1000 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 1.48 0.01 ops/s 1000 1000 1000 N T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1942295.50 14127.47 ops/s 10 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 697122.42 12738.73 ops/s 10 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 696853.05 17712.65 ops/s 10 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 62872.02 1442.90 ops/s 1000 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 10578.85 42.14 ops/s 1000 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 10591.02 21.79 ops/s 1000 10 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 80941.21 2317.60 ops/s 10 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 7564.97 368.43 ops/s 10 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 7646.03 63.39 ops/s 10 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 2552.65 38.84 ops/s 1000 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 98.51 1.01 ops/s 1000 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 97.57 1.91 ops/s 1000 1000 10 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 68195.44 1469.32 ops/s 10 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 7457.21 10.67 ops/s 10 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 7450.20 10.43 ops/s 10 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1843.62 88.38 ops/s 1000 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 46.52 0.47 ops/s 1000 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 46.66 0.20 ops/s 1000 10 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 1736.36 1026.96 ops/s 10 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 73.37 0.22 ops/s 10 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 73.45 0.21 ops/s 10 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark blas thrpt 1 6 101.64 2.74 ops/s 1000 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark f2j thrpt 1 6 0.50 0.00 ops/s 1000 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemmBenchmark vector thrpt 1 6 0.49 0.02 ops/s 1000 1000 1000 T T
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 4927941.48 26498.64 ops/s 10 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 7917230.92 20916.88 ops/s 10 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 7697013.50 20479.52 ops/s 10 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 36674.05 58707.61 ops/s 10000 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 15856.23 78.18 ops/s 10000 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 15603.38 104.89 ops/s 10000 10 N
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 54399.40 3062.97 ops/s 10 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 8987.45 30.59 ops/s 10 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 8967.86 27.78 ops/s 10 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 22.06 0.73 ops/s 10000 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 13.32 0.40 ops/s 10000 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 13.33 0.28 ops/s 10000 10000 N
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 4631849.33 9862.71 ops/s 10 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 8218050.72 22692.31 ops/s 10 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 11360029.05 256185.49 ops/s 10 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 124303.79 4456.24 ops/s 10000 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 10853.12 23.88 ops/s 10000 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 42244.05 881.03 ops/s 10000 10 T
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 48864.68 1370.07 ops/s 10 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 9804.03 971.75 ops/s 10 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 15307.71 5687.86 ops/s 10 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark blas thrpt 1 6 22.15 0.78 ops/s 10000 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark f2j thrpt 1 6 9.54 0.06 ops/s 10000 10000 T
dev.ludovic.blas.benchmarks.DgemvBenchmark vector thrpt 1 6 24.48 0.69 ops/s 10000 10000 T
dev.ludovic.blas.benchmarks.DscalBenchmark blas thrpt 1 6 556106.34 1558.12 ops/s 100
dev.ludovic.blas.benchmarks.DscalBenchmark f2j thrpt 1 6 301464.42 408.74 ops/s 100
dev.ludovic.blas.benchmarks.DscalBenchmark vector thrpt 1 6 1128620.54 70004.11 ops/s 100
dev.ludovic.blas.benchmarks.DscalBenchmark blas thrpt 1 6 57.94 1.54 ops/s 10000000
dev.ludovic.blas.benchmarks.DscalBenchmark f2j thrpt 1 6 3.00 0.01 ops/s 10000000
dev.ludovic.blas.benchmarks.DscalBenchmark vector thrpt 1 6 11.05 0.02 ops/s 10000000
dev.ludovic.blas.benchmarks.DspmvBenchmark blas thrpt 1 6 2785286.52 43352.90 ops/s 10
dev.ludovic.blas.benchmarks.DspmvBenchmark f2j thrpt 1 6 7542712.48 21696.12 ops/s 10
dev.ludovic.blas.benchmarks.DspmvBenchmark vector thrpt 1 6 9327522.75 81905.88 ops/s 10
dev.ludovic.blas.benchmarks.DspmvBenchmark blas thrpt 1 6 6484.03 629.13 ops/s 1000
dev.ludovic.blas.benchmarks.DspmvBenchmark f2j thrpt 1 6 1591.89 14.95 ops/s 1000
dev.ludovic.blas.benchmarks.DspmvBenchmark vector thrpt 1 6 5245.87 1708.00 ops/s 1000
dev.ludovic.blas.benchmarks.DsprBenchmark blas thrpt 1 6 3893030.69 22708.49 ops/s 10
dev.ludovic.blas.benchmarks.DsprBenchmark f2j thrpt 1 6 10547617.12 111648.52 ops/s 10
dev.ludovic.blas.benchmarks.DsprBenchmark vector thrpt 1 6 15417785.06 17647.77 ops/s 10
dev.ludovic.blas.benchmarks.DsprBenchmark blas thrpt 1 6 30937.80 1182.03 ops/s 1000
dev.ludovic.blas.benchmarks.DsprBenchmark f2j thrpt 1 6 3339.86 9.22 ops/s 1000
dev.ludovic.blas.benchmarks.DsprBenchmark vector thrpt 1 6 8295.96 69.26 ops/s 1000
dev.ludovic.blas.benchmarks.DsyrBenchmark blas thrpt 1 6 4048848.73 36499.84 ops/s 10
dev.ludovic.blas.benchmarks.DsyrBenchmark f2j thrpt 1 6 10955146.20 516013.94 ops/s 10
dev.ludovic.blas.benchmarks.DsyrBenchmark vector thrpt 1 6 15930602.39 526565.87 ops/s 10
dev.ludovic.blas.benchmarks.DsyrBenchmark blas thrpt 1 6 29257.40 918.46 ops/s 1000
dev.ludovic.blas.benchmarks.DsyrBenchmark f2j thrpt 1 6 3127.22 19.05 ops/s 1000
dev.ludovic.blas.benchmarks.DsyrBenchmark vector thrpt 1 6 7158.78 58.71 ops/s 1000
dev.ludovic.blas.benchmarks.SdotBenchmark blas thrpt 1 6 10836798.77 71666.56 ops/s 100
dev.ludovic.blas.benchmarks.SdotBenchmark f2j thrpt 1 6 10826607.22 48332.79 ops/s 100
dev.ludovic.blas.benchmarks.SdotBenchmark vector thrpt 1 6 48557633.31 2907501.25 ops/s 100
dev.ludovic.blas.benchmarks.SdotBenchmark blas thrpt 1 6 317.53 3.33 ops/s 10000000
dev.ludovic.blas.benchmarks.SdotBenchmark f2j thrpt 1 6 99.15 0.72 ops/s 10000000
dev.ludovic.blas.benchmarks.SdotBenchmark vector thrpt 1 6 315.87 4.83 ops/s 10000000