OpenMathLib
OpenBLAS
Performance improvements of [SD]DOT with loop-unrolling on A64FX (#5358)
Merged
Comparing iha-taisei:dot_unroll (f7ad906) with develop (36c2589)
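The pull request title refers to loop unrolling of the dot-product kernels. The kernel code itself is not shown on this page, so the following is only a minimal sketch of the general technique, written in Python for readability. The function names `dot_naive` and `dot_unrolled4` and the unroll factor of four are assumptions made for illustration, not taken from the PR.

```python
def dot_naive(x, y):
    """Reference dot product: one multiply-add per loop iteration."""
    acc = 0.0
    for i in range(len(x)):
        acc += x[i] * y[i]
    return acc


def dot_unrolled4(x, y):
    """Dot product with the loop body unrolled four times (illustrative).

    Four independent accumulators remove the dependency between
    consecutive multiply-adds; a scalar tail loop handles the elements
    left over when the length is not a multiple of four.
    Assumes x and y have the same length.
    """
    n = len(x)
    main_end = n - (n % 4)  # last index covered by the unrolled loop
    acc0 = acc1 = acc2 = acc3 = 0.0
    for i in range(0, main_end, 4):
        acc0 += x[i] * y[i]
        acc1 += x[i + 1] * y[i + 1]
        acc2 += x[i + 2] * y[i + 2]
        acc3 += x[i + 3] * y[i + 3]
    tail = 0.0
    for i in range(main_end, n):
        tail += x[i] * y[i]
    # Combine the partial sums once, after the loops.
    return (acc0 + acc1) + (acc2 + acc3) + tail
```

Because the unrolled version adds the products in a different order, its result can differ from the naive loop in the last bits for general floating-point inputs. In a compiled kernel the benefit would come from keeping several multiply-add chains in flight at once; in Python the sketch only shows the loop structure.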
CodSpeed Performance Gauge: 0%
Untouched: 62 benchmarks
Passed
Uses the CPU Simulation instrument to collect CPU performance metrics.
All benchmarks are defined in benchmark/pybench/benchmarks/bench_blas.py.

Benchmark             Change  Base      Head
test_syrk[100-d]      0%      340.8 µs  339.2 µs
test_syrk[100-c]      0%      473.2 µs  472 µs
test_dgbmv[1-100-c]   0%      39.9 µs   39.8 µs
test_gesv[100-d]      0%      396 µs    395.1 µs
test_dgbmv[1-1000-d]  0%      83.3 µs   83.2 µs
test_gesv[100-c]      0%      695.8 µs  695.3 µs
test_syev[50-d]       0%      1.4 ms    1.4 ms
test_gesv[1000-d]     0%      93.3 ms   93.3 ms
test_gemm[100-d]      0%      470.9 µs  470.8 µs
test_dgemv[1000-s]    0%      7 ms      7 ms
test_gemm[100-c]      0%      659.2 µs  659.1 µs
test_gesv[100-s]      0%      256.5 µs  256.5 µs
test_syev[200-d]      0%      58.6 ms   58.6 ms
test_syev[50-s]       0%      1.3 ms    1.3 ms
test_dgemv[100-z]     0%      230.6 µs  230.6 µs
test_dgemv[1000-c]    0%      14.8 ms   14.8 ms
test_syev[200-s]      0%      49.1 ms   49.1 ms
test_syrk[1000-d]     0%      130.4 ms  130.3 ms
test_gesdd[mn1-s]     0%      65.2 ms   65.2 ms
test_gesdd[mn1-d]     0%      93.9 ms   93.8 ms
test_dgemv[1000-z]    0%      26.3 ms   26.3 ms
test_gemm[1000-s]     0%      117.4 ms  117.4 ms
test_syrk[1000-c]     0%      227.5 ms  227.5 ms
test_gesv[1000-c]     0%      188.6 ms  188.6 ms
test_gesv[1000-z]     0%      353.6 ms  353.6 ms
test_syrk[1000-z]     0%      476.4 ms  476.4 ms
test_gemm[1000-d]     0%      239.4 ms  239.4 ms
test_gemm[1000-c]     0%      426 ms    426 ms
test_gesv[1000-s]     0%      52.6 ms   52.6 ms
test_syrk[1000-s]     0%      65.4 ms   65.4 ms
test_gemm[100-z]      0%      1.2 ms    1.2 ms
test_dgemv[1000-d]    0%      13.9 ms   13.9 ms
test_gemm[100-s]      0%      272.7 µs  272.8 µs
test_syrk[100-s]      0%      212.9 µs  213 µs
test_gemm[1000-z]     0%      875.2 ms  875.6 ms
test_dgbmv[1-100-d]   0%      37.5 µs   37.5 µs
test_gesv[100-z]      0%      936.8 µs  937.4 µs
test_dgbmv[1-100-s]   0%      36.8 µs   36.8 µs
test_dgbmv[1-1000-s]  0%      74.6 µs   74.7 µs
test_nrm2[100-d]      0%      35.5 µs   35.6 µs
test_syrk[100-z]      0%      855.3 µs  856 µs
test_gesdd[mn0-s]     0%      108.7 µs  108.8 µs
test_gesdd[mn0-d]     0%      119.5 µs  119.7 µs
test_dgbmv[1-100-z]   0%      41.7 µs   41.7 µs
test_dgbmv[1-1000-z]  0%      118.5 µs  118.6 µs
test_dgbmv[1-1000-c]  0%      99 µs     99.2 µs
test_daxpy[1000-d]    0%      31.9 µs   31.9 µs
test_daxpy[1000-z]    0%      40 µs     40.1 µs
test_daxpy[100-d]     -1%     23.7 µs   23.8 µs
test_dot[1000]        -1%     27.9 µs   28 µs
test_dgemv[100-s]     -1%     102.9 µs  103.4 µs
test_daxpy[1000-s]    -1%     26.9 µs   27.1 µs
test_daxpy[100-z]     -1%     25.2 µs   25.4 µs
test_daxpy[100-c]     -1%     24.5 µs   24.7 µs
test_nrm2[100-dz]     -1%     27.9 µs   28.1 µs
test_daxpy[100-s]     -1%     23.5 µs   23.6 µs
test_dgemv[100-d]     -1%     139.9 µs  140.7 µs
test_dgemv[100-c]     -1%     148.5 µs  149.5 µs
test_nrm2[1000-d]     -1%     30 µs     30.2 µs
test_dot[100]         -1%     21.9 µs   22 µs
test_daxpy[1000-c]    -1%     32.2 µs   32.4 µs
test_nrm2[1000-dz]    -1%     35 µs     35.3 µs
Commits
Base: develop (36c2589)
-0.14%: Performance improvements of [SD]DOT with loop-unrolling on A64FX (f7ad906), 7 months ago, by iha-taisei
© 2026 CodSpeed Technology