OpenMathLib
OpenBLAS
Blog
Docs
Changelog
Blog
Docs
Changelog
Overview
Branches
Benchmarks
Runs
Rewrite the Haswell SROT/DROT kernel tail loop with AVX2 to get consistent FMA rounding
#5660
Comparing
martin-frbg:issue5658
(
df29cc0
) with
develop
(
18638c7
)
CodSpeed Performance Gauge
0%
Untouched
62
Benchmarks
Mode
CPU Simulation
Wall Time
Memory
Status
Untouched
62 total
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syrk[100-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
341 µs
339.5 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_nrm2[1000-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
30.3 µs
30.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-100-c]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
40.4 µs
40.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_nrm2[100-dz]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
28.7 µs
28.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syrk[100-c]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
473.7 µs
472.3 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_daxpy[100-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
24.1 µs
24 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_daxpy[100-z]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
25.8 µs
25.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_daxpy[1000-z]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
40.5 µs
40.5 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_daxpy[1000-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
27.5 µs
27.5 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_nrm2[1000-dz]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
35.3 µs
35.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesv[100-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
394.5 µs
393.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-100-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
37.3 µs
37.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dot[100]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
22.5 µs
22.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-1000-c]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
99.6 µs
99.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-1000-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
83.7 µs
83.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-100-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
37.9 µs
37.9 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_nrm2[100-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
38.4 µs
38.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgemv[100-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
104.1 µs
104 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-100-z]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
42.2 µs
42.1 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesv[100-c]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
696.2 µs
695.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesv[100-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
256.9 µs
256.7 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-1000-z]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
118.9 µs
118.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgemv[1000-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
7 ms
7 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syev[50-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
1.3 ms
1.3 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesdd[mn0-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
120.1 µs
120.1 µs
1
2
3
Commits
Click on a commit to change the comparison range
Base
develop
18638c7
+0.05%
Use AVX2 in the tail loop too for consistent FMA rounding
df29cc0
1 day ago
by martin-frbg
© 2026 CodSpeed Technology
Home
Terms
Privacy
Docs