OpenMathLib
OpenBLAS
Multi-thread Performance Improvement of GEMM with DIVIDE_RATE=1 for A64FX
#5353
Merged
Comparing nakagawa-fj:feature/gemm_divide_rate_for_A64FX (5253c8f) with develop (8f0a1a3)
CodSpeed Performance Gauge: 0%
Untouched: 62 benchmarks
Uses the CPU Simulation instrument to collect CPU performance metrics.
Benchmarks listed on this page (25 of 62), all from benchmark/pybench/benchmarks/bench_blas.py.
The two times are given in the order they appear on the page.

Benchmark               Change   Times
test_dgbmv[1-100-s]     0%       36.9 µs, 36.8 µs
test_gesv[100-d]        0%       395.1 µs, 394.1 µs
test_syrk[100-d]        0%       340.8 µs, 340.1 µs
test_nrm2[1000-dz]      0%       35.1 µs, 35 µs
test_syrk[100-s]        0%       213.2 µs, 213 µs
test_syrk[100-c]        0%       473.3 µs, 473.1 µs
test_dgemv[1000-s]      0%       7 ms, 7 ms
test_gesdd[mn1-d]       0%       93.8 ms, 93.8 ms
test_gesv[1000-d]       0%       93.3 ms, 93.3 ms
test_gesv[100-s]        0%       257.3 µs, 257.2 µs
test_gemm[100-d]        0%       470.9 µs, 470.8 µs
test_dgbmv[1-1000-c]    0%       99.1 µs, 99.1 µs
test_syev[200-s]        0%       49.1 ms, 49.1 ms
test_dgemv[100-c]       0%       148.6 µs, 148.6 µs
test_gemm[1000-s]       0%       117.4 ms, 117.4 ms
test_syrk[1000-s]       0%       65.4 ms, 65.4 ms
test_syrk[1000-c]       0%       227.5 ms, 227.5 ms
test_syrk[1000-d]       0%       130.4 ms, 130.4 ms
test_gesv[1000-c]       0%       188.6 ms, 188.6 ms
test_syrk[1000-z]       0%       476.4 ms, 476.4 ms
test_gemm[1000-d]       0%       239.4 ms, 239.4 ms
test_dgemv[1000-z]      0%       26.3 ms, 26.3 ms
test_gesv[1000-z]       0%       353.6 ms, 353.6 ms
test_gesdd[mn1-s]       0%       65.2 ms, 65.2 ms
test_dgemv[1000-d]      0%       13.9 ms, 13.9 ms
Commits
Base: develop (8f0a1a3)
5253c8f (-0.13%): Multi-thread Performance Improvement of GEMM with DIVIDE_RATE=1 for
11 months ago by nakagawa-fj
© 2026 CodSpeed Technology