OpenMathLib
OpenBLAS
Improvement of 2D thread-partitioned GEMM for M << N case
#5276
Merged
Comparing nakagawa-fj:gemm_2d_thread_partitioning (2351a98) with develop (a5f701c)
Overall change: 0% (62 benchmarks untouched)
Uses the CPU Simulation instrument to collect CPU performance metrics.
Benchmarks (all in benchmark/pybench/benchmarks/bench_blas.py):

Benchmark               Change   Base        Head
test_nrm2[100-d]        +2%      35.3 µs     34.7 µs
test_dgbmv[1-100-c]     0%       39.6 µs     39.4 µs
test_daxpy[100-c]       0%       24.6 µs     24.6 µs
test_gesdd[mn0-s]       0%       109 µs      108.6 µs
test_syrk[100-c]        0%       473.2 µs    471.8 µs
test_daxpy[100-d]       0%       23.6 µs     23.6 µs
test_gesdd[mn0-d]       0%       120.1 µs    119.9 µs
test_daxpy[1000-d]      0%       31.8 µs     31.7 µs
test_dgbmv[1-100-d]     0%       37.2 µs     37.2 µs
test_daxpy[100-s]       0%       23.5 µs     23.4 µs
test_syrk[100-z]        0%       856.2 µs    855.4 µs
test_dgbmv[1-100-s]     0%       36.7 µs     36.6 µs
test_dgemv[100-z]       0%       230.7 µs    230.5 µs
test_dgemv[100-c]       0%       149.5 µs    149.3 µs
test_dgbmv[1-1000-s]    0%       74.4 µs     74.4 µs
test_dgemv[100-d]       0%       147.8 µs    147.7 µs
test_gesv[100-s]        0%       256.7 µs    256.6 µs
test_gesv[100-d]        0%       395.5 µs    395.4 µs
test_dgbmv[1-1000-c]    0%       98.6 µs     98.6 µs
test_syrk[100-d]        0%       339.1 µs    339 µs
test_syev[50-s]         0%       1.3 ms      1.3 ms
test_gemm[100-s]        0%       272.7 µs    272.7 µs
test_dgbmv[1-100-z]     0%       41.4 µs     41.4 µs
test_gesv[1000-s]       0%       52.6 ms     52.6 ms
test_daxpy[100-z]       0%       25.3 µs     25.3 µs
Commits
Base: develop (a5f701c)
2351a98: Update 2D thread-partitioned GEMM for M << N case. (+0.04%, 1 year ago, by nakagawa-fj)
© 2026 CodSpeed Technology