OpenMathLib
OpenBLAS
Blog
Docs
Changelog
Blog
Docs
Changelog
Overview
Branches
Benchmarks
Runs
[ARM64] Add optimized fp16 shgemm kernels for Neoverse N2
#5716
Merged
Comparing
yuanjia111:develop
(
e6eba9f
) with
develop
(
2671786
)
CodSpeed Performance Gauge
0%
Untouched
62
Benchmarks
Mode
CPU Simulation
Wall Time
Memory
Status
Untouched
62 total
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syrk[100-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
341.4 µs
339.9 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-100-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
38.2 µs
38.1 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syrk[100-c]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
473.9 µs
472.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesdd[mn0-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
109.6 µs
109.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesv[100-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
394.9 µs
394.3 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesv[100-c]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
696.1 µs
695.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-100-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
37.5 µs
37.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgemv[1000-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
7 ms
7 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesv[100-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
257.3 µs
257.1 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgemv[100-z]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
231.5 µs
231.5 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgemv[100-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
104.3 µs
104.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-1000-z]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
119.1 µs
119.1 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syev[50-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
1.3 ms
1.3 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_dgbmv[1-1000-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
83.8 µs
83.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syev[200-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
58.6 ms
58.6 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesv[1000-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
52.6 ms
52.6 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesv[1000-c]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
188.6 ms
188.6 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syrk[1000-z]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
476.4 ms
476.4 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gemm[1000-d]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
239.4 ms
239.4 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gemm[1000-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
117.4 ms
117.4 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gemm[1000-z]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
875.6 ms
875.6 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gemm[1000-c]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
426 ms
426 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syev[200-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
49.1 ms
49.1 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_syrk[1000-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
65.4 ms
65.4 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_gesdd[mn1-s]
benchmark/pybench/benchmarks/bench_blas.py
CodSpeed Performance Gauge
0%
65.2 ms
65.2 ms
1
2
3
Commits
Click on a commit to change the comparison range
Base
develop
2671786
-0.15%
Add optimized FP16 shgemm for for NEOVERSEN2 target
e6eba9f
1 month ago
by yuanjia111
© 2026 CodSpeed Technology
Home
Terms
Privacy
Docs