Avatar for the OpenMathLib user
OpenMathLib
OpenBLAS
BlogDocsChangelog

GitHub Actions run

5 days ago 7a1d234 ChipKerchner:RVV_Narrow_Accumulate_FP16_GEMM pull_request

Compare

Base
Search a run

Head
Added ability to accumulate in FP16. Convert BF16 to FP32. For FP16 and BF16 GEMM in RISC-V (BF16 now works for pre-RVA23)
#5640
ChipKerchner:RVV_Narrow_Accumulate_FP16_GEMM
5 days ago
CPU Simulation

Compare
Suggested base runs:
62 total
test_gemm[1000-s]
benchmark/pybench/benchmarks/bench_blas.py
117.4 ms
test_daxpy[100-c]
benchmark/pybench/benchmarks/bench_blas.py
25.2 µs
test_gemm[1000-z]
benchmark/pybench/benchmarks/bench_blas.py
875.6 ms
test_nrm2[1000-dz]
benchmark/pybench/benchmarks/bench_blas.py
35.4 µs
test_daxpy[100-s]
benchmark/pybench/benchmarks/bench_blas.py
24.1 µs
test_syev[200-d]
benchmark/pybench/benchmarks/bench_blas.py
58.6 ms
test_nrm2[1000-d]
benchmark/pybench/benchmarks/bench_blas.py
30.5 µs
test_nrm2[100-dz]
benchmark/pybench/benchmarks/bench_blas.py
28.8 µs
test_daxpy[100-d]
benchmark/pybench/benchmarks/bench_blas.py
24.2 µs
test_syev[50-d]
benchmark/pybench/benchmarks/bench_blas.py
1.4 ms
test_syev[200-s]
benchmark/pybench/benchmarks/bench_blas.py
49.1 ms
test_gesdd[mn1-d]
benchmark/pybench/benchmarks/bench_blas.py
93.8 ms
test_gesdd[mn0-s]
benchmark/pybench/benchmarks/bench_blas.py
109.5 µs
test_syrk[100-s]
benchmark/pybench/benchmarks/bench_blas.py
213.5 µs
test_syev[50-s]
benchmark/pybench/benchmarks/bench_blas.py
1.3 ms
test_daxpy[1000-c]
benchmark/pybench/benchmarks/bench_blas.py
32.8 µs
test_gesv[100-c]
benchmark/pybench/benchmarks/bench_blas.py
696.1 µs
test_syrk[1000-c]
benchmark/pybench/benchmarks/bench_blas.py
227.5 ms
test_gesdd[mn0-d]
benchmark/pybench/benchmarks/bench_blas.py
120.1 µs
test_syrk[100-d]
benchmark/pybench/benchmarks/bench_blas.py
340.5 µs
test_daxpy[100-z]
benchmark/pybench/benchmarks/bench_blas.py
25.9 µs
test_syrk[1000-d]
benchmark/pybench/benchmarks/bench_blas.py
130.4 ms
test_gesv[100-z]
benchmark/pybench/benchmarks/bench_blas.py
937 µs
test_gesv[1000-d]
benchmark/pybench/benchmarks/bench_blas.py
93.3 ms
test_daxpy[1000-s]
benchmark/pybench/benchmarks/bench_blas.py
27.6 µs
© 2026 CodSpeed Technology
Home Terms Privacy Docs