OpenMathLib
OpenBLAS
Blog
Docs
Changelog
Blog
Docs
Changelog
Overview
Branches
Benchmarks
Runs
Performance History
Latest Results
Fix incorrect leading dimension check for SME SGEMM direct kernel path For row-major matrices, the tight-packing condition should be k==lda (A is m×k), n==ldb (B is k×n), and n==ldc (C is m×n). The old check used m==lda and k==ldc, which prevented the SME/direct kernel from being invoked except when m==k==n (square matrices). Fixes https://github.com/OpenMathLib/OpenBLAS/issues/5794
quic:fix/sgemm-direct-leading-dim-check
23 hours ago
Fix incorrect leading dimension check for SME SGEMM direct kernel path For row-major matrices, the tight-packing condition should be k==lda (A is m×k), n==ldb (B is k×n), and n==ldc (C is m×n). The old check used m==lda and k==ldc, which prevented the SME/direct kernel from being invoked except when m==k==n (square matrices). Fixes https://github.com/OpenMathLib/OpenBLAS/issues/5794 Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
quic:fix/sgemm-direct-leading-dim-check
2 days ago
Use -mpwr10 instead of -mpower10 to support AIX assembler
ayappanec:fix-aix-pwr10
2 days ago
Separate LLVM prerequisites for ARMV9SME and VORTEXM4
martin-frbg:issue5713-2
7 days ago
Merge pull request #5802 from pkubaj/patch-3 Makefile.power: allow using power10 instructions
develop
7 days ago
Merge pull request #5800 from notaz/armv7_sgemm provide a NEON version of arm/sgemm
develop
7 days ago
Makefile.power: allow using power10 instructions OpenBLAS uses various power9 and power10 instructions and they fail to build with errors like in https://github.com/OpenMathLib/OpenBLAS/issues/4793. Just allow the latest ISA, it will generate the same code, but actually allow the binaries to be assembled.
pkubaj:patch-3
7 days ago
Makefile.system: when Clang is used, check if gfortran is used At least on power, GCCVERSIONGT4 is checked for when gfortran is used, like at Makefile.power:59
pkubaj:patch-2
7 days ago
Latest Branches
CodSpeed Performance Gauge
0%
Fix incorrect leading dimension check for SME SGEMM direct kernel path
#5805
2 days ago
c32eefd
quic:fix/sgemm-direct-leading-dim-check
CodSpeed Performance Gauge
0%
Use -mpwr10 instead of -mpower10 to support AIX assembler
#5804
2 days ago
beed554
ayappanec:fix-aix-pwr10
CodSpeed Performance Gauge
0%
Fix compiler check for ARM64 SME support in CMake builds
#5797
7 days ago
d4d02fe
martin-frbg:issue5713-2
© 2026 CodSpeed Technology
Home
Terms
Privacy
Docs