Pull requests
dgemm loop unroll and 4x1 4x2 dgemv implimentationpratiklp00:dgemm_optimization 0%
Fix _Float16 casting issue and reduce LMUL for certain vector instruction from m2 to m1.ChipKerchner:fixRVVSHGEMM 0%
Support for SME1 based strmm_direct kernel for cblas_strmm level 3 APIquic:topic/strmm_direct_sme1 0%
fix symbol naming (underscoring) 0%
© 2025 CodSpeed Technology