jvdd/argminmax

feat: add 64-bit aarch64 SIMD implementation

#52 (Merged)
Comparing aarch64_64bit_simd (5b51457) with main (5602f92)
Overall performance change: 0%
Untouched benchmarks: 168

Benchmarks

Passed

| Benchmark | Path | Change | Times (as listed) |
| --- | --- | --- | --- |
| impl_f16_argmax_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::impl_f16_argmax_rn | 0% | 119.5 µs / 119.4 µs |
| impl_f16_argminmax_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::impl_f16_argminmax_rn | 0% | 119.5 µs / 119.4 µs |
| avx2_u8_argmin | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::avx2_u8_argmin | 0% | 61.9 µs / 61.9 µs |
| avx2_i8_argmax | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::avx2_i8_argmax | 0% | 62.5 µs / 62.5 µs |
| avx2_f16_argminmax_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::avx2_f16_argminmax_rn | 0% | 119.3 µs / 119.2 µs |
| avx2_u8_argminmax | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::avx2_u8_argminmax | 0% | 73.7 µs / 73.7 µs |
| sse_f16_argmax_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::sse_f16_argmax_in | 0% | 162 µs / 162 µs |
| avx2_f16_argmin_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::avx2_f16_argmin_in | 0% | 121.2 µs / 121.2 µs |
| sse_f16_argmax_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::sse_f16_argmax_rn | 0% | 159.2 µs / 159.2 µs |
| avx2_u16_argminmax | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::avx2_u16_argminmax | 0% | 113.1 µs / 113.1 µs |
| sse_i16_argminmax | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::sse_i16_argminmax | 0% | 144.2 µs / 144.2 µs |
| avx2_f16_argmin_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::avx2_f16_argmin_rn | 0% | 119.3 µs / 119.3 µs |
| impl_f16_argmin_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::impl_f16_argmin_rn | 0% | 119.4 µs / 119.4 µs |
| sse_f16_argminmax_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::sse_f16_argminmax_in | 0% | 180.2 µs / 180.1 µs |
| avx_f32_argmin_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::avx_f32_argmin_in | 0% | 211.5 µs / 211.4 µs |
| scalar_i8_argmin | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::scalar_i8_argmin | 0% | 253 µs / 253 µs |
| scalar_i8_argmax | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::scalar_i8_argmax | 0% | 253 µs / 253 µs |
| sse_u32_argmax | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::sse_u32_argmax | 0% | 265.5 µs / 265.5 µs |
| scalar_u16_argmin | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::scalar_u16_argmin | 0% | 292.6 µs / 292.6 µs |
| sse_i32_argminmax | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::sse_i32_argminmax | 0% | 290.5 µs / 290.4 µs |
| sse_u32_argminmax | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::sse_u32_argminmax | 0% | 301.2 µs / 301.2 µs |
| scalar_i16_argminmax | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::scalar_i16_argminmax | 0% | 349.5 µs / 349.5 µs |
| scalar_f32_argminmax_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::scalar_f32_argminmax_rn | 0% | 528.2 µs / 528.1 µs |
| avx_f64_argmax_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::avx_f64_argmax_in | 0% | 422.1 µs / 422.1 µs |
| avx_f64_argminmax_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::avx_f64_argminmax_in | 0% | 448.8 µs / 448.8 µs |
| avx2_u64_argminmax | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::avx2_u64_argminmax | 0% | 452.3 µs / 452.3 µs |
| sse_f64_argminmax_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::sse_f64_argminmax_in | 0% | 552 µs / 552 µs |
| scalar_i64_argminmax | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::scalar_i64_argminmax | 0% | 629.5 µs / 629.5 µs |
| scalar_f64_argminmax_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::scalar_f64_argminmax_in | 0% | 658 µs / 658 µs |
| scalar_f64_argmin_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::scalar_f64_argmin_rn | 0% | 714.8 µs / 714.8 µs |
| sse_f16_argmin_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::sse_f16_argmin_in | 0% | 162 µs / 162 µs |
| scalar_u32_argmin | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::scalar_u32_argmin | 0% | 357.5 µs / 357.5 µs |
| avx2_f32_argminmax_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::avx2_f32_argminmax_rn | 0% | 235.6 µs / 235.6 µs |
| sse_f32_argmin_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::sse_f32_argmin_in | 0% | 242.5 µs / 242.5 µs |
| avx_f64_argmin_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::avx_f64_argmin_in | 0% | 422.1 µs / 422.1 µs |
| sse_f32_argmin_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::sse_f32_argmin_rn | 0% | 319.1 µs / 319.1 µs |
| sse_f64_argmin_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::sse_f64_argmin_in | 0% | 484.2 µs / 484.2 µs |
| avx2_i32_argmax | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::avx2_i32_argmax | 0% | 211.3 µs / 211.3 µs |
| impl_i32_argmax | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::impl_i32_argmax | 0% | 211.5 µs / 211.5 µs |
| sse_f32_argmax_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::sse_f32_argmax_in | 0% | 242.4 µs / 242.4 µs |
| sse_f32_argminmax_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::sse_f32_argminmax_in | 0% | 276.4 µs / 276.4 µs |
| scalar_f64_argminmax_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::scalar_f64_argminmax_rn | 0% | 714.8 µs / 714.8 µs |
| avx2_f64_argminmax_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::avx2_f64_argminmax_rn | 0% | 487.7 µs / 487.7 µs |
| avx2_f64_argmin_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::avx2_f64_argmin_rn | 0% | 487.8 µs / 487.8 µs |
| sse_f64_argminmax_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::sse_f64_argminmax_rn | 0% | 644.1 µs / 644.1 µs |
| sse_f64_argmin_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::sse_f64_argmin_rn | 0% | 644.1 µs / 644.1 µs |
| scalar_u64_argmax | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::scalar_u64_argmax | 0% | 565.5 µs / 565.5 µs |
| scalar_f32_argmax_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::scalar_f32_argmax_in | 0% | 442.9 µs / 442.9 µs |
| scalar_u32_argmax | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::scalar_u32_argmax | 0% | 378.8 µs / 378.8 µs |
| impl_f64_argmin_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::impl_f64_argmin_in | 0% | 422.3 µs / 422.3 µs |
| impl_f64_argminmax_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::impl_f64_argminmax_in | 0% | 449 µs / 449 µs |
| impl_i32_argminmax | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::impl_i32_argminmax | 0% | 225 µs / 225 µs |
| scalar_f64_argmax_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::scalar_f64_argmax_in | 0% | 629.5 µs / 629.5 µs |
| scalar_u16_argminmax | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::scalar_u16_argminmax | 0% | 349.5 µs / 349.5 µs |
| scalar_i32_argmax | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::scalar_i32_argmax | 0% | 357.5 µs / 357.5 µs |
| scalar_i64_argmin | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::scalar_i64_argmin | 0% | 544.2 µs / 544.2 µs |
| scalar_f64_argmin_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::scalar_f64_argmin_in | 0% | 629.5 µs / 629.5 µs |
| scalar_i16_argmax | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::scalar_i16_argmax | 0% | 292.6 µs / 292.6 µs |
| impl_i16_argminmax | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::impl_i16_argminmax | 0% | 113.3 µs / 113.3 µs |
| impl_u8_argmax | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::impl_u8_argmax | 0% | 69.4 µs / 69.4 µs |
| scalar_i32_argmin | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::scalar_i32_argmin | 0% | 357.5 µs / 357.5 µs |
| impl_i16_argmax | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::impl_i16_argmax | 0% | 106.8 µs / 106.8 µs |
| impl_u32_argmax | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::impl_u32_argmax | 0% | 215.9 µs / 215.9 µs |
| sse_i64_argmin | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::sse_i64_argmin | 0% | 508.9 µs / 508.9 µs |
| sse_u8_argmin | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::sse_u8_argmin | 0% | 68.7 µs / 68.7 µs |
| sse_i16_argmax | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::sse_i16_argmax | 0% | 126.1 µs / 126.1 µs |
| scalar_u8_argmax | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::scalar_u8_argmax | 0% | 253 µs / 253 µs |
| sse_i16_argmin | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::sse_i16_argmin | 0% | 126.1 µs / 126.1 µs |
| avx2_i16_argmax | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::avx2_i16_argmax | 0% | 106.6 µs / 106.6 µs |
| sse_i32_argmax | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::sse_i32_argmax | 0% | 247.7 µs / 247.7 µs |
| scalar_f16_argmax_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::scalar_f16_argmax_rn | 0% | 577.1 µs / 577.1 µs |
| scalar_i16_argmin | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::scalar_i16_argmin | 0% | 292.6 µs / 292.6 µs |
| impl_i8_argmin | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::impl_i8_argmin | 0% | 67.7 µs / 67.7 µs |
| scalar_i32_argminmax | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::scalar_i32_argminmax | 0% | 442.8 µs / 442.8 µs |
| impl_f32_argmax_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::impl_f32_argmax_rn | 0% | 235.8 µs / 235.8 µs |
| scalar_u64_argmin | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::scalar_u64_argmin | 0% | 544.2 µs / 544.2 µs |
| avx2_i8_argmin | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::avx2_i8_argmin | 0% | 62.5 µs / 62.5 µs |
| sse_u64_argminmax | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::sse_u64_argminmax | 0% | 580.2 µs / 580.2 µs |
| sse_u16_argmax | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::sse_u16_argmax | 0% | 131.5 µs / 131.5 µs |
| impl_u16_argmax | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::impl_u16_argmax | 0% | 106.9 µs / 106.9 µs |
| avx2_i8_argminmax | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::avx2_i8_argminmax | 0% | 74.2 µs / 74.2 µs |
| scalar_f16_argmin_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::scalar_f16_argmin_in | 0% | 548.7 µs / 548.7 µs |
| sse_f64_argmax_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::sse_f64_argmax_rn | 0% | 644.1 µs / 644.1 µs |
| impl_i16_argmin | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::impl_i16_argmin | 0% | 106.8 µs / 106.8 µs |
| scalar_u8_argmin | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::scalar_u8_argmin | 0% | 253 µs / 253 µs |
| sse_u8_argminmax | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::sse_u8_argminmax | 0% | 86 µs / 86 µs |
| avx2_i16_argminmax | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::avx2_i16_argminmax | 0% | 113.1 µs / 113.1 µs |
| impl_u8_argmin | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::impl_u8_argmin | 0% | 68.9 µs / 68.9 µs |
| impl_u8_argminmax | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::impl_u8_argminmax | 0% | 86.1 µs / 86.1 µs |
| sse_i64_argminmax | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::sse_i64_argminmax | 0% | 565.9 µs / 565.9 µs |
| sse_u8_argmax | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::sse_u8_argmax | 0% | 69.2 µs / 69.2 µs |
| avx2_u32_argmax | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::avx2_u32_argmax | 0% | 215.7 µs / 215.7 µs |
| impl_f32_argmax_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::impl_f32_argmax_in | 0% | 211.7 µs / 211.7 µs |
| scalar_f32_argmax_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::scalar_f32_argmax_rn | 0% | 528.2 µs / 528.2 µs |
| impl_i8_argminmax | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::impl_i8_argminmax | 0% | 84.9 µs / 84.9 µs |
| impl_u64_argmin | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::impl_u64_argmin | 0% | 431 µs / 431 µs |
| sse_i8_argminmax | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::sse_i8_argminmax | 0% | 84.7 µs / 84.7 µs |
| impl_f32_argminmax_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::impl_f32_argminmax_rn | 0% | 235.8 µs / 235.8 µs |
| sse_u16_argminmax | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::sse_u16_argminmax | 0% | 149.5 µs / 149.5 µs |
| impl_u16_argminmax | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::impl_u16_argminmax | 0% | 113.3 µs / 113.3 µs |
| impl_i8_argmax | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::impl_i8_argmax | 0% | 67.6 µs / 67.6 µs |
| sse_u16_argmin | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::sse_u16_argmin | 0% | 131.5 µs / 131.5 µs |
| avx2_u16_argmin | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::avx2_u16_argmin | 0% | 106.7 µs / 106.7 µs |
| impl_u16_argmin | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::impl_u16_argmin | 0% | 106.9 µs / 106.9 µs |
| sse_u64_argmin | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::sse_u64_argmin | 0% | 523.1 µs / 523.1 µs |
| impl_f64_argmax_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::impl_f64_argmax_in | 0% | 422.3 µs / 422.3 µs |
| sse_f32_argmax_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::sse_f32_argmax_rn | 0% | 319.1 µs / 319.1 µs |
| impl_f32_argmin_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::impl_f32_argmin_rn | 0% | 235.8 µs / 235.8 µs |
| sse_i8_argmin | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::sse_i8_argmin | 0% | 67.5 µs / 67.5 µs |
| scalar_u64_argminmax | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::scalar_u64_argminmax | 0% | 629.5 µs / 629.5 µs |
| scalar_i64_argmax | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::scalar_i64_argmax | 0% | 544.2 µs / 544.2 µs |
| impl_f64_argmax_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::impl_f64_argmax_rn | 0% | 487.9 µs / 487.9 µs |
| impl_f64_argmin_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::impl_f64_argmin_rn | 0% | 487.9 µs / 487.9 µs |
| impl_f64_argminmax_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::impl_f64_argminmax_rn | 0% | 487.9 µs / 487.9 µs |
| impl_i64_argmax | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::impl_i64_argmax | 0% | 430.9 µs / 430.9 µs |
| impl_i64_argmin | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::impl_i64_argmin | 0% | 430.9 µs / 430.9 µs |
| avx2_i64_argmax | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::avx2_i64_argmax | 0% | 430.7 µs / 430.7 µs |
| impl_u64_argminmax | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::impl_u64_argminmax | 0% | 452.5 µs / 452.5 µs |
| impl_i64_argminmax | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::impl_i64_argminmax | 0% | 452.4 µs / 452.4 µs |
| avx2_i64_argminmax | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::avx2_i64_argminmax | 0% | 452.2 µs / 452.2 µs |
| impl_u32_argminmax | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::impl_u32_argminmax | 0% | 225.1 µs / 225.1 µs |
| avx2_u32_argminmax | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::avx2_u32_argminmax | 0% | 224.9 µs / 224.9 µs |
| impl_u32_argmin | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::impl_u32_argmin | 0% | 215.9 µs / 215.9 µs |
| avx2_u32_argmin | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::avx2_u32_argmin | 0% | 215.8 µs / 215.8 µs |
| impl_u64_argmax | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::impl_u64_argmax | 0% | 430.9 µs / 430.9 µs |
| avx2_u64_argmax | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::avx2_u64_argmax | 0% | 430.8 µs / 430.8 µs |
| scalar_i8_argminmax | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::scalar_i8_argminmax | 0% | 359.7 µs / 359.7 µs |
| scalar_f16_argminmax_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::scalar_f16_argminmax_rn | 0% | 634 µs / 634 µs |
| scalar_f16_argminmax_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::scalar_f16_argminmax_in | 0% | 776.3 µs / 776.3 µs |
| scalar_u32_argminmax | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::scalar_u32_argminmax | 0% | 442.8 µs / 442.8 µs |
| scalar_f32_argmin_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::scalar_f32_argmin_in | 0% | 442.9 µs / 442.9 µs |
| impl_i32_argmin | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::impl_i32_argmin | 0% | 211.5 µs / 211.5 µs |
| avx2_i32_argmin | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::avx2_i32_argmin | 0% | 211.4 µs / 211.4 µs |
| impl_f32_argminmax_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::impl_f32_argminmax_in | 0% | 225.2 µs / 225.2 µs |
| impl_f32_argmin_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::impl_f32_argmin_in | 0% | 211.6 µs / 211.6 µs |
| scalar_f64_argmax_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::scalar_f64_argmax_rn | 0% | 714.8 µs / 714.8 µs |
| avx2_f16_argmax_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::avx2_f16_argmax_in | 0% | 121.2 µs / 121.2 µs |
| scalar_f16_argmin_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::scalar_f16_argmin_rn | 0% | 577.1 µs / 577.1 µs |
| scalar_f16_argmax_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::scalar_f16_argmax_in | 0% | 548.7 µs / 548.7 µs |
| scalar_f32_argmin_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::scalar_f32_argmin_rn | 0% | 528.1 µs / 528.2 µs |
| sse_u64_argmax | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::sse_u64_argmax | 0% | 523.1 µs / 523.1 µs |
| scalar_f32_argminmax_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::scalar_f32_argminmax_in | 0% | 471.3 µs / 471.4 µs |
| sse_i64_argmax | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::sse_i64_argmax | 0% | 508.8 µs / 508.9 µs |
| sse_f64_argmax_in | benches/bench_f64_ignore_nan.rs::benches::argminmax_in_f64_random_array_long::sse_f64_argmax_in | 0% | 484.2 µs / 484.2 µs |
| avx2_f64_argmax_rn | benches/bench_f64_return_nan.rs::benches::argminmax_rn_f64_random_array_long::avx2_f64_argmax_rn | 0% | 487.8 µs / 487.9 µs |
| avx2_u64_argmin | benches/bench_u64.rs::benches::argminmax_u64_random_array_long::avx2_u64_argmin | 0% | 430.8 µs / 430.8 µs |
| avx2_i64_argmin | benches/bench_i64.rs::benches::argminmax_i64_random_array_long::avx2_i64_argmin | 0% | 430.7 µs / 430.7 µs |
| scalar_u8_argminmax | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::scalar_u8_argminmax | 0% | 359.7 µs / 359.7 µs |
| sse_f32_argminmax_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::sse_f32_argminmax_rn | 0% | 319 µs / 319 µs |
| scalar_u16_argmax | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::scalar_u16_argmax | 0% | 292.6 µs / 292.6 µs |
| sse_u32_argmin | benches/bench_u32.rs::benches::argminmax_u32_random_array_long::sse_u32_argmin | 0% | 265.5 µs / 265.5 µs |
| avx2_f32_argmin_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::avx2_f32_argmin_rn | 0% | 235.6 µs / 235.7 µs |
| sse_i32_argmin | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::sse_i32_argmin | 0% | 247.7 µs / 247.8 µs |
| sse_f16_argmin_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::sse_f16_argmin_rn | 0% | 159.2 µs / 159.2 µs |
| sse_f16_argminmax_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::sse_f16_argminmax_rn | 0% | 159.1 µs / 159.1 µs |
| avx2_f32_argmax_rn | benches/bench_f32_return_nan.rs::benches::argminmax_rn_f32_random_array_long::avx2_f32_argmax_rn | 0% | 235.6 µs / 235.7 µs |
| avx_f32_argminmax_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::avx_f32_argminmax_in | 0% | 225 µs / 225 µs |
| avx2_i32_argminmax | benches/bench_i32.rs::benches::argminmax_i32_random_array_long::avx2_i32_argminmax | 0% | 224.8 µs / 224.8 µs |
| avx_f32_argmax_in | benches/bench_f32_ignore_nan.rs::benches::argminmax_in_f32_random_array_long::avx_f32_argmax_in | 0% | 211.5 µs / 211.5 µs |
| avx2_f16_argmax_rn | benches/bench_f16_return_nan.rs::benches::argminmax_rn_f16_random_array_long::avx2_f16_argmax_rn | 0% | 119.3 µs / 119.3 µs |
| avx2_u16_argmax | benches/bench_u16.rs::benches::argminmax_u16_random_array_long::avx2_u16_argmax | 0% | 106.7 µs / 106.7 µs |
| avx2_i16_argmin | benches/bench_i16.rs::benches::argminmax_i16_random_array_long::avx2_i16_argmin | 0% | 106.7 µs / 106.7 µs |
| impl_f16_argmin_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::impl_f16_argmin_in | 0% | 121.3 µs / 121.4 µs |
| impl_f16_argminmax_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::impl_f16_argminmax_in | 0% | 129 µs / 129 µs |
| sse_i8_argmax | benches/bench_i8.rs::benches::argminmax_i8_random_array_long::sse_i8_argmax | 0% | 67.5 µs / 67.5 µs |
| avx2_u8_argmax | benches/bench_u8.rs::benches::argminmax_u8_random_array_long::avx2_u8_argmax | 0% | 62.5 µs / 62.5 µs |
| avx2_f16_argminmax_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::avx2_f16_argminmax_in | 0% | 128.8 µs / 128.8 µs |
| impl_f16_argmax_in | benches/bench_f16_ignore_nan.rs::benches::argminmax_in_f16_random_array_long::impl_f16_argmax_in | 0% | 121.4 µs / 121.5 µs |
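The `_rn` and `_in` suffixes in the benchmark names appear to match the `return_nan` and `ignore_nan` bench files in their paths. As a rough illustration of what those two NaN policies could mean for a scalar baseline, here is a minimal sketch. The function names, the `Option` return type, and the choice to report the first NaN index are assumptions made for this example; they are not taken from the argminmax crate's API or from this report.

```rust
/// Hypothetical scalar reference: indices of the minimum and maximum,
/// skipping NaN values. Returns `None` for empty or all-NaN input
/// (an assumption of this sketch, not the crate's documented behavior).
fn argminmax_ignore_nan(data: &[f64]) -> Option<(usize, usize)> {
    let mut best: Option<(usize, usize)> = None;
    for (i, &v) in data.iter().enumerate() {
        if v.is_nan() {
            continue; // "ignore NaN": NaN never becomes the min or max
        }
        best = Some(match best {
            None => (i, i),
            // Strict comparisons keep the first occurrence on ties.
            Some((lo, hi)) => (
                if v < data[lo] { i } else { lo },
                if v > data[hi] { i } else { hi },
            ),
        });
    }
    best
}

/// Hypothetical scalar reference: if any NaN is present, report the index
/// of the first NaN for both results; otherwise behave like the version above.
fn argminmax_return_nan(data: &[f64]) -> Option<(usize, usize)> {
    if let Some(i) = data.iter().position(|v| v.is_nan()) {
        return Some((i, i)); // "return NaN": NaN wins over every other value
    }
    argminmax_ignore_nan(data)
}
```

For the input `[1.0, NaN, -2.0, 5.0]`, the first function yields `Some((2, 3))` and the second yields `Some((1, 1))`. A SIMD variant would apply the same policy lane by lane before a final horizontal reduction.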

Commits

Base: main (5602f92)

| Change | Commit message | Hash | Age | Author |
| --- | --- | --- | --- | --- |
| +0.01% | fix: fix typo in nightly_simd feature | 383c791 | 2 years ago | jvdd |
| 0% | fix: use 64-bit lanesize for aarch64 u64 | 4cbdd0c | 2 years ago | jvdd |
| 0% | feat: add stable f64 ignore nan SIMD for aarch64 | fec3b69 | 2 years ago | jvdd |
| 0% | feat: add stable f64 return nan SIMD for aarch64 | 9227628 | 2 years ago | jvdd |
| 0% | fix: import std-arch for aarch64 | cf6e522 | 2 years ago | jvdd |
| 0% | fix: use 64-bit intrinsincs instead of 32-bit ones :see_no_evil: | d0b4d62 | 2 years ago | jvdd |
| 0% | fix: MAX_INDEX available on aarch64 for f64 ignore nan | 88a0e46 | 2 years ago | jvdd |
| 0% | fix: clippy import warnings | 0097135 | 2 years ago | jvdd |
| 0% | bench: add 64-bit aarch64 benchmarks | 8c23a21 | 2 years ago | jvdd |
| 0% | perf: use aarch64 where faster than scalar | 8d91856 | 2 years ago | jvdd |
| 0% | chore: formatting | 3b9708c | 2 years ago | jvdd |
| 0% | fix: use correct arg function | b66e8e5 | 2 years ago | jvdd |
| -0.01% | :pen: review code | 5b51457 | 2 years ago | jvdd |