vortex-data/vortex

Add Mojo AOT-compiled SIMD take/filter kernels for primitive arrays

#7387
Comparing claude/plan-mojo-simd-kernels-IDywB (5f2a781) with develop (8d9052e)
CodSpeed Performance Gauge: +82%
Improvement: 34, Untouched: 1088, Skipped: 1455

Benchmarks

2577 total
| Benchmark | File | Change | Base | Head |
| --- | --- | --- | --- | --- |
| decompress[u16, (1000, 4)] | encodings/runend/benches/run_end_compress.rs | +18% | 25.3 µs | 21.4 µs |
| decompress[u64, (10000, 16)] | encodings/runend/benches/run_end_compress.rs | +18% | 75.4 µs | 64 µs |
| decompress[u32, (1000, 4)] | encodings/runend/benches/run_end_compress.rs | +16% | 26.3 µs | 22.6 µs |
| varbinview_zip_fragmented_mask | vortex-array/benches/varbinview_zip.rs | +12% | 7.1 ms | 6.4 ms |
| decompress[u8, (1000, 4)] | encodings/runend/benches/run_end_compress.rs | +11% | 23.8 µs | 21.4 µs |
| bench_dict_mask[(0.01, 0.9)] | vortex-array/benches/dict_mask.rs | +10% | 1.7 ms | 1.5 ms |
| bench_dict_mask[(0.5, 0.9)] | vortex-array/benches/dict_mask.rs | +10% | 1.7 ms | 1.5 ms |
| bench_dict_mask[(0.1, 0.9)] | vortex-array/benches/dict_mask.rs | +10% | 1.7 ms | 1.5 ms |
| bench_dict_mask[(0.9, 0.9)] | vortex-array/benches/dict_mask.rs | +10% | 1.7 ms | 1.6 ms |
| bench_dict_mask[(0.01, 0.5)] | vortex-array/benches/dict_mask.rs | +10% | 1.8 ms | 1.6 ms |
| bench_dict_mask[(0.9, 0.5)] | vortex-array/benches/dict_mask.rs | +10% | 1.8 ms | 1.6 ms |
| bench_dict_mask[(0.1, 0.5)] | vortex-array/benches/dict_mask.rs | +10% | 1.8 ms | 1.6 ms |
| bench_dict_mask[(0.5, 0.5)] | vortex-array/benches/dict_mask.rs | +10% | 1.8 ms | 1.6 ms |
| bitwise_not_vortex_buffer_mut[128] | vortex-buffer/benches/vortex_bitbuffer.rs | +10% | 333.6 ns | 304.4 ns |
| decompress[u8, (100000, 256)] | encodings/runend/benches/run_end_compress.rs | +9% | 75.4 µs | 68.9 µs |
| bench_dict_mask[(0.01, 0.1)] | vortex-array/benches/dict_mask.rs | +9% | 1.9 ms | 1.7 ms |
| bench_dict_mask[(0.9, 0.1)] | vortex-array/benches/dict_mask.rs | +9% | 1.9 ms | 1.7 ms |
| bench_dict_mask[(0.1, 0.1)] | vortex-array/benches/dict_mask.rs | +9% | 1.9 ms | 1.7 ms |
| bench_dict_mask[(0.5, 0.1)] | vortex-array/benches/dict_mask.rs | +9% | 1.9 ms | 1.7 ms |
| bench_dict_mask[(0.9, 0.01)] | vortex-array/benches/dict_mask.rs | +9% | 1.9 ms | 1.8 ms |
| bench_dict_mask[(0.1, 0.01)] | vortex-array/benches/dict_mask.rs | +9% | 1.9 ms | 1.8 ms |
| bench_dict_mask[(0.5, 0.01)] | vortex-array/benches/dict_mask.rs | +9% | 1.9 ms | 1.8 ms |
| bench_dict_mask[(0.01, 0.01)] | vortex-array/benches/dict_mask.rs | +9% | 1.9 ms | 1.8 ms |
| bitwise_not_vortex_buffer_mut[1024] | vortex-buffer/benches/vortex_bitbuffer.rs | +8% | 395.3 ns | 366.1 ns |
| decompress[u32, (1000, 16)] | encodings/runend/benches/run_end_compress.rs | +7% | 20.9 µs | 19.5 µs |
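
The percentages in the benchmark list appear to be speedups relative to the new timing rather than reductions relative to the old one. The sketch below is an inference from the listed numbers, not a statement of how CodSpeed computes its gauge: it assumes the first timing of each benchmark is the base measurement, the second is the head measurement, and the gauge is `base / head - 1`. The function names `gauge_percent` and `gauge_label` are hypothetical.

```rust
/// Relative change in percent, ASSUMING the gauge is computed as base / head - 1.
/// (Inferred from the listed timings, not taken from CodSpeed documentation.)
fn gauge_percent(base: f64, head: f64) -> f64 {
    (base / head - 1.0) * 100.0
}

/// Formats the value in the style the report uses, e.g. "+18%".
/// Both timings must be expressed in the same unit.
fn gauge_label(base: f64, head: f64) -> String {
    format!("{:+.0}%", gauge_percent(base, head))
}
```

For example, `gauge_label(25.3, 21.4)` yields `+18%`, which matches the listed figure for `decompress[u16, (1000, 4)]`, whereas a reduction relative to base, `(25.3 - 21.4) / 25.3`, would be roughly 15%. Because the displayed timings are rounded, a few entries (such as `varbinview_zip_fragmented_mask`, 7.1 ms and 6.4 ms) can differ from the listed percentage by a point when recomputed this way.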

Commits

Base: develop (8d9052e)

| Change | Commit | Hash | When | Author |
| --- | --- | --- | --- | --- |
| 0% | Install Mojo SDK in codspeed benchmark CI for vortex-array | 6bfda92 | 5 days ago | claude |
| -0.02% | Fix SIGILL in CI: pin Mojo target to x86-64-v3 (AVX2) | 59bb1ea | 5 days ago | claude |
| -14.68% | Fix nightly rustfmt: split grouped imports, reorder super:: imports | 64fdd36 | 5 days ago | claude |
| +14.63% | Deprioritize Mojo below AVX2 on x86_64 in take dispatch | 457d81a | 5 days ago | claude |
| +0.01% | Fix Mojo build in Cargo: pass --target-triple from TARGET env var | 2afb139 | 5 days ago | claude |
| 0% | Optimize Mojo gather: 4x unroll + skylake target for vpgatherqd | f14c2f8 | 5 days ago | claude |
| -0.01% | Add --mtune to Mojo build for better instruction scheduling | d60f190 | 5 days ago | claude |
| +48.81% | Promote Mojo to top-priority take kernel when available | 0d0fd77 | 5 days ago | claude |
| +2.47% | Merge develop, resolve conflicts in Cargo.toml and take/mod.rs | 900985a | 5 days ago | claude |
| -0.05% | Add scalar baseline to runend decode benchmark for codspeed comparison | f2f14b4 | 5 days ago | claude |
| +31.42% | Support u64 ends in Mojo runend decode to hit existing benchmarks | d51800a | 5 days ago | claude |
| -1.27% | Clean up PR: split kernels per crate, remove unnecessary benchmarks | 7ec0e46 | 5 days ago | claude |
| +0.2% | Fix lint: use #[allow(unused)] not #[expect(unused)] for TakeKernelScalar | 4d7e86d | 5 days ago | claude |
| 0% | Fix Mojo build on macOS: skip --mcpu=native and --target-triple on Apple targets | 5f2a781 | 5 days ago | joseph-isaacs |
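
Several commits concern run-end decoding, which is also what the `decompress[...]` benchmarks in `run_end_compress.rs` exercise. As a rough illustration of what a scalar baseline for that operation might look like, here is a minimal sketch. It is not the code from this PR or from Vortex: the function name `runend_decode_scalar` is hypothetical, and it assumes the common run-end convention where `ends[i]` is the exclusive end index of run `i` and `values[i]` is the value repeated over that run.

```rust
/// Hypothetical scalar run-end decode (illustrative only, not the PR's code).
/// ASSUMPTION: `ends[i]` is the exclusive end position of run `i`,
/// so run `i` covers output indices `ends[i - 1]..ends[i]` (with `ends[-1] = 0`).
fn runend_decode_scalar<T: Copy>(ends: &[u64], values: &[T]) -> Vec<T> {
    assert_eq!(ends.len(), values.len(), "one value per run");

    // The last end is the total decoded length (0 for empty input).
    let len = ends.last().copied().unwrap_or(0) as usize;
    let mut out = Vec::with_capacity(len);

    let mut start = 0usize;
    for (&end, &value) in ends.iter().zip(values) {
        let end = end as usize;
        assert!(end >= start, "run ends must be non-decreasing");
        // Emit `value` once for every position in this run.
        out.extend(std::iter::repeat(value).take(end - start));
        start = end;
    }
    out
}
```

With `ends = [2, 5, 6]` and `values = [7, 8, 9]`, this produces `[7, 7, 8, 8, 8, 9]`. A SIMD kernel would aim to produce the same output while filling each run with vector stores instead of one element at a time.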