vortex-data/vortex

chore: speed up CUDA kernel compilation

#7509 Merged
Comparing adroste/split-cuda-kernel-build (9f27745) with develop (6ac69e7)
CodSpeed Performance Gauge: 0%
Untouched: 1163
Skipped: 1457

Benchmarks

2620 total
Benchmark | File | Change | Base → Head
bitwise_not_vortex_buffer_mut[128] | vortex-buffer/benches/vortex_bitbuffer.rs | +10% | 333.6 ns → 304.4 ns
bitwise_not_vortex_buffer_mut[1024] | vortex-buffer/benches/vortex_bitbuffer.rs | +8% | 395.3 ns → 366.1 ns
bitwise_not_vortex_buffer_mut[2048] | vortex-buffer/benches/vortex_bitbuffer.rs | +7% | 458.9 ns → 429.7 ns
iter_arrow_buffer[128] | vortex-buffer/benches/vortex_bitbuffer.rs | +7% | 919.4 ns → 861.1 ns
binary_search_std | vortex-array/benches/search_sorted.rs | +6% | 515.3 ns → 486.1 ns
binary_search_vortex | vortex-array/benches/search_sorted.rs | +4% | 727.2 ns → 698.1 ns
value_arrow_buffer[128] | vortex-buffer/benches/vortex_bitbuffer.rs | +3% | 1,011.1 ns → 981.9 ns
take_10_contiguous | encodings/fastlanes/benches/bitpacking_take.rs | +3% | 26.1 µs → 25.4 µs
set_indices_arrow_buffer[128] | vortex-buffer/benches/vortex_bitbuffer.rs | +3% | 1.1 µs → 1.1 µs
patched_take_10_contiguous | encodings/fastlanes/benches/bitpacking_take.rs | +2% | 31.5 µs → 30.8 µs
push_vortex_buffer[128] | vortex-buffer/benches/vortex_buffer.rs | +2% | 1.2 µs → 1.2 µs
set_indices_arrow_buffer[1024] | vortex-buffer/benches/vortex_bitbuffer.rs | +2% | 2.5 µs → 2.4 µs
bitwise_not_vortex_buffer_mut[16384] | vortex-buffer/benches/vortex_bitbuffer.rs | +2% | 1.3 µs → 1.3 µs
chunked_bool_canonical_into[(1000, 10)] | vortex-array/benches/chunk_array_builder.rs | +2% | 60.5 µs → 59.5 µs
push_n_vortex_buffer[u32, 128] | vortex-buffer/benches/vortex_buffer.rs | +2% | 1.8 µs → 1.7 µs
transpose_bmi2 | encodings/fastlanes/benches/bit_transpose.rs::x86 | +2% | 1.8 µs → 1.8 µs
untranspose_bmi2 | encodings/fastlanes/benches/bit_transpose.rs::x86 | +2% | 1.8 µs → 1.8 µs
push_n_vortex_buffer[u8, 128] | vortex-buffer/benches/vortex_buffer.rs | +2% | 1.9 µs → 1.9 µs
set_indices_arrow_buffer[2048] | vortex-buffer/benches/vortex_bitbuffer.rs | +1% | 4 µs → 4 µs
append_vortex_buffer[128] | vortex-buffer/benches/vortex_bitbuffer.rs | +1% | 2.1 µs → 2.1 µs
iter_arrow_buffer[1024] | vortex-buffer/benches/vortex_bitbuffer.rs | +1% | 4.4 µs → 4.4 µs
push_n_vortex_buffer[u8, 1024] | vortex-buffer/benches/vortex_buffer.rs | +1% | 2.3 µs → 2.2 µs
null_count_run_end[(10000, 256, 0.01)] | encodings/runend/benches/run_end_null_count.rs | +1% | 4.5 µs → 4.5 µs
null_count_run_end[(10000, 1024, 0.01)] | encodings/runend/benches/run_end_null_count.rs | +1% | 4.5 µs → 4.5 µs
null_count_run_end[(100000, 1024, 0.01)] | encodings/runend/benches/run_end_null_count.rs | +1% | 4.6 µs → 4.5 µs

Commits

Base: develop (6ac69e7)
+0.01% | Avoid Rust recompilation on CUDA kernel changes (9f27745) | 20 hours ago by 0ax1