Avatar for the vortex-data user
vortex-data
vortex
BlogDocsChangelog

feat(cuda): fuse narrower-than-output Dict codes and RunEnd ends

#7617Merged
Comparing
ad/cuda-fuse-ends-codes
(
feeb2b9
) with
develop
(
543dbe7
)
CodSpeed Performance Gauge
0%
Untouched
1106
Archived
57

Benchmarks

1163 total
bitwise_not_vortex_buffer_mut[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+10%
333.6 ns304.4 ns
bitwise_not_vortex_buffer_mut[1024]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+9%
366.1 ns336.9 ns
bitwise_not_vortex_buffer_mut[2048]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+7%
429.7 ns400.6 ns
decompress[u8, (1000, 4)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+5%
23.2 µs22.1 µs
map_new_output[128]
vortex-buffer/benches/vortex_buffer.rs
CodSpeed Performance Gauge
+4%
789.4 ns760.3 ns
iter_arrow_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+3%
890.3 ns861.1 ns
ScalarBuffer<i32>>, 128]
vortex-buffer/benches/vortex_buffer.rs::map_each[Arrow<arrow_buffer::buffer::scalar
CodSpeed Performance Gauge
+3%
2 µs2 µs
set_indices_arrow_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+3%
1,010.3 ns981.1 ns
bitwise_not_vortex_buffer_mut[16384]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+2%
1.3 µs1.3 µs
take_map[(0.01, 0.1)]
vortex-array/benches/take_patches.rs
CodSpeed Performance Gauge
+2%
162.5 µs158.9 µs
append_arrow_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+2%
1.4 µs1.4 µs
untranspose_bmi2
encodings/fastlanes/benches/bit_transpose.rs::x86
CodSpeed Performance Gauge
+2%
1.8 µs1.8 µs
append_n_arrow_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+2%
1.8 µs1.8 µs
transpose_scalar
encodings/fastlanes/benches/bit_transpose.rs
CodSpeed Performance Gauge
+2%
1.8 µs1.8 µs
ScalarBuffer<i32>>, 1024]
vortex-buffer/benches/vortex_buffer.rs::map_each[Arrow<arrow_buffer::buffer::scalar
CodSpeed Performance Gauge
+2%
3.8 µs3.7 µs
take_indices[(10000, 1024)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+2%
57.2 µs56.4 µs
decompress[u16, (1000, 16)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
18.2 µs18 µs
map_each[Buffer<i32>, 128]
vortex-buffer/benches/vortex_buffer.rs
CodSpeed Performance Gauge
+1%
2.1 µs2.1 µs
append_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+1%
2.1 µs2.1 µs
decompress[u16, (1000, 256)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
15.9 µs15.7 µs
append_n_arrow_buffer[1024]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+1%
2.6 µs2.6 µs
append_n_arrow_buffer[2048]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+1%
2.6 µs2.6 µs
take_map[(0.01, 0.5)]
vortex-array/benches/take_patches.rs
CodSpeed Performance Gauge
+1%
606.2 µs599.9 µs
ScalarBuffer<i32>>, 2048]
vortex-buffer/benches/vortex_buffer.rs::map_each[Arrow<arrow_buffer::buffer::scalar
CodSpeed Performance Gauge
+1%
5.8 µs5.7 µs
decompress[u32, (10000, 1024)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
33.2 µs32.9 µs

Commits

Click on a commit to change the comparison range
Base
develop
543dbe7
+0.01%
feat(cuda): fuse narrower-than-output Dict codes and RunEnd ends
feeb2b9
23 days ago
by 0ax1
© 2026 CodSpeed Technology
Home Terms Privacy Docs