vortex-data/vortex

Reorder agg kernel dispatch, and have Combined use inner accumulators

#7889 (Merged)
Comparing ngates/aggregate-kernels (026d829) with develop (3e93048)
Overall performance change: 0%
Untouched: 1186
Archived: 24

Benchmarks

1210 total
true_count_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
+4%
678.1 ns → 648.9 ns
take_10_contiguous
encodings/fastlanes/benches/bitpacking_take.rs
+4%
28 µs → 26.9 µs
patched_take_10_contiguous
encodings/fastlanes/benches/bitpacking_take.rs
+3%
33 µs → 31.9 µs
true_count_arrow_buffer[1024]
vortex-buffer/benches/vortex_bitbuffer.rs
+3%
936.4 ns → 907.2 ns
iter_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
+3%
981.1 ns → 951.9 ns
true_count_arrow_buffer[2048]
vortex-buffer/benches/vortex_bitbuffer.rs
+3%
1,006.7 ns → 977.5 ns
set_indices_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
+3%
1.1 µs → 1 µs
null_count_run_end[(10000, 1024, 0.01)]
encodings/runend/benches/run_end_null_count.rs
+3%
3.3 µs → 3.2 µs
null_count_run_end[(10000, 256, 0.01)]
encodings/runend/benches/run_end_null_count.rs
+3%
3.3 µs → 3.2 µs
null_count_run_end[(100000, 1024, 0.01)]
encodings/runend/benches/run_end_null_count.rs
+3%
3.3 µs → 3.2 µs
cuda/runend/i32_runlen_10[100M]
vortex-cuda/benches/runend_cuda.rs::benches::benchmark_runend::cuda
+2%
1.4 ms → 1.4 ms
dict_canonicalize_zipfian[16, 1000]
vortex-array/benches/take_primitive.rs
+2%
52 µs → 51 µs
decode_primitives[f32, (1000, 4)]
vortex-array/benches/dict_compress.rs
+2%
18.5 µs → 18.2 µs
decode_primitives[f32, (1000, 2)]
vortex-array/benches/dict_compress.rs
+2%
18.5 µs → 18.2 µs
decode_primitives[f32, (1000, 8)]
vortex-array/benches/dict_compress.rs
+2%
18.5 µs → 18.2 µs
decode_primitives[f32, (1000, 32)]
vortex-array/benches/dict_compress.rs
+2%
18.6 µs → 18.3 µs
untranspose_bmi2
encodings/fastlanes/benches/bit_transpose.rs::x86
+2%
1.8 µs → 1.8 µs
transpose_scalar
encodings/fastlanes/benches/bit_transpose.rs
+2%
1.8 µs → 1.8 µs
decode_primitives[u8, (1000, 32)]
vortex-array/benches/dict_compress.rs
+2%
17.4 µs → 17.1 µs
decode_primitives[u8, (1000, 8)]
vortex-array/benches/dict_compress.rs
+2%
17.4 µs → 17.1 µs
decode_primitives[u8, (1000, 4)]
vortex-array/benches/dict_compress.rs
+2%
17.4 µs → 17.1 µs
decode_primitives[u8, (1000, 512)]
vortex-array/benches/dict_compress.rs
+2%
17.4 µs → 17.1 µs
patched_take_10_stratified
encodings/fastlanes/benches/bitpacking_take.rs
+2%
32.1 µs → 31.6 µs
decode_primitives[f32, (1000, 512)]
vortex-array/benches/dict_compress.rs
+2%
19.7 µs → 19.4 µs
decode_varbin[(1000, 4)]
vortex-array/benches/dict_compress.rs
+2%
28.5 µs → 28.1 µs

Commits

Base: develop (3e93048)
+31.26%: Reorder kernel dispatch, and have Combined use inner accumulators (7a65ca2, 27 days ago, by gatesn)
0%: Reorder kernel dispatch, and have Combined use inner accumulators (842b03a, 27 days ago, by gatesn)
-31.33%: Reorder kernel dispatch, and have Combined use inner accumulators (026d829, 27 days ago, by gatesn)
© 2026 CodSpeed Technology