vortex-data
vortex
Blog
Docs
Changelog
Blog
Docs
Changelog
Overview
Branches
Benchmarks
Runs
bench: CUDA host-to-device copy modes
#7815
Merged
Comparing
ad/cuda-copy-benchmarks
(
fc4cda4
) with
develop
(
0b790d2
)
CodSpeed Performance Gauge
0%
Untouched
1206
Unknown Walltime execution environment detected
Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data. For the most accurate results, we recommend using
CodSpeed Macro Runners
: bare-metal machines fine-tuned for performance measurement consistency.
Benchmarks
Mode
CPU Simulation
Wall Time
Memory
Status
Untouched
1206 total
Uses the
Wall Time instrument
to collect wall time performance metrics.
cuda/dict_256vals_bp8bw_codes/dispatch_u32[100M]
vortex-cuda/benches/dynamic_dispatch_cuda.rs::benches::benchmark_dynamic_dispatch::cuda
CodSpeed Performance Gauge
+5%
1.1 ms
1.1 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
true_count_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+5%
648.9 ns
619.7 ns
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
true_count_arrow_buffer[1024]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+3%
965.6 ns
936.4 ns
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
iter_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+3%
981.1 ns
951.9 ns
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
true_count_arrow_buffer[2048]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+3%
1 µs
1 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
set_indices_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+3%
1 µs
1 µs
Uses the
Wall Time instrument
to collect wall time performance metrics.
cuda/alp_for_bp_6bw_f32/dispatch_f32[100M]
vortex-cuda/benches/dynamic_dispatch_cuda.rs::benches::benchmark_dynamic_dispatch::cuda
CodSpeed Performance Gauge
+2%
1.1 ms
1.1 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress[u64, (1000, 16)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+2%
22 µs
21.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress[u8, (1000, 16)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
17.5 µs
17.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
true_count_arrow_buffer[16384]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+1%
2 µs
2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress[u32, (10000, 1024)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
34.2 µs
33.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
set_indices_vortex_buffer[1024]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+1%
2.3 µs
2.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
bitwise_and_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+1%
4.6 µs
4.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
bitwise_or_vortex_buffer[128]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+1%
4.7 µs
4.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
take_indices[(1000, 256)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
34.6 µs
34.2 µs
Uses the
Wall Time instrument
to collect wall time performance metrics.
cuda/bitpacked_patched_u16/unpack/10%[100M]
vortex-cuda/benches/bitpacked_cuda.rs::benches::benchmark_bitunpack_with_patches::cuda
CodSpeed Performance Gauge
+1%
721.8 µs
714 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress[u64, (1000, 4)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
28.2 µs
27.9 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress[u32, (10000, 256)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
36.6 µs
36.3 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
encode_varbinview[(10000, 32)]
vortex-array/benches/dict_compress.rs
CodSpeed Performance Gauge
+1%
1 ms
1 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress_alp[f32, (1000, 0.01, 1.0)]
encodings/alp/benches/alp_compress.rs
CodSpeed Performance Gauge
+1%
12.7 µs
12.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
take_indices[(1000, 16)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
38.3 µs
38 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress_alp[f32, (1000, 0.1, 1.0)]
encodings/alp/benches/alp_compress.rs
CodSpeed Performance Gauge
+1%
13.7 µs
13.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress_fsst[(1000, 16, 4)]
encodings/fsst/benches/fsst_compress.rs
CodSpeed Performance Gauge
+1%
82.6 µs
82 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
set_indices_vortex_buffer[2048]
vortex-buffer/benches/vortex_bitbuffer.rs
CodSpeed Performance Gauge
+1%
3.6 µs
3.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
decompress[u16, (1000, 256)]
encodings/runend/benches/run_end_compress.rs
CodSpeed Performance Gauge
+1%
17 µs
16.9 µs
1
2
3
4
5
...
49
Commits
Click on a commit to change the comparison range
Base
develop
0b790d2
-0.02%
bench: CUDA host-to-device copy modes
fc4cda4
2 days ago
by 0ax1
© 2026 CodSpeed Technology
Home
Terms
Privacy
Docs