vortex-data
vortex
Blog
Docs
Changelog
Blog
Docs
Changelog
Overview
Branches
Benchmarks
Runs
Performance History
Latest Results
feat[vortex-cuda]: GPU FSST decompression kernel This commit implements on-GPU decompression of the existing FSST encoding. This kernel achieves ~42% max throughput utilization as compared to the `throughput_cuda` benchmark on a DGX spark. CPU work is required to compute the output offsets. The core performance win is buffering up to 24 bytes of decompressed data in three u64 registers and emitting the widest aligned stores possible up to u128 (st.global.v2.u64). The 256-entry symbol table (⤠2 KB) is read directly from global memory. Staging it into shared memory measured ~3% slower at 10M rows and ~15% slower at 1M rows. The hypothesis is that L1 already holds the table after a few iterations and the explicit shared copy adds bank-conflict latency on the warp-divergent symbols[code] reads; the gap is wider at 1M because the kernel is less bandwidth-bound there. Further optimizations would require an encoding change. Splits-style intra-string parallelism (one GPU thread per ~32-compressed-byte chunk instead of per-string) was prototyped on top of this kernel and measured an additional +30% kernel throughput at 1M clickbench URLs, +26% at 5M, +12% at 10M. Four kernel variants are generated for the unsigned widths of codes_offsets (u8/u16/u32/u64); signed integer ptypes are reinterpreted as their unsigned equivalent on the Rust side, so the bit pattern is preserved without copying. Signed-off-by: Alfonso Subiotto Marques <alfonso.subiotto@polarsignals.com>
asubiotto/fsst-cuda
13 minutes ago
skip[fuzz]: less jobs to run for fuzzer (#7788) stop oom Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>
develop
41 minutes ago
Merge branch 'develop' into ji/less-jobs-fuzz
ji/less-jobs-fuzz
1 hour ago
Fix weird signature of with_slots functions (#7758) with_slots doesn't need to pass the original ref to the arrayinner methods as unique ownership arc is handled earlier Signed-off-by: Robert Kruszewski <github@robertk.io>
develop
2 hours ago
Remove bad action (#7786) ## Summary Accidentally merged this... ## Testing N/A Signed-off-by: Connor Tsui <connor.tsui20@gmail.com>
develop
11 hours ago
Update dependency globals to v17.6.0 (#7768) This PR contains the following updates: | Package | Change | [Age](https://docs.renovatebot.com/merge-confidence/) | [Confidence](https://docs.renovatebot.com/merge-confidence/) | |---|---|---|---| | [globals](https://redirect.github.com/sindresorhus/globals) | [`17.5.0` ā `17.6.0`](https://renovatebot.com/diffs/npm/globals/17.5.0/17.6.0) |  |  | --- > [!WARNING] > Some dependencies could not be looked up. Check the [Dependency Dashboard](../issues/357) for more information. --- ### Release Notes <details> <summary>sindresorhus/globals (globals)</summary> ### [`v17.6.0`](https://redirect.github.com/sindresorhus/globals/compare/v17.5.0...6b15870f1c08b60b5b57afe45a703d9ed0be39bc) [Compare Source](https://redirect.github.com/sindresorhus/globals/compare/v17.5.0...v17.6.0) </details> --- ### Configuration š **Schedule**: (UTC) - Branch creation - Between 12:00 AM and 03:59 AM, only on Monday (`* 0-3 * * 1`) - Automerge - At any time (no schedule defined) š¦ **Automerge**: Enabled. ā» **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox. š **Ignore**: Close this PR and you won't be reminded about this update again. --- - [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check this box --- This PR was generated by [Mend Renovate](https://mend.io/renovate/). View the [repository job log](https://developer.mend.io/github/vortex-data/vortex). <!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiI0My4xNTkuMiIsInVwZGF0ZWRJblZlciI6IjQzLjE1OS4yIiwidGFyZ2V0QnJhbmNoIjoiZGV2ZWxvcCIsImxhYmVscyI6WyJjaGFuZ2Vsb2cvY2hvcmUiXX0=--> Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
develop
11 hours ago
Update release-drafter/release-drafter action to v7.2.1
renovate/all-patch-updates
12 hours ago
Update anthropics/claude-code-action digest to 2cc1ac1
renovate/anthropics-claude-code-action-digest
12 hours ago
Latest Branches
CodSpeed Performance Gauge
+10%
feat[vortex-cuda]: GPU FSST decompression kernel
#7776
5 days ago
46670e8
asubiotto/fsst-cuda
CodSpeed Performance Gauge
-11%
skip[fuzz]: less jobs to run for fuzzer
#7788
1 hour ago
a6babab
ji/less-jobs-fuzz
CodSpeed Performance Gauge
-19%
Update release-drafter/release-drafter action to v7.2.1
#7763
12 hours ago
b27561b
renovate/all-patch-updates
Ā© 2026 CodSpeed Technology
Home
Terms
Privacy
Docs