bench: gate CodSpeed-unstable canonicalization benches from simulation

A small set of microbenchmarks report false-positive regressions in nearly
every PR. Their CodSpeed CPU-simulation instruction count is dominated by
output-buffer allocation and glibc `memcpy`/`memmove` (whose `ifunc`-selected
implementation varies across runner images) rather than by Vortex compute, so
they move bidirectionally by 10-90% for unchanged code and CodSpeed flags
"different runtime environments" on the comparisons. They cannot be stabilized
under simulation, so per `docs/developer-guide/benchmarking.md` they are gated
with `#[cfg(not(codspeed))]` and remain available via local `cargo bench`.

Gated from CodSpeed (kept for local runs):
- alp_compress.rs: `decompress_rd` (decode-to-canonical; moved in 7/9 sampled
  PRs, 842-1025 us for identical code). `compress_rd` (encode, compute-bound,
  never flaky) is kept.
- chunk_array_builder.rs: `chunked_varbinview_*` (string canonicalization,
  memcpy-bound; flaky in 6/9 PRs) and `chunked_bool_canonical_into` (also
  below the ~16-35 us noise floor, ~2x swings). The compute-bound
  `chunked_opt_bool_*` and `chunked_constant_*` benches are kept.
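
For illustration, the gating pattern can be sketched as below. This is a minimal, dependency-free sketch rather than the actual bench code: the function bodies and the `registered_benches` helper are hypothetical stand-ins for the real bench harness, and it assumes the `codspeed` cfg is set only by the `cargo codspeed` build.

```rust
// Hypothetical stand-ins for real bench functions; the actual suites use a
// bench harness, omitted here so the sketch has no dependencies.

/// Compute-bound bench: compiled for CodSpeed and for local runs.
fn compress_rd() -> &'static str {
    "compress_rd"
}

/// Memcpy-bound bench: compiled only when the `codspeed` cfg is NOT set
/// (assumption: only the `cargo codspeed` build sets it), so it remains
/// available under a plain local `cargo bench`.
#[cfg(not(codspeed))]
fn decompress_rd() -> &'static str {
    "decompress_rd"
}

/// Hypothetical helper: lists the benches compiled into this build.
fn registered_benches() -> Vec<&'static str> {
    let mut names = vec![compress_rd()];
    // The gated bench is referenced under the same cfg, so a build with the
    // `codspeed` cfg set still compiles without it.
    #[cfg(not(codspeed))]
    {
        names.push(decompress_rd());
    }
    names
}

fn main() {
    for name in registered_benches() {
        println!("bench: {name}");
    }
}
```

In a plain build both names are listed. In a build with the `codspeed` cfg set, only `compress_rd` would be compiled in, which matches the intent of keeping compute-bound benches on CodSpeed and memcpy-bound ones local.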
Verified: both suites build and run under `cargo codspeed` (Simulation mode),
the gated benches are excluded while the kept benches still execute, and the
local `cargo bench` path, `cargo fmt`, and `cargo clippy` are clean.

Signed-off-by: Claude <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_01GXdjWYp7AbSKwn2bw6GYsf