Eventual-Inc
Daft
Latest Results
Merge remote-tracking branch 'origin/main' into perf/inline-agg-product

# Conflicts:
#   src/daft-recordbatch/src/ops/inline_agg.rs
BABTUNA:perf/inline-agg-product
2 hours ago
Address review feedback for bround/greatest/least

- numeric.py: use raw docstring for unhex() to satisfy ruff D301 (escape sequence \x0f in a regular docstring)
- bround.rs: avoid multiplying by non-exact powers of 10 for negative precisions. precision >= 0 keeps multiply-then-divide; precision < 0 now divides-then-multiplies by 10^|p|, eliminating the second rounding step that biased ties (e.g. 250 with d=-2). Add regression test plus an Int32 decimals literal test for BRoundArgs<Option<i32>>.
- greatest.rs: deduplicate Greatest/Least via a private GreatestLeastKind trait carrying KEEP_GREATER/NAME/DOCSTRING. Both ScalarUDF impls now share compare_inputs through impl_call::<Self>; public unit structs and typetag::serde tags are preserved for backwards compatibility.
XuQianJin-Stars:feat/spark-math-functions
4 hours ago
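The precision handling described in the bround.rs item can be sketched in Python. This is only an illustration of the multiply-then-divide versus divide-then-multiply split, not Daft's Rust implementation; the function name `bround_sketch` is hypothetical, and it relies on Python's built-in `round()`, which rounds ties to the nearest even integer.

```python
def bround_sketch(value: float, precision: int) -> float:
    """Round half to even at the given decimal precision (illustrative only)."""
    if precision >= 0:
        # Non-negative precision: scale up, round ties to even, scale back down.
        scale = 10.0 ** precision
        return round(value * scale) / scale
    # Negative precision: divide by 10^|p| first, so the tie is decided on the
    # quotient, then multiply back by the same power of 10.
    scale = 10.0 ** (-precision)
    return round(value / scale) * scale


if __name__ == "__main__":
    print(bround_sketch(250, -2))  # the tie case named in the commit message
    print(bround_sketch(350, -2))
```

With d=-2, 250 sits exactly halfway between 200 and 300, so half-to-even rounding picks 200, while 350 rounds to 400.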
ci(pr-test-suite): skip corrupt .profraw files before llvm-cov report

Coverage CI was failing in the post-test step with:

    warning: ./target/daft-coverage-*.profraw: invalid instrumentation profile data (file header is corrupt)
    error: no profile can be merged
    error: failed to merge profile data ... (exit status: 1)

This happens when an instrumented worker is killed (e.g. SIGTERM/OOM) and leaves a partially-written .profraw behind. llvm-profdata merge then aborts the whole job, even though all pytest tests passed.

Probe each daft-coverage-*.profraw with llvm-profdata show before running 'cargo llvm-cov report' and remove unreadable ones, so the report step can proceed with the remaining valid profiles.
XuQianJin-Stars:fix/hf-test-connect-timeout
4 hours ago
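The probe-and-remove step described in this commit could look roughly like the following Python sketch. The actual CI change is presumably a shell step in the workflow; the function names here are hypothetical, and the only external command assumed is `llvm-profdata show <file>`, which exits non-zero when it cannot read a profile.

```python
import subprocess
from pathlib import Path
from typing import Callable, Iterable, List


def profile_is_readable(path: Path) -> bool:
    """Return True if `llvm-profdata show` can parse the profile."""
    result = subprocess.run(
        ["llvm-profdata", "show", str(path)],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
        check=False,
    )
    return result.returncode == 0


def drop_unreadable_profiles(
    paths: Iterable[Path],
    probe: Callable[[Path], bool] = profile_is_readable,
) -> List[Path]:
    """Delete profiles that fail the probe and return the ones that pass."""
    kept = []
    for path in paths:
        if probe(path):
            kept.append(path)
        else:
            # A partially written profile would make the merge step abort.
            path.unlink(missing_ok=True)
    return kept
```

A caller would pass something like `Path("target").glob("daft-coverage-*.profraw")` before invoking the report step. The probe is injectable so the filtering logic can be exercised without LLVM tools installed.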
ci: retrigger flaky HF 429 on Rust huggingface test
BABTUNA:perf/packed-symbol-groupby
4 hours ago
Merge remote-tracking branch 'origin/main' into feat/temporal-unix-extractors
BABTUNA:feat/temporal-unix-extractors
5 hours ago
test(hf): retry transient network errors in HF integration tests

The CI failure for test_read_huggingface_multi_split_dataset showed an httpx.ConnectTimeout (TLS handshake operation timed out) raised from the fallback path, which calls into the datasets library and hits the HF Hub directly. That path was not wrapped in call_with_hf_retry, so any transient network blip became a hard test failure.

Changes:
- Broaden _is_rate_limit_error to also recognise transient network errors (connect/read/handshake timeouts, connection resets, 5xx gateway responses) in addition to HTTP 429 rate-limit responses. The predicate now matches both exception messages and exception class names, so bare ConnectTimeout()/ReadTimeout() instances are caught.
- Wrap the HF-Hub-touching calls in the fallback paths of test_read_huggingface_fallback_on_400_error and test_read_huggingface_multi_split_dataset (both daft.read_huggingface and load_dataset) with call_with_hf_retry so transient failures are retried and ultimately cause a pytest.skip rather than a hard failure, matching the behaviour already used for the main path.
XuQianJin-Stars:fix/hf-test-connect-timeout
5 hours ago
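A minimal sketch of the retry pattern this commit describes, assuming a predicate that inspects both the exception class name and its message. The names `is_transient_error` and `call_with_retry`, the marker list, and the retry count are illustrative rather than the test suite's actual helpers, and `unittest.SkipTest` stands in for `pytest.skip` so the sketch needs only the standard library.

```python
import time
import unittest

# Lower-case substrings treated as transient (an assumed, non-exhaustive list).
TRANSIENT_MARKERS = (
    "429", "rate limit", "timeout", "timed out",
    "connection reset", "502", "503", "504",
)


def is_transient_error(exc: BaseException) -> bool:
    """Match on class name and message, so bare ConnectTimeout() is caught."""
    haystack = f"{type(exc).__name__} {exc}".lower()
    return any(marker in haystack for marker in TRANSIENT_MARKERS)


def call_with_retry(fn, attempts: int = 3, delay_s: float = 0.0):
    """Retry transient failures; skip the test if they persist."""
    last_exc = None
    for _ in range(attempts):
        try:
            return fn()
        except Exception as exc:
            if not is_transient_error(exc):
                raise  # real failures still fail the test
            last_exc = exc
            time.sleep(delay_s)
    raise unittest.SkipTest(f"transient failure after {attempts} attempts: {last_exc!r}")
```

Matching on the class name matters because an exception raised without a message has an empty string representation, so a message-only predicate would miss it.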
chore: regenerate Cargo.lock after rebase
XuQianJin-Stars:feature/hash-crypto-functions
5 hours ago
fix(goosefs): forward retry/timeout/concurrency to OpenDAL and sparse multiline display

Address review feedback on GoosefsConfig:
- P1: to_opendal_config now forwards max_retries, retry_timeout_ms, connect_timeout_ms, read_timeout_ms, max_concurrent_requests and max_connections_per_io_thread into the returned config map (only when non-default), so user-provided values are no longer silently dropped at the Daft layer.
- P2: multiline_display now gates every numeric/boolean field with an if value != default guard, mirroring how auth_username is handled. A default-constructed GoosefsConfig produces an empty multiline view, and IOConfig::multiline_display omits the GooseFS config = { ... } line entirely in that case.

Adds regression tests covering both behaviours.
XuQianJin-Stars:feat/goosefs-support
5 hours ago
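The "only when non-default" behaviour in P1 and P2 can be illustrated with a small Python dataclass. This is a sketch of the pattern, not the Rust GoosefsConfig: the class name is hypothetical, the field names are taken from the commit message, the default values are placeholders, and using field names as map keys is an assumption about the OpenDAL config keys.

```python
from dataclasses import dataclass, fields
from typing import Dict, List


@dataclass(frozen=True)
class GoosefsConfigSketch:
    # Placeholder defaults; the real defaults are not stated in the commit.
    max_retries: int = 3
    retry_timeout_ms: int = 30_000
    connect_timeout_ms: int = 10_000
    read_timeout_ms: int = 30_000
    max_concurrent_requests: int = 8
    max_connections_per_io_thread: int = 8

    def to_config_map(self) -> Dict[str, str]:
        """Forward only the values that differ from their defaults."""
        return {
            f.name: str(getattr(self, f.name))
            for f in fields(self)
            if getattr(self, f.name) != f.default
        }

    def multiline_display(self) -> List[str]:
        """One line per non-default field; empty for a default config."""
        return [f"{key} = {value}" for key, value in self.to_config_map().items()]
```

A default-constructed instance yields an empty map and an empty display, which is what lets the enclosing display omit the section entirely.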
Latest Branches
CodSpeed Performance Gauge
0%
perf(inline-agg): add Product accumulator type
#6975
3 hours ago
6eb2732
BABTUNA:perf/inline-agg-product
CodSpeed Performance Gauge
0%
feat: add Spark-compatible math functions (bround, greatest, least, hex, unhex)
#7122
4 hours ago
7d4ffa0
XuQianJin-Stars:feat/spark-math-functions
CodSpeed Performance Gauge
0%
test(hf): retry transient network errors in HF integration tests
#7120
5 hours ago
6166b2c
XuQianJin-Stars:fix/hf-test-connect-timeout
© 2026 CodSpeed Technology