Eventual-Inc
Daft
Latest Results
Merge remote-tracking branch 'upstream/main' into codex-sql-read-parquet-ignore-corrupt-files
jackylee-ch:codex-sql-read-parquet-ignore-corrupt-files
53 minutes ago
fix(distributed): use build_in_memory_task_builder in coalesce_tasks to avoid Flight panic
Liusixuuu:fix/into-partitions-flight-shuffle-panic
1 hour ago
feat(droid): Add `daft.datasets.droid` (#7089)

## Changes Made

Add an API for interacting with the DROID dataset via the `daft.datasets.droid` module. The current limitations include:

* We don't have any APIs to read the numerical data from the `trajectory.h5` files from each episode, such as sensor, observation, and state data.
* We can't read the curated version that's stored in the RLDS format (https://droid-dataset.github.io/droid/the-droid-dataset.html#-using-the-dataset). We should add a custom DataSource for that.
* Some other smaller TODOs are sprinkled throughout the file.

---------

Co-authored-by: Everett Kleven <145285237+everettVT@users.noreply.github.com>
main
2 hours ago
Merge branch 'main' into slade/droid
slade/droid
2 hours ago
Merge remote-tracking branch 'upstream/main' into codex-sql-read-parquet-ignore-corrupt-files
jackylee-ch:codex-sql-read-parquet-ignore-corrupt-files
3 hours ago
Merge remote-tracking branch 'upstream/main' into codex-sql-read-parquet-ignore-corrupt-files
jackylee-ch:codex-sql-read-parquet-ignore-corrupt-files
4 hours ago
feat(datasets): add generic HDF5 reader to daft.datasets

Add daft.datasets.hdf5 with two readers:

- read(): reads HDF5 files as a lazy DataFrame with one row per leading-dimension entry (e.g. one row per frame for episode-style data like EgoDex). Infers a per-file row axis, broadcasts per-file constants and root attributes onto every row, and builds the output column directly in pyarrow for a zero-copy cast to fixed-shape tensors.
- read_datasets(): exploration view listing every dataset (name, shape, dtype) per file, for deciding what to pass to read().

Export the module via daft.datasets so it is importable as `from daft.datasets import hdf5`.
add-hdf5-dataset-reader
5 hours ago
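The "one row per leading-dimension entry" behaviour described in the HDF5 reader commit above can be illustrated with a small, library-free sketch. This is not the `daft.datasets.hdf5` implementation: the function names `infer_row_count` and `explode_rows` are hypothetical, plain Python lists stand in for HDF5 datasets, and the rule "the most common leading length is the row axis" is an assumption made only for this example.

```python
from collections import Counter
from typing import Any


def infer_row_count(datasets: dict[str, Any]) -> int:
    """Guess the per-file row count (hypothetical heuristic for this sketch).

    A value counts as row-aligned when it is a list; the row count is the
    leading length shared by the largest number of lists.
    """
    lengths = Counter(len(v) for v in datasets.values() if isinstance(v, list))
    if not lengths:
        return 1  # only constants present: treat the file as a single row
    return lengths.most_common(1)[0][0]


def explode_rows(
    datasets: dict[str, Any], root_attrs: dict[str, Any]
) -> list[dict[str, Any]]:
    """Produce one dict per row, broadcasting constants and root attributes."""
    n_rows = infer_row_count(datasets)
    rows = []
    for i in range(n_rows):
        row = dict(root_attrs)  # root attributes are copied onto every row
        for name, value in datasets.items():
            if isinstance(value, list) and len(value) == n_rows:
                row[name] = value[i]  # row-aligned: take the i-th entry
            else:
                row[name] = value  # per-file constant: broadcast as-is
        rows.append(row)
    return rows


# Stand-in for one episode file: two per-frame datasets and one constant.
episode = {
    "frames": [[0, 0], [1, 1], [2, 2]],
    "timestamps": [0.0, 0.1, 0.2],
    "fps": 30,
}
rows = explode_rows(episode, {"task": "pick"})
print(len(rows))
print(rows[1])
```

A real reader would work on array shapes instead of Python lists and would build columnar output instead of per-row dicts; the sketch only shows the row-axis inference and broadcasting idea.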
Merge remote-tracking branch 'upstream/main' into codex-sql-read-parquet-ignore-corrupt-files
jackylee-ch:codex-sql-read-parquet-ignore-corrupt-files
6 hours ago
Latest Branches
CodSpeed Performance Gauge: 0%
feat(sql): support read_parquet ignore_corrupt_files
#7133
18 hours ago
559baf7
jackylee-ch:codex-sql-read-parquet-ignore-corrupt-files
CodSpeed Performance Gauge: 0%
fix(distributed): use build_in_memory_task_builder in coalesce_tasks to avoid Flight panic
#7154
2 hours ago
8f59804
Liusixuuu:fix/into-partitions-flight-shuffle-panic
CodSpeed Performance Gauge: 0%
feat(droid): Add `daft.datasets.droid`
#7089
3 hours ago
a6ac034
slade/droid