Commits
Click on a commit to change the comparison rangePerform split on all files
Refactor into accumulator struct
Rename
Further simplification of accumulator logic
Cleanup into separate accumulator and accumulator context
Account for potentially null TableMetadata
Refactor into Iterator
Refactor into state machine
Convert Parquet file iterator to state machine as well
small cleanup
Reorganization into a separate module
Cleanup to extend this easier for using catalog information
Perform 16 Parquet metadata fetches in parallel
perf: reduce calls to ScanTask::estimate_in_memory_size
Adds unit test
Adds more unit tests
Add feature flag DAFT_ENABLE_AGGRESSIVE_SCANTASK_SPLITTING
Add a benchmarking script
Trigger data materialization in benchmark
Refactors to ParquetFileSplitter to not use state machine
Big refactor to split into multiple files and iterators
Add better docs
Refactor splitter code
nit naming
Refactor Fetchable
reordering for readability
Simplify State logic for FetchParquetMetadataByWindows
impl IntoIterator for SplittableScanTaskRef by propagating the config ref
docstrings
Removed advance_state for more explicit naming
Remove trait