feat(dd): add benchmark suite for O_DIRECT optimization

- Create dd's first benchmark suite using the divan framework
- Benchmark block sizes of 4K, 8K, 64K, and 1M
- Cover several dd scenarios: default, partial copy, skip, and seek
- Measure impact of separate input/output block sizes
- All benchmarks use status=none to avoid output noise
- Benchmarks measure the effect of the O_DIRECT buffer alignment optimization
- Follow existing uutils benchmark patterns and conventions
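
For context, O_DIRECT I/O generally requires the user buffer's start address
to be aligned to the device's block size. The sketch below is illustrative
only and is not the code in dd: `AlignedBuf` is a hypothetical name, and the
alignment value is chosen by the caller as an assumption (4096 is a common
choice). It uses only the standard library to allocate a zeroed buffer with a
requested alignment.

```rust
use std::alloc::{alloc_zeroed, dealloc, handle_alloc_error, Layout};

/// Hypothetical helper (not part of dd): a zeroed heap buffer whose start
/// address is a multiple of the requested alignment.
struct AlignedBuf {
    ptr: *mut u8,
    layout: Layout,
}

impl AlignedBuf {
    /// `align` must be a non-zero power of two; `size` must be non-zero.
    fn new(size: usize, align: usize) -> Self {
        assert!(size > 0, "zero-sized buffers are not supported in this sketch");
        let layout = Layout::from_size_align(size, align)
            .expect("align must be a non-zero power of two");
        // SAFETY: `layout` has a non-zero size, as checked above.
        let ptr = unsafe { alloc_zeroed(layout) };
        if ptr.is_null() {
            handle_alloc_error(layout);
        }
        Self { ptr, layout }
    }

    /// Borrow the buffer contents as a byte slice.
    fn as_slice(&self) -> &[u8] {
        // SAFETY: `ptr` is valid for `layout.size()` bytes and was zero-initialized.
        unsafe { std::slice::from_raw_parts(self.ptr, self.layout.size()) }
    }
}

impl Drop for AlignedBuf {
    fn drop(&mut self) {
        // SAFETY: `ptr` was allocated by `alloc_zeroed` with this exact layout.
        unsafe { dealloc(self.ptr, self.layout) }
    }
}
```

For example, `AlignedBuf::new(64 * 1024, 4096)` would give a 64K buffer whose
address is a multiple of 4096, matching one of the block sizes benchmarked
above.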