Commits
Click on a commit to change the comparison rangefeat: parallelize sync generate method for improved LLM throughput
- Replace sequential loop with thread-pool executor mapping for multi-input processing
- Preserve ordering, callback behavior, and error propagation
- Add fast path for single input to avoid unnecessary overhead
- Use get_executor_for_config context manager for proper resource management
This optimization improves throughput when processing multiple prompts
without breaking existing functionality or changing the API.