Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data. For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.
Compare
Base
Search a run
Head
feat(standard-tests): add streaming tool call tests for chat models