BerriAI
litellm
fix(streaming): word-sliced cache replay for streaming completions
#25670
Comparing michelligabriele:fix/cached-response-streaming-cadence (199c7a3) with main (72a461b)
Performance gauge: 0%
Untouched: 16 benchmarks (16 total)
Mode: CPU Simulation
Uses the CPU Simulation instrument to collect CPU performance metrics.
Benchmarks (all in tests/benchmarks/test_benchmarks.py). Each row gives the benchmark, the reported change, and the two timings in the order they appear in the report.

Benchmark                                   Change   Timings
test_get_model_info_with_provider           +1%      81.1 µs    80 µs
test_token_counter_simple_message           +1%      238.8 µs   236.1 µs
test_get_model_info_anthropic               +1%      79.8 µs    79 µs
test_token_counter_raw_text                 +1%      184.7 µs   183.1 µs
test_token_counter_with_tools               0%       415.5 µs   413.6 µs
test_cost_per_token_openai                  0%       542 µs     539.6 µs
test_get_llm_provider_azure                 0%       144.1 µs   143.6 µs
test_token_counter_long_content             0%       1.7 ms     1.7 ms
test_get_llm_provider_with_prefix           0%       137 µs     136.7 µs
test_get_model_info_openai                  0%       81 µs      80.9 µs
test_cost_per_token_anthropic               0%       540.5 µs   540 µs
test_get_model_cost_key_exact_match         0%       79.7 µs    79.6 µs
test_get_model_cost_key_case_insensitive    0%       82.1 µs    82.1 µs
test_get_llm_provider_openai                0%       137.7 µs   138 µs
test_get_llm_provider_anthropic             0%       141.2 µs   141.5 µs
test_token_counter_multi_turn               0%       565.7 µs   567.2 µs
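The rounded percentages above can be sanity-checked against the two timings listed for each benchmark. The following is a minimal sketch under stated assumptions, not the report's own method: it assumes the first timing is the base measurement and the second is the head measurement, and that the change is expressed relative to the second value. The function name `relative_change` and the formula are hypothetical illustrations; the report does not state how its percentages are derived.

```python
def relative_change(first_us: float, second_us: float) -> float:
    """Return how much smaller the second timing is than the first,
    as a percentage of the second timing.

    Assumption (not confirmed by the report): first = base, second = head,
    so a positive result means the head measurement is faster.
    """
    return (first_us - second_us) / second_us * 100.0


# Timings in microseconds, copied from two entries in the report.
samples = {
    "test_get_model_info_with_provider": (81.1, 80.0),
    "test_token_counter_multi_turn": (565.7, 567.2),
}

for name, (first, second) in samples.items():
    # Print with a sign so small regressions and improvements are easy to tell apart.
    print(f"{name}: {relative_change(first, second):+.2f}%")
```

Under these assumptions, differences of this size round to the whole-number percentages shown in the report, which is consistent with every benchmark being classed as untouched.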
Commits
Base: main (72a461b)
199c7a3: fix(streaming): word-sliced cache replay for stream=true cache hits (+0.38%), 16 days ago, by michelligabriele