BerriAI
litellm
Blog
Docs
Changelog
Blog
Docs
Changelog
Overview
Branches
Benchmarks
Runs
fix(bedrock): avoid double-counting cache tokens in Anthropic Messages streaming usage
#25517
Merged
Comparing
litellm_bedrock-messages-cache-prompt-double-count
(
f0d2d26
) with
main
(
d0e347a
)
CodSpeed Performance Gauge
0%
Untouched
16
Benchmarks
Mode
CPU Simulation
Wall Time
Memory
Status
Untouched
16 total
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_model_info_anthropic
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
+2%
85.8 µs
83.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_token_counter_long_content
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
+2%
1.7 ms
1.7 ms
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_model_info_openai
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
+1%
89 µs
88 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_token_counter_multi_turn
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
572.7 µs
571.5 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_token_counter_simple_message
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
240.3 µs
239.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_llm_provider_anthropic
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
146.8 µs
146.8 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_token_counter_with_tools
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
418 µs
418.7 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_token_counter_raw_text
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
187.1 µs
187.6 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_llm_provider_with_prefix
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
141.8 µs
142.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_llm_provider_azure
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
148.8 µs
149.2 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_model_cost_key_case_insensitive
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
87.3 µs
87.7 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_cost_per_token_openai
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
543.2 µs
545.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_model_cost_key_exact_match
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
85 µs
85.3 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_cost_per_token_anthropic
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
0%
541.8 µs
544.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_llm_provider_openai
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
-1%
142.6 µs
143.4 µs
Uses the
CPU Simulation instrument
to collect CPU performance metrics.
test_get_model_info_with_provider
tests/benchmarks/test_benchmarks.py
CodSpeed Performance Gauge
-1%
85.2 µs
85.8 µs
Commits
Click on a commit to change the comparison range
Base
main
d0e347a
+0.11%
fix(bedrock): avoid double-counting cache tokens in Anthropic Messages streaming usage
f0d2d26
7 days ago
by Sameerlite
© 2026 CodSpeed Technology
Home
Terms
Privacy
Docs