
Understanding that your code is slow is one thing. Understanding why it's slow is what lets you fix it. Walltime profiling now automatically collects hardware performance counters during execution, giving you deep insights into CPU cycles, instruction counts, memory operations, and cache behavior.
Performance counters showing cache behavior and memory traffic
Every walltime profile now includes comprehensive hardware metrics that help you pinpoint performance bottlenecks:

See exactly how your memory accesses are served with a detailed breakdown:
The visual memory access pattern gauge shows at a glance where your code spends its time:
Combined with the flame graph, you can now trace performance issues from high-level function calls down to specific memory access patterns causing slowdowns.
Performance counters are automatically collected when running benchmarks on CodSpeed Macro Runners with walltime profiling enabled.
Learn more about Walltime Profiling.