> ## Documentation Index > Fetch the complete documentation index at: https://codspeed.io/docs/llms.txt > Use this file to discover all available pages before exploring further. # Performance Profiling and Flame Graphs > Debug performance issues with detailed flame graphs and execution profiles generated by CodSpeed's instrumentation Cover Image

CodSpeed's profiling capabilities provide deep insights into your application's performance through detailed flame graphs and execution traces. Flame graphs are available for benchmarks using the [CPU Simulation instrument](/instruments/cpu) and the [Walltime instrument](/instruments/walltime). ## Reading Flame Graphs Flame graphs are a visualization tool for profiling software. They provide a graphical representation of your program's execution, making it easier to understand the runtime complexities involved. Let's start with an example: Flame graph Example

Here, each rectangle represents a function. The width of the rectangle is proportional to the amount of time spent in that function. The wider the rectangle, the more time was spent in that function. The vertical axis represents the call depth (a.k.a. stack depth), which represents the call hierarchy. For this example, here is the call hierarchy: 1. The root caller is the `app` function. 2. `app` calls the `init`, `handleRequest` and `terminate` functions. 3. `handleRequest` calls both `authenticateUser` and `processData`. 4. `processData` calls `foo` which in turn calls `bar`. ### Aggregated function calls Functions calls are aggregated, so if a function is called multiple times, the time spent in all calls is aggregated into a single block. Thus, the following code: ```python theme={null} def bar(): pass def foo(): bar() bar() def main(): foo() foo() ``` Will generate the following flame graph: Aggregated functions in a flame graph

Where the `foo` function is called twice and the `bar` function is called 4 times, but the time spent is aggregated into a single block for each function. ### Self-costs In the previous example, we could see the global cost of each function call quite clearly. However, it can be tricky to find out how much time was spent within the function itself. Self costs in flame graphs

In the above illustration, we can see two types of self-costs: * **(implicit) self-costs**: the time spent in the function itself is the whole width of the rectangle since it doesn't call any other functions. * **self-costs**: the self-cost here is visible as the space not occupied by the children of the block. **Self-costs in interpreted languages** In Python or Node.js, the self-cost is the time spent in the function itself, but also the time spent by the interpreter. This means that a function will always have a self-cost, even if the function does nothing. If we change a bit the example, adding a lot of computation directly in the main function: ```python {2-3} theme={null} def main(): for i in range(100): # do some computation foo() foo() ``` Then the flame graph would look like this, with the self-cost of `main` being much bigger than before: Bigger self costs in flame graphs

## Viewing Flame Graphs On the pull request page, you can access the flame graphs of a benchmark by expanding it. Example of flame graphs on a pull request page

Example of flame graphs on a pull request page

Three types of flame graphs are available: * **Base**: flame graph of the [benchmark base run](/features/understanding-the-metrics/#baseline-report-selection) * **Head**: flame graph of the benchmark run from the latest commit of the pull request * **Diff**: difference between the head and the base flame graphs **FFI Support** If you're using [Foreign Function Interface](https://en.wikipedia.org/wiki/Foreign_function_interface), typically calling C/C++/Rust code from Python or Node.js, make sure to generate debug symbols for the foreign functions so that you can see them in the flame graph. ## Inspector Hover any bar to open the span details. This panel shows you what the function is, where it comes from, and how its time is spent. Flamegraph inspector

* **Metadata**: Function name, source file, code origin * **Self time**: Time spent in the function body only, excluding child calls. * **Total time**: Time spent in the function including all its children. The details of metrics displayed depend on the instrument used to collect the profiling data, head over to the respective instrument documentation for more information: * [CPU Simulation metrics](/instruments/cpu#inspector-metrics) * [Walltime metrics](/instruments/walltime#inspector-metrics) ## Color modes ### By Origin Colors by code origin: User, Library, System, or Unknown. Useful to separate your code from dependencies and the kernel. Flamegraph color mode example: By Origin

Flamegraph color mode example: By Origin

### Differential Compares Base vs Head and colors spans by change: slower, faster, added, or removed. Ideal for scanning regressions and wins after a commit. Flamegraph color mode example: Diff

Colors reflect the status of the span compared to the base run: * **Slower**: The span is slower than in the base run. * **Faster**: The span is faster than in the base run. * **Added**: The span has been added in the head run. * **Removed**: The span is removed in the head run. ### By Bottleneck Colors each span by the dominant bound on its self time: instruction-bound, cache-bound, memory-bound, or system-bound. Fast way to see what is blocking work. Flamegraph color mode example: By Bottleneck

Flamegraph color mode example: By Bottleneck

### By Function Colors spans by function symbol so identical functions share a color, no matter where they are called. Helps spot hot functions across call sites. Flamegraph color mode example: By Function

Flamegraph color mode example: By Function

Function colors will be the same across benchmarks, projects, and runs. So you can easily recognize the same function across different runs. ### System Calls toggle Include kernel and low-level runtime contributions. Off keeps focus on application and library code. ## Function list Upon expanding a flamegraph, you can access the function list. And dive in the details of each span.

## Next Steps Learn how to enable CPU Simulation to generate flame graphs Learn how to enable Walltime instrumentation to generate flame graphs Set up automated checks to catch performance issues early Create comprehensive benchmarks for your codebase