# Profiling

URL: /docs/advanced/profiling/
Section: advanced
Tags: advanced, performance, profiling

--------------------------------------------------------------------------------

Profiling Kida includes compiler-emitted profiling instrumentation that tracks block render times, macro calls, include counts, and filter usage. Profiling is opt-in — it has zero overhead when disabled because the accumulator check (get_accumulator() returns None) short-circuits immediately. from kida.render_accumulator import profiled_render Quick Start from kida import Environment, FileSystemLoader from kida.render_accumulator import profiled_render env = Environment(loader=FileSystemLoader(&quot;templates/&quot;)) template = env.get_template(&quot;page.html&quot;) # Normal render — no overhead html = template.render(page=page, site=site) # Profiled render — opt-in metrics with profiled_render() as metrics: html = template.render(page=page, site=site) print(metrics.summary()) Output: { &quot;total_ms&quot;: 12.5, &quot;blocks&quot;: { &quot;content&quot;: {&quot;ms&quot;: 8.2, &quot;calls&quot;: 1}, &quot;nav&quot;: {&quot;ms&quot;: 2.1, &quot;calls&quot;: 1}, &quot;sidebar&quot;: {&quot;ms&quot;: 1.8, &quot;calls&quot;: 1}, }, &quot;macros&quot;: {&quot;render_card&quot;: 15, &quot;format_date&quot;: 8}, &quot;includes&quot;: {&quot;partials/sidebar.html&quot;: 1}, &quot;filters&quot;: {&quot;escape&quot;: 45, &quot;truncate&quot;: 12, &quot;date&quot;: 8}, } How It Works profiled_render() creates a RenderAccumulator and stores it in a ContextVar During rendering, compiler-emitted instrumentation checks get_accumulator() If an accumulator exists, metrics are recorded; otherwise the check is a no-op After the with block exits, the accumulator is removed from the ContextVar Because ContextVar provides thread-local isolation, profiling one render call does not affect concurrent renders. RenderAccumulator The accumulator collects four categories of metrics: Block Timings Every block render is timed. If a block renders multiple times (e.g., in a loop), durations are summed and calls counted: with profiled_render() as metrics: html = template.render(**ctx) for name, timing in metrics.block_timings.items(): print(f&quot;{name}: {timing.duration_ms:.2f}ms ({timing.call_count} calls)&quot;) Macro Calls Macro invocations are counted, including cross-template macro imports: metrics.macro_calls # {&quot;render_card&quot;: 15, &quot;format_date&quot;: 8} Include Counts Track how many times each template is included: metrics.include_counts # {&quot;partials/sidebar.html&quot;: 1, &quot;partials/card.html&quot;: 15} Filter Usage Every filter application is counted: metrics.filter_calls # {&quot;escape&quot;: 45, &quot;truncate&quot;: 12, &quot;upper&quot;: 3} Manual Block Timing Use timed_block() to time custom code sections. It is a no-op when profiling is disabled: from kida.render_accumulator import timed_block with timed_block(&quot;data_fetch&quot;): data = fetch_expensive_data() with timed_block(&quot;render&quot;): html = template.render(data=data) Recording Metrics Manually The accumulator exposes methods for recording custom metrics: from kida.render_accumulator import get_accumulator acc = get_accumulator() if acc is not None: acc.record_block(&quot;custom_section&quot;, duration_ms=5.2) acc.record_macro(&quot;my_macro&quot;) acc.record_include(&quot;partials/widget.html&quot;) acc.record_filter(&quot;my_filter&quot;) Integration Patterns Finding Slow Blocks with profiled_render() as metrics: html = template.render(**ctx) # Sort blocks by render time (summary already sorts descending) summary = metrics.summary() for name, data in summary[&quot;blocks&quot;].items(): if data[&quot;ms&quot;] &gt; 5.0: print(f&quot;SLOW: {name} took {data[&#x27;ms&#x27;]}ms ({data[&#x27;calls&#x27;]} calls)&quot;) Comparing Renders def benchmark_template(template, contexts, runs=10): &quot;&quot;&quot;Average metrics across multiple renders.&quot;&quot;&quot; totals = [] for ctx in contexts[:runs]: with profiled_render() as metrics: template.render(**ctx) totals.append(metrics.total_duration_ms) avg = sum(totals) / len(totals) print(f&quot;Average render: {avg:.2f}ms&quot;) Build System Integration from kida.render_accumulator import profiled_render slow_templates = [] for template, context in build_queue: with profiled_render() as metrics: html = template.render(**context) if metrics.total_duration_ms &gt; 50: slow_templates.append((template.name, metrics.total_duration_ms)) if slow_templates: print(&quot;Slow templates:&quot;) for name, ms in sorted(slow_templates, key=lambda x: x[1], reverse=True): print(f&quot; {name}: {ms:.1f}ms&quot;) API Reference Functions Function Signature Description profiled_render() () -&gt; Iterator[RenderAccumulator] Context manager for profiled rendering timed_block() (name: str) -&gt; Iterator[None] Time a code section (no-op when disabled) get_accumulator() () -&gt; RenderAccumulator | None Get current accumulator or None RenderAccumulator Property / Method Type Description block_timings dict[str, BlockTiming] Block name to timing data macro_calls dict[str, int] Macro name to call count include_counts dict[str, int] Template name to include count filter_calls dict[str, int] Filter name to call count total_duration_ms float Total render duration (property) record_block() (name, duration_ms) -&gt; None Record a block render record_macro() (name) -&gt; None Record a macro invocation record_include() (template_name) -&gt; None Record an include record_filter() (name) -&gt; None Record a filter usage summary() () -&gt; dict Get sorted summary of all metrics BlockTiming Field Type Description name str Block name duration_ms float Total render time in milliseconds call_count int Number of renders All classes are importable from kida.render_accumulator. See Also Performance — Benchmark methodology Static Analysis — Block-level analysis Block Caching — Cache informed by profiling

--------------------------------------------------------------------------------

Metadata:
- Word Count: 617
- Reading Time: 3 minutes