# Observability

URL: /docs/deployment/observability/
Section: deployment
Tags: observability, health-check, metrics, tracing, request-id

--------------------------------------------------------------------------------

Overview Pounce provides three observability primitives out of the box — health checks, request IDs, and Prometheus-compatible metrics — with zero external dependencies. Health Checks Enable a built-in health endpoint that responds before the ASGI app is invoked: pounce myapp:app --health-check-path /health Response { &quot;status&quot;: &quot;ok&quot;, &quot;uptime_seconds&quot;: 3600.1, &quot;worker_id&quot;: 0, &quot;active_connections&quot;: 42 } The response includes Cache-Control: no-cache, no-store to prevent caching by intermediaries. Characteristics Fast — Responds at the worker level, before ASGI dispatch Silent — Excluded from access logs to reduce noise Independent — Works even if your ASGI app is unhealthy Lightweight — JSON payload with status, uptime, worker ID, and connection count Kubernetes Integration apiVersion: v1 kind: Pod spec: containers: - name: app livenessProbe: httpGet: path: /health port: 8000 initialDelaySeconds: 5 periodSeconds: 10 readinessProbe: httpGet: path: /health port: 8000 initialDelaySeconds: 2 periodSeconds: 5 Load Balancer Integration Point your load balancer's health check at the configured path. Since health checks bypass the ASGI app, they reflect the server's true availability — not application-level readiness. Request IDs Every request is assigned a unique identifier for end-to-end tracing. How It Works If a trusted proxy sends X-Request-ID, Pounce uses that value Otherwise, Pounce generates a new UUID4 hex string (32 characters, no dashes) The ID is injected into: Response headers — X-Request-ID on every response ASGI scope — scope[&quot;extensions&quot;][&quot;request_id&quot;] Access logs — Both text and JSON formats Access Log Format Text mode: 127.0.0.1:5000 - &quot;GET / HTTP/1.1&quot; 200 1234 5.2ms [a1b2c3d4e5f6] The request ID is truncated to 12 characters in text mode for readability. JSON mode: { &quot;timestamp&quot;: &quot;2026-02-09T12:00:00+00:00&quot;, &quot;level&quot;: &quot;INFO&quot;, &quot;logger&quot;: &quot;pounce.access&quot;, &quot;method&quot;: &quot;GET&quot;, &quot;path&quot;: &quot;/&quot;, &quot;http_version&quot;: &quot;1.1&quot;, &quot;status&quot;: 200, &quot;bytes_sent&quot;: 1234, &quot;duration_ms&quot;: 5.2, &quot;client&quot;: &quot;127.0.0.1:5000&quot;, &quot;request_id&quot;: &quot;a1b2c3d4e5f67890abcdef1234567890&quot; } App-Level Access Your ASGI app can access the request ID from the scope: async def app(scope, receive, send): request_id = scope.get(&quot;extensions&quot;, {}).get(&quot;request_id&quot;) # Use in your own logging, pass to downstream services, etc. Nginx Forwarding To propagate request IDs from nginx, add X-Request-ID as a proxy header and configure trusted_hosts: proxy_set_header X-Request-ID $request_id; Prometheus Metrics Pounce includes a PrometheusCollector that implements the LifecycleCollector protocol. It tracks standard HTTP server metrics from lifecycle events with zero external dependencies. Setup from pounce import ServerConfig from pounce.metrics import PrometheusCollector from pounce.server import Server collector = PrometheusCollector() config = ServerConfig(host=&quot;0.0.0.0&quot;, workers=4) server = Server(config, app, lifecycle_collector=collector) Metrics Metric Type Description http_requests_total Counter Total requests by status code http_request_duration_seconds Histogram Request duration distribution http_connections_active Gauge Currently open TCP connections http_requests_in_flight Gauge Requests currently being processed http_bytes_sent_total Counter Total response bytes sent Export Format Call collector.export() to get Prometheus text exposition format: # HELP http_requests_total Total HTTP requests. # TYPE http_requests_total counter http_requests_total{method=&quot;unknown&quot;,status=&quot;200&quot;} 1523 http_requests_total{method=&quot;unknown&quot;,status=&quot;404&quot;} 12 # HELP http_request_duration_seconds Request duration in seconds. # TYPE http_request_duration_seconds histogram http_request_duration_seconds_bucket{le=&quot;0.005&quot;} 800 http_request_duration_seconds_bucket{le=&quot;0.01&quot;} 1200 http_request_duration_seconds_bucket{le=&quot;0.025&quot;} 1400 ... http_request_duration_seconds_bucket{le=&quot;+Inf&quot;} 1535 http_request_duration_seconds_sum 45.678 http_request_duration_seconds_count 1535 # HELP http_connections_active Active TCP connections. # TYPE http_connections_active gauge http_connections_active 42 # HELP http_requests_in_flight Requests currently being processed. # TYPE http_requests_in_flight gauge http_requests_in_flight 3 # HELP http_bytes_sent_total Total bytes sent in responses. # TYPE http_bytes_sent_total counter http_bytes_sent_total 15234567 Serving Metrics Expose a /metrics endpoint in your ASGI app: from pounce.metrics import PrometheusCollector collector = PrometheusCollector() async def app(scope, receive, send): if scope[&quot;path&quot;] == &quot;/metrics&quot;: body = collector.export().encode() await send({ &quot;type&quot;: &quot;http.response.start&quot;, &quot;status&quot;: 200, &quot;headers&quot;: [ (b&quot;content-type&quot;, b&quot;text/plain; version=0.0.4; charset=utf-8&quot;), (b&quot;content-length&quot;, str(len(body)).encode()), ], }) await send({&quot;type&quot;: &quot;http.response.body&quot;, &quot;body&quot;: body}) return # ... rest of your app Thread Safety PrometheusCollector uses threading.Lock internally — safe for concurrent access from multiple workers in free-threading mode. JSON Snapshot For programmatic access, use collector.snapshot(): data = collector.snapshot() # { # &quot;requests_total&quot;: {(&quot;&quot;, &quot;200&quot;): 1523, (&quot;&quot;, &quot;404&quot;): 12}, # &quot;duration_sum_seconds&quot;: 45.678, # &quot;duration_count&quot;: 1535, # &quot;connections_active&quot;: 42, # &quot;requests_in_flight&quot;: 3, # &quot;bytes_sent_total&quot;: 15234567, # } Lifecycle Events All observability features build on Pounce's structured lifecycle event system. Every connection and request emits immutable events: Event When ConnectionOpened TCP connection accepted RequestStarted HTTP request headers parsed ResponseCompleted Response fully sent RequestFailed Request handler raised an exception ClientDisconnected Client disconnected mid-request ConnectionClosed TCP connection closed These events flow to any LifecycleCollector — the PrometheusCollector is one implementation, but you can write your own for custom metrics, tracing, or event sourcing. See Also Production — Full production deployment guide Security — Security hardening features ServerConfig — All configuration options

--------------------------------------------------------------------------------

Metadata:
- Word Count: 706
- Reading Time: 4 minutes