# Testing

*URL: /milo-cli/docs/quality/testing/ · Section: quality · Tags: testing, snapshots, recording, replay*

Milo ships with testing utilities purpose-built for the Elm Architecture: test reducers with action sequences, test views with snapshots, test sagas step-by-step, and record/replay entire sessions.

## CLI command testing

Agent-facing CLIs should test the command contract before interactive behavior. Use four small layers:

1. **Schema** — `function_to_schema(command)` matches the function signature.
2. **Direct dispatch** — `cli.invoke([...])` parses argv and returns the expected `InvokeResult`.
3. **MCP dispatch** — `_call_tool(cli, {...})` returns the same content, and structured `errorData` on malformed input.
4. **Verify** — `milo.verify.verify("app.py")` passes the same import, discovery, schema, `tools/list`, and subprocess handshake checks as `milo verify`.

```python
from pathlib import Path

from milo.mcp import _call_tool
from milo.schema import function_to_schema
from milo.verify import verify

from app import cli, greet


def test_schema_matches_signature():
    schema = function_to_schema(greet)
    assert schema["required"] == ["name"]
    assert schema["properties"]["name"]["type"] == "string"


def test_direct_dispatch():
    result = cli.invoke(["greet", "--name", "Alice"])
    assert result.exit_code == 0
    assert "Hello, Alice!" in result.output


def test_mcp_dispatch():
    result = _call_tool(cli, {"name": "greet", "arguments": {"name": "Agent"}})
    assert result["content"][0]["text"] == "Hello, Agent!"
    assert "isError" not in result


def test_milo_verify_passes():
    report = verify(str(Path(__file__).resolve().parents[1] / "app.py"))
    assert report.exit_code == 0, report.format()
```

Projects scaffolded by `milo new` include these layers in `tests/test_app.py`. Use the reducer, render, saga, and replay helpers below for interactive app behavior.

## Docs and example drift

Milo's own docs use tagged Markdown fences for snippets that should keep working.
Only fences marked with `milo-docs:*` are checked, so long-running MCP servers and external registration commands can stay documented without running in CI.

```bash
make docs-test
```

That target compiles the built-in and example Kida templates, then runs `scripts/check_docs_snippets.py` against the repo docs. Use:

- `milo-docs:run` for deterministic shell snippets.
- `milo-docs:compile` for Python or Kida snippets that should parse.
- `milo-docs:skip reason=<why>` for commands that must remain visible but should not run automatically.

## Testing strategies

```mermaid
flowchart TB
    subgraph Unit
        RS[assert_state] --> Reducer
        RN[assert_renders] --> Template
        SG[assert_saga] --> Saga
    end
    subgraph Integration
        Rec[recording_middleware] --> Session[JSONL Session]
        Session --> Replay[milo replay]
    end
    Replay -->|"--assert"| CI[CI Regression]
```

## Snapshot testing

Render state through a template and compare the output to a snapshot file:

```python
from milo.testing import assert_renders

assert_renders(
    {"count": 5, "label": "Total"},
    "counter.kida",
    snapshot="tests/snapshots/counter_5.txt",
)
```

On the first run, the snapshot file is created. On subsequent runs, the output is compared to the stored snapshot. ANSI codes are stripped by default.

Update snapshots:

```bash
MILO_UPDATE_SNAPSHOTS=1 pytest
```

CI mode:

```bash
pytest  # Fails if snapshots don't match
```

## Reducer testing

Feed an action sequence through a reducer and assert the final state:

```python
from milo.testing import assert_state
from milo import Action

assert_state(
    reducer,
    None,  # initial state
    [Action("@@INIT"), Action("INCREMENT"), Action("INCREMENT")],
    {"count": 2},  # expected final state
)
```

> **Note:** `assert_state` replays actions synchronously — no event loop, no rendering, no sagas. This isolates the reducer logic for fast, deterministic tests.
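The synchronous replay that `assert_state` performs can be sketched in plain Python. This is a minimal illustration rather than Milo's implementation; the `Action` dataclass and `counter_reducer` below are hypothetical stand-ins:

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class Action:
    # Minimal stand-in for Milo's Action: a type tag plus an optional payload.
    type: str
    payload: Any = None


def counter_reducer(state, action):
    # Hypothetical example reducer: @@INIT seeds the state, INCREMENT bumps it.
    if action.type == "@@INIT":
        return {"count": 0}
    if action.type == "INCREMENT":
        return {**state, "count": state["count"] + 1}
    return state


def replay(reducer, state, actions):
    # Fold the action sequence through the reducer, synchronously and in order.
    for action in actions:
        state = reducer(state, action)
    return state


final = replay(
    counter_reducer,
    None,
    [Action("@@INIT"), Action("INCREMENT"), Action("INCREMENT")],
)
assert final == {"count": 2}
```

Because the fold has no event loop or I/O, a failing assertion points directly at reducer logic, which is what makes this style of test fast and deterministic.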
## Saga testing

Step through a saga generator, asserting each yielded effect:

```python
from milo.testing import assert_saga
from milo import Call, Put, Action

assert_saga(
    fetch_saga(),
    [
        (Call(fetch_json, ("https://api.example.com",), {}), {"data": 42}),
        (Put(Action("DATA_LOADED", payload={"data": 42})), None),
    ],
)
```

Each tuple is `(expected_effect, value_to_send_back)`. The test runner asserts that the yielded effect matches, then sends the value back into the generator.

## Session recording

Record every action dispatched during an interactive session.

Auto-path:

```python
app = App(template="app.kida", reducer=reducer, initial_state=None, record=True)
app.run()  # Writes to session.jsonl
```

Custom path:

```python
app = App(template="app.kida", reducer=reducer, initial_state=None, record="my_session.jsonl")
app.run()
```

The recording middleware captures each action with a state hash in JSONL format.

## Session replay

Replay a recorded session for debugging or CI regression testing:

```bash
# Normal replay at 2x speed
milo replay session.jsonl --speed 2.0

# Show state diffs between actions
milo replay session.jsonl --diff

# Step-by-step interactive replay
milo replay session.jsonl --step

# CI mode: assert state hashes match
milo replay session.jsonl --assert --reducer myapp:reducer
```

> **Warning:** The `--assert` flag compares state hashes at each step against the recorded values. If you change your reducer logic, recorded sessions will fail hash checks — re-record affected sessions after intentional changes.

## Recording format

Sessions are stored as JSONL (one JSON object per line):

```json
{"seq": 0, "type": "@@INIT", "payload": null, "hash": "a1b2c3d4"}
{"seq": 1, "type": "@@KEY", "payload": {"char": " ", "name": null, "ctrl": false}, "hash": "e5f6g7h8"}
{"seq": 2, "type": "@@KEY", "payload": {"char": "r", "name": null, "ctrl": false}, "hash": "i9j0k1l2"}
```

Each line includes a sequence number, the action, and a hash of the resulting state. This makes recordings both human-readable and machine-verifiable.
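Because the format is plain JSONL, a recorded session can be sanity-checked with nothing but the standard library. The sketch below assumes only the fields shown above; Milo's hash algorithm is internal, so this verifies structure and sequence ordering rather than recomputing hash values:

```python
import json


def check_session(lines):
    """Validate that each line is a well-formed entry and seq numbers are gapless."""
    count = 0
    for expected_seq, line in enumerate(lines):
        entry = json.loads(line)
        # Every entry carries a sequence number, action type/payload, and state hash.
        assert set(entry) >= {"seq", "type", "payload", "hash"}, f"missing keys: {entry}"
        assert entry["seq"] == expected_seq, f"sequence gap at entry {expected_seq}"
        count += 1
    return count


session = [
    '{"seq": 0, "type": "@@INIT", "payload": null, "hash": "a1b2c3d4"}',
    '{"seq": 1, "type": "@@KEY", "payload": {"char": " ", "name": null, "ctrl": false}, "hash": "e5f6g7h8"}',
]
assert check_session(session) == 2
```

A check like this is useful as a cheap pre-flight in CI before the heavier `milo replay --assert` run, since it catches truncated or hand-edited recordings early.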
--------------------------------------------------------------------------------

Metadata:

- Author: lbliii
- Word Count: 696
- Reading Time: 3 minutes