Architecture - Patitas

Patitas uses a three-stage pipeline: Lexer → Parser → Renderer.

Overview

flowchart LR A[Source Text] --> B[Lexer] B --> C[Tokens] C --> D[Parser] D --> E[AST] E --> F[Renderer] F --> G[HTML]

The lexer is a state-machine tokenizer. No regex.

Key features:

Token types:

from patitas.lexer import Lexer

lexer = Lexer("# Hello **World**")
for token in lexer:
    print(token.type, token.value)

The parser consumes tokens to build the AST.

Key features:

Parsing strategy:

from patitas.parser import Parser

parser = Parser("# Hello")
doc = parser.parse()

The Abstract Syntax Tree uses frozen dataclasses with slots.

Why frozen?

Why slots?

@dataclass(frozen=True, slots=True)
class Heading:
    level: int
    children: tuple[Inline, ...]
    location: SourceLocation | None = None

The HTML renderer traverses the AST using pattern matching.

Key features:

from patitas.renderers.html import HtmlRenderer

renderer = HtmlRenderer(source=source_text)
html = renderer.render(doc)

class Highlighter(Protocol):
    def highlight(self, code: str, lang: str) -> str: ...

class IconResolver(Protocol):
    def resolve(self, name: str) -> str | Inline | None: ...

class DirectiveHandler(Protocol):
    name: str
    def parse(self, ...) -> Block | None: ...
    def render(self, ...) -> None: ...