# Output Formats

URL: /bengal/docs/ship/output-formats/
Section: ship

--------------------------------------------------------------------------------

Output Formats Bengal can generate multiple output formats for your content, enabling search functionality, AI discovery, and programmatic access. Available Formats Per-Page Formats Generated for every page in your site: JSON (index.json): Structured data including metadata, HTML content, plain text, and optional heading-level chunks for RAG. LLM Text (index.txt): AI-friendly plain text format optimized for RAG (Retrieval-Augmented Generation) and LLM consumption. Markdown (index.md): Markdown mirror for coding agents and documentation checkers. Each file includes a short directive pointing agents to the site's llms.txt index. Site-Wide Formats Generated at the site root: Site Index (index.json): A searchable index of all pages (useful for client-side search). Full LLM Text (llm-full.txt): The complete content of your site in a single plain text file. LLMs.txt (llms.txt): Curated site overview per the llms.txt spec — lightweight navigation for AI agents. Build Changelog (changelog.json): Per-build diff of added, modified, and removed pages (for incremental indexing). Agent Manifest (agent.json): Hierarchical site structure with sections and available formats (for agent discovery). Configuration Enable output formats in your config file. YAML (directory config) TOML (single file) # config/_default/outputs.yaml output_formats: enabled: true per_page: [&quot;json&quot;, &quot;llm_txt&quot;, &quot;markdown&quot;] site_wide: [&quot;index_json&quot;] options: excerpt_length: 200 # Excerpt length for site index json_indent: null # null for compact JSON, 2 for pretty-print llm_separator_width: 80 # Width of LLM text separators include_full_content_in_index: false # Include full content in site index include_chunks: true # Heading-level chunks in per-page JSON (for RAG) exclude_sections: [] # Sections to exclude from output formats exclude_patterns: [&quot;404.html&quot;, &quot;search.html&quot;] # Files to exclude # bengal.toml [output_formats] enabled = true per_page = [&quot;json&quot;, &quot;llm_txt&quot;, &quot;markdown&quot;] site_wide = [&quot;index_json&quot;] [output_formats.options] excerpt_length = 200 json_indent = null llm_separator_width = 80 include_full_content_in_index = false include_chunks = true exclude_sections = [] exclude_patterns = [&quot;404.html&quot;, &quot;search.html&quot;] Tip Tip Effective Defaults: The [features] section controls which formats are enabled. With default features (json = true, llm_txt = true), Bengal generates: per_page: [&quot;json&quot;, &quot;llm_txt&quot;, &quot;markdown&quot;] (JSON, LLM text, and Markdown mirrors) site_wide: [&quot;index_json&quot;, &quot;llm_full&quot;, &quot;llms_txt&quot;, &quot;changelog&quot;, &quot;agent_manifest&quot;] (search index, LLM texts, build changelog, and agent manifest) To disable LLM text generation, set features.llm_txt = false in your config. Note Note Visibility: Output formats respect page visibility settings. Hidden pages and drafts are excluded by default. Use exclude_sections or exclude_patterns for additional filtering. Use Cases Client-Side Search Fetch the site index to implement fast, client-side search without a backend. Note Note For larger sites, enable the Pre-built Lunr Index to improve performance. This requires the search optional dependency: pip install &quot;bengal[search]&quot; This generates search-index.json (a pre-serialized Lunr index) in addition to index.json, which loads faster in the browser. Bengal's search backend is explicit and defaults to search.backend: lunr. index.json remains the stable source artifact for client-side search, and search-index.json is emitted only by the Lunr backend when prebuilding is enabled. &lt;!-- Simple search UI --&gt; &lt;input type=&quot;text&quot; id=&quot;search-input&quot; placeholder=&quot;Search...&quot;&gt; &lt;ul id=&quot;search-results&quot;&gt;&lt;/ul&gt; &lt;script&gt; const searchInput = document.getElementById(&#x27;search-input&#x27;); const resultsList = document.getElementById(&#x27;search-results&#x27;); let searchIndex = []; // Fetch index once fetch(&#x27;/index.json&#x27;) .then(response =&gt; response.json()) .then(data =&gt; { searchIndex = data.pages; }); // Filter and display results searchInput.addEventListener(&#x27;input&#x27;, (e) =&gt; { const query = e.target.value.toLowerCase(); if (query.length &lt; 2) { resultsList.innerHTML = &#x27;&#x27;; return; } const results = searchIndex.filter(page =&gt; (page.title &amp;&amp; page.title.toLowerCase().includes(query)) || (page.excerpt &amp;&amp; page.excerpt.toLowerCase().includes(query)) ).slice(0, 10); resultsList.innerHTML = results.map(page =&gt; ` &lt;li&gt; &lt;a href=&quot;${page.href}&quot;&gt; &lt;strong&gt;${page.title}&lt;/strong&gt; &lt;p&gt;${page.excerpt}&lt;/p&gt; &lt;/a&gt; &lt;/li&gt; `).join(&#x27;&#x27;); }); &lt;/script&gt; AI &amp; LLM Discovery Provide llm-full.txt to LLMs to allow them to ingest your entire documentation site efficiently. curl https://mysite.com/llm-full.txt Static API Use your static site as a read-only API for other applications. import requests # Get page data data = requests.get(&#x27;https://mysite.com/docs/intro/index.json&#x27;).json() print(data[&#x27;title&#x27;]) print(data[&#x27;word_count&#x27;])

--------------------------------------------------------------------------------

Metadata:
- Author: lbliii
- Word Count: 593
- Reading Time: 3 minutes