docs: add semantic layer internals concept

2026-07-25 12:01:03 +02:00 · 2026-05-15 12:22:54 -07:00 · 2026-05-15 12:22:54 -07:00 · 4938a8363f
commit 4938a8363f
parent 663143ed5f
6 changed files with 918 additions and 6 deletions
--- a/docs-site/app/docs/[[...slug]]/page.tsx
+++ b/docs-site/app/docs/[[...slug]]/page.tsx
@@ -39,20 +39,29 @@ export default async function Page(props: {
  const hero = isHeroPage(params.slug);

  return (
-    <DocsPage toc={page.data.toc}>
+    <DocsPage
+      toc={page.data.toc}
+      className="!mx-0 min-w-0 justify-self-start md:!mx-auto"
+      style={{
+        width: "calc(100vw - 2rem)",
+        maxWidth: "900px",
+      }}
+    >
      {!hero && (
        <>
-          <div className="flex items-start justify-between gap-4">
+          <div className="flex flex-col gap-3 sm:flex-row sm:items-start sm:justify-between sm:gap-4">
            <DocsTitle>{page.data.title}</DocsTitle>
            <DocsPageActions
              markdownUrl={`${page.url}.md`}
              mdxSource={mdxSource}
            />
          </div>
-          <DocsDescription>{page.data.description}</DocsDescription>
+          <DocsDescription className="wrap-anywhere">
+            {page.data.description}
+          </DocsDescription>
        </>
      )}
-      <DocsBody>
+      <DocsBody className="min-w-0 max-w-full wrap-anywhere">
        <MDX components={{ ...defaultMdxComponents, pre: CodeBlock }} />
      </DocsBody>
    </DocsPage>
--- a/docs-site/content/docs/concepts/meta.json
+++ b/docs-site/content/docs/concepts/meta.json
@@ -1,5 +1,5 @@
 {
  "title": "Concepts",
  "defaultOpen": true,
-  "pages": ["the-context-layer", "context-as-code"]
+  "pages": ["the-context-layer", "semantic-layer-internals", "context-as-code"]
 }
--- a/docs-site/content/docs/concepts/semantic-layer-internals.mdx
+++ b/docs-site/content/docs/concepts/semantic-layer-internals.mdx
@@ -0,0 +1,398 @@
+---
+title: Semantic Layer Internals
+description: How KTX uses join graphs, grain, and relationship metadata to turn context into safe SQL.
+---
+
+KTX is a context layer for agents. This page focuses on one internal subsystem:
+the semantic execution layer that turns reviewed context into safe SQL.
+
+The semantic layer is important, but it is not the whole product. KTX also
+handles schema evidence, wiki context, provenance, validation, and agent
+workflows around those files.
+
+Read the page as a pipeline:
+
+- context inputs feed the semantic engine;
+- evidence becomes a join graph with grain and relationship metadata;
+- review and corrections keep that graph current;
+- the execution engine uses the graph to avoid fan-out and ambiguous joins.
+
+## Where the semantic layer fits
+
+The semantic layer is not a separate product category inside KTX. It is the
+engine that makes the rest of the context actionable for SQL generation.
+
+<div
+  className="not-prose my-8 overflow-hidden rounded-lg border border-fd-border bg-fd-card shadow-sm"
+  aria-label="How context inputs flow through the semantic layer into agent workflows"
+>
+  <div className="grid gap-0 lg:grid-cols-[1fr_2rem_1.12fr_2rem_1fr]">
+    <section className="bg-fd-background p-4">
+      <p className="mb-3 text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+        {"Context inputs"}
+      </p>
+      <div className="grid gap-2 text-sm">
+        <div className="border-l-2 border-fd-primary bg-fd-card px-3 py-2">
+          <p className="font-mono text-xs text-fd-foreground">semantic-layer/</p>
+          <p className="mt-1 text-xs leading-5 text-fd-muted-foreground">
+            {"source YAML, measures, joins, grain"}
+          </p>
+        </div>
+        <div className="border-l-2 border-amber-500 bg-fd-card px-3 py-2">
+          <p className="font-mono text-xs text-fd-foreground">wiki/</p>
+          <p className="mt-1 text-xs leading-5 text-fd-muted-foreground">
+            {"business rules, definitions, caveats"}
+          </p>
+        </div>
+        <div className="border-l-2 border-orange-500 bg-fd-card px-3 py-2">
+          <p className="font-mono text-xs text-fd-foreground">raw-sources/</p>
+          <p className="mt-1 text-xs leading-5 text-fd-muted-foreground">
+            {"schema scans, keys, imported metadata"}
+          </p>
+        </div>
+        <div className="border-l-2 border-slate-500 bg-fd-card px-3 py-2 dark:border-cyan-200">
+          <p className="font-mono text-xs text-fd-foreground">provenance</p>
+          <p className="mt-1 text-xs leading-5 text-fd-muted-foreground">
+            {"ingest decisions and review history"}
+          </p>
+        </div>
+      </div>
+    </section>
+
+    <div className="hidden items-center justify-center bg-fd-background lg:flex" aria-hidden="true">
+      <span className="h-px w-full bg-fd-border" />
+    </div>
+
+    <section className="relative bg-[#102226] p-5 text-white dark:bg-[#0b181b]">
+      <div className="absolute inset-y-0 left-0 w-1 bg-fd-primary" />
+      <p className="mb-3 text-[11px] font-semibold uppercase tracking-wide text-cyan-200">
+        {"Semantic layer engine"}
+      </p>
+      <div className="grid gap-2 sm:grid-cols-2">
+        <div className="rounded-md border border-cyan-100/20 bg-white/8 px-3 py-2">
+          <p className="text-sm font-semibold">Join graph</p>
+          <p className="mt-1 text-xs leading-5 text-cyan-50/75">
+            {"sources as nodes, joins as typed edges"}
+          </p>
+        </div>
+        <div className="rounded-md border border-cyan-100/20 bg-white/8 px-3 py-2">
+          <p className="text-sm font-semibold">Grain</p>
+          <p className="mt-1 text-xs leading-5 text-cyan-50/75">
+            {"row identity before aggregation"}
+          </p>
+        </div>
+        <div className="rounded-md border border-cyan-100/20 bg-white/8 px-3 py-2">
+          <p className="text-sm font-semibold">Measures</p>
+          <p className="mt-1 text-xs leading-5 text-cyan-50/75">
+            {"verified formulas and filters"}
+          </p>
+        </div>
+        <div className="rounded-md border border-cyan-100/20 bg-white/8 px-3 py-2">
+          <p className="whitespace-nowrap break-normal text-sm font-semibold">Relationships</p>
+          <p className="mt-1 text-xs leading-5 text-cyan-50/75">
+            {"many_to_one, one_to_many, one_to_one"}
+          </p>
+        </div>
+      </div>
+      <div className="mt-3 rounded-md border border-cyan-100/20 bg-cyan-50/10 px-3 py-2 text-sm">
+        {"Safe query planning before SQL is generated."}
+      </div>
+    </section>
+
+    <div className="hidden items-center justify-center bg-fd-background lg:flex" aria-hidden="true">
+      <span className="h-px w-full bg-fd-border" />
+    </div>
+
+    <section className="bg-fd-muted/35 p-4">
+      <p className="mb-3 text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+        {"Agent workflows"}
+      </p>
+      <div className="space-y-2 text-sm">
+        <div className="rounded-md border border-fd-border bg-fd-card px-3 py-2">
+          {"Search sources and wiki pages"}
+        </div>
+        <div className="rounded-md border border-fd-border bg-fd-card px-3 py-2">
+          {"Compile trusted SQL"}
+        </div>
+        <div className="rounded-md border border-fd-border bg-fd-card px-3 py-2">
+          {"Explain metrics and provenance"}
+        </div>
+        <div className="rounded-md border border-fd-border bg-fd-card px-3 py-2">
+          {"Patch files and validate review"}
+        </div>
+      </div>
+    </section>
+  </div>
+</div>
+
+## The join graph KTX builds
+
+A semantic source is a node. A join is an edge with a join condition and a
+relationship type. The graph lets KTX choose valid paths, reject unsafe paths,
+and reason about whether a join preserves or multiplies rows before SQL is
+generated.
+
+- `many_to_one` paths are usually safe for adding dimensions.
+- `one_to_many` paths can multiply fact rows and trigger fan-out handling.
+- Equal-cost paths can be ambiguous, so aliases and explicit joins matter.
+
+<figure
+  className="not-prose my-8 overflow-hidden rounded-lg border border-fd-border bg-fd-card p-4 shadow-sm"
+  aria-label="Example semantic join graph"
+>
+  <div className="grid gap-3 md:grid-cols-[1fr_1fr_1fr]">
+    <div className="rounded-md border border-fd-border bg-fd-background px-4 py-3">
+      <p className="text-sm font-semibold text-fd-foreground">customers</p>
+      <p className="mt-1 text-xs text-fd-muted-foreground">grain: customer_id</p>
+    </div>
+    <div className="rounded-md border-2 border-fd-primary bg-fd-background px-4 py-3">
+      <p className="text-sm font-semibold text-fd-foreground">orders</p>
+      <p className="mt-1 text-xs text-fd-muted-foreground">grain: order_id</p>
+    </div>
+    <div className="rounded-md border border-fd-border bg-fd-background px-4 py-3">
+      <p className="text-sm font-semibold text-fd-foreground">order_items</p>
+      <p className="mt-1 text-xs text-fd-muted-foreground">grain: order_id, line_id</p>
+    </div>
+  </div>
+  <div className="my-3 grid gap-2 text-center text-xs font-medium text-fd-muted-foreground md:grid-cols-[1fr_1fr]">
+    <div>orders -> customers: many_to_one</div>
+    <div>orders -> order_items: one_to_many</div>
+  </div>
+  <figcaption className="mt-4 border-t border-fd-border pt-3 text-left text-xs leading-5 text-fd-muted-foreground">
+    <span className="font-medium text-fd-foreground">{"Example: "}</span>
+    {"a refunds source (not drawn above) also joins to orders. Used carefully, it explains net revenue. Joined naively, it can duplicate order-level measures."}
+  </figcaption>
+</figure>
+
+The graph is bidirectional for planning. If `orders -> customers` is
+`many_to_one`, the reverse path is `one_to_many`; KTX keeps that distinction
+instead of treating every join as a neutral edge.
+
+## How KTX builds the graph
+
+KTX starts from evidence, not a blank modeling canvas. Database scans and
+analytics-tool imports create source definitions that an analyst can review.
+
+| Evidence | What it contributes |
+|---|---|
+| Declared primary keys | Initial row grain for each source |
+| Declared foreign keys | Formal join candidates and relationship direction |
+| Inferred relationships | Useful edges when warehouses lack constraints |
+| dbt, MetricFlow, and LookML imports | Existing metrics, dimensions, entities, explores, and joins |
+| Query history | Real join and filter patterns agents should respect |
+| Analyst review | The final authority before context is merged |
+
+Generated YAML is intentionally reviewable. KTX can draft joins and measures,
+but the accepted semantic layer is still the plain-file diff your team approves.
+
+## How KTX keeps the graph current
+
+The semantic layer changes as schemas, metrics, and business rules change. KTX
+keeps that loop explicit instead of hiding it behind a remote runtime.
+
+<div
+  className="not-prose my-8 overflow-hidden rounded-lg border border-fd-border bg-fd-card shadow-sm"
+  aria-label="Semantic layer maintenance loop"
+>
+  <div className="border-b border-fd-border bg-fd-muted/35 px-4 py-3">
+    <p className="text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+      {"Semantic maintenance loop"}
+    </p>
+    <p className="mt-1 text-sm leading-6 text-fd-muted-foreground">
+      {"Every accepted correction becomes input to the next graph build."}
+    </p>
+  </div>
+  <div className="p-4">
+    <div className="-mx-4 overflow-x-auto px-4">
+      <div className="relative mx-auto h-[460px] w-[720px] max-w-none md:w-full md:max-w-[760px]">
+        <svg
+          aria-hidden="true"
+          className="absolute inset-0 h-full w-full text-fd-primary"
+          fill="none"
+          viewBox="0 0 760 460"
+        >
+          <g
+            stroke="currentColor"
+            strokeLinecap="round"
+            strokeLinejoin="round"
+            strokeOpacity="0.68"
+            strokeWidth="2.5"
+          >
+            <path d="M 352 80 H 384" />
+            <path d="M 600 80 H 668 V 150" />
+            <path d="M 632 284 V 378 H 626" />
+            <path d="M 408 378 H 376" />
+            <path d="M 160 378 H 96 V 308" />
+            <path d="M 128 172 V 80 H 140" />
+          </g>
+          <g fill="currentColor" fillOpacity="0.96" stroke="none">
+            <polygon points="0,0 -14,-7 -14,7" transform="translate(398 80)" />
+            <polygon points="0,0 -14,-7 -14,7" transform="translate(668 164) rotate(90)" />
+            <polygon points="0,0 -14,-7 -14,7" transform="translate(612 378) rotate(180)" />
+            <polygon points="0,0 -14,-7 -14,7" transform="translate(362 378) rotate(180)" />
+            <polygon points="0,0 -14,-7 -14,7" transform="translate(96 294) rotate(270)" />
+            <polygon points="0,0 -14,-7 -14,7" transform="translate(154 80)" />
+          </g>
+        </svg>
+
+        <div className="absolute left-1/2 top-1/2 flex h-32 w-56 -translate-x-1/2 -translate-y-1/2 flex-col items-center justify-center rounded-md border border-fd-primary/50 bg-fd-background px-4 py-4 text-center shadow-sm">
+          <p className="text-[11px] font-semibold uppercase tracking-wide text-fd-primary">
+            {"reviewed context"}
+          </p>
+          <p className="mt-2 text-sm font-semibold leading-6 text-fd-foreground">
+            {"The accepted graph becomes the starting point for the next build."}
+          </p>
+        </div>
+
+        <div className="absolute left-[160px] top-6 h-28 w-48 rounded-md border-2 border-fd-primary bg-fd-background px-4 py-3 text-sm shadow-sm">
+          <p className="text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+            {"Step 1"}
+          </p>
+          <p className="mt-1 font-semibold text-fd-foreground">{"ingest evidence"}</p>
+          <p className="mt-2 text-xs leading-5 text-fd-muted-foreground">
+            {"scan schemas, imports, and accepted files"}
+          </p>
+        </div>
+        <div className="absolute left-[408px] top-6 h-28 w-48 rounded-md border border-fd-border bg-fd-background px-4 py-3 text-sm shadow-sm">
+          <p className="text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+            {"Step 2"}
+          </p>
+          <p className="mt-1 font-semibold text-fd-foreground">{"YAML diff"}</p>
+          <p className="mt-2 text-xs leading-5 text-fd-muted-foreground">
+            {"draft source, join, grain, and measure changes"}
+          </p>
+        </div>
+        <div className="absolute left-[536px] top-[172px] h-28 w-48 rounded-md border border-fd-border bg-fd-background px-4 py-3 text-sm shadow-sm">
+          <p className="text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+            {"Step 3"}
+          </p>
+          <p className="mt-1 font-semibold text-fd-foreground">{"validation"}</p>
+          <p className="mt-2 text-xs leading-5 text-fd-muted-foreground">
+            {"check relationships, syntax, and unsafe query shapes"}
+          </p>
+        </div>
+        <div className="absolute left-[408px] top-[322px] h-28 w-48 rounded-md border border-fd-border bg-fd-background px-4 py-3 text-sm shadow-sm">
+          <p className="text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+            {"Step 4"}
+          </p>
+          <p className="mt-1 font-semibold text-fd-foreground">{"analyst review"}</p>
+          <p className="mt-2 text-xs leading-5 text-fd-muted-foreground">
+            {"accept, edit, or reject generated context"}
+          </p>
+        </div>
+        <div className="absolute left-[160px] top-[322px] h-28 w-48 rounded-md border border-fd-border bg-fd-background px-4 py-3 text-sm shadow-sm">
+          <p className="text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+            {"Step 5"}
+          </p>
+          <p className="mt-1 font-semibold text-fd-foreground">{"agent use"}</p>
+          <p className="mt-2 text-xs leading-5 text-fd-muted-foreground">
+            {"serve context to search, explain, and query"}
+          </p>
+        </div>
+        <div className="absolute left-8 top-[172px] h-28 w-48 rounded-md border border-fd-primary/70 bg-fd-background px-4 py-3 text-sm shadow-sm">
+          <p className="text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+            {"Step 6"}
+          </p>
+          <p className="mt-1 font-semibold text-fd-foreground">{"corrections"}</p>
+          <p className="mt-2 text-xs leading-5 text-fd-muted-foreground">
+            {"agent and analyst fixes become new evidence"}
+          </p>
+        </div>
+      </div>
+    </div>
+  </div>
+</div>
+
+This matters because semantic correctness is not static. If a source gains a
+new key, a metric changes definition, or an analyst corrects a relationship,
+the next agent gets that reviewed context.
+
+## The modeling problem the graph solves
+
+Fan-out is the classic failure mode. If an order-level measure is joined to
+line-item rows before aggregation, one order can become many rows and revenue
+can be counted more than once.
+
+| Problem | What happens | How KTX avoids it |
+|---|---|---|
+| Order measure joins to `order_items` | `orders.revenue` repeats once per item | Detect the `one_to_many` path and pre-aggregate the order measure |
+| Two independent fact sources share `customers` | Measures from each fact table multiply across the shared dimension | Treat it as a chasm trap and use aggregate-locality planning |
+| Filter lives only across a `one_to_many` path | Filtering after the join changes the measure grain | Reject or localize the filter instead of silently producing unsafe SQL |
+| Multiple equal-cost paths connect the same sources | The join path is ambiguous | Prefer safer paths and use aliases to disambiguate repeated joins |
+
+Many-to-many questions usually show up as multiple one-to-many paths or
+independent fact sources. KTX treats those shapes as fan-out or chasm risks
+unless the query can be planned at a safe grain.
+
+## How the execution engine uses the graph
+
+The planner resolves the sources in a semantic query, chooses a join tree, and
+checks whether any requested dimension or filter crosses a row-multiplying
+edge. The SQL generator then chooses the simple path or the aggregate-locality
+path.
+
+| Naive SQL shape | Semantic-layer SQL shape |
+|---|---|
+| Join facts and dimensions first, then aggregate | Aggregate each fact source at its own grain, then join the results |
+| Put every filter in one outer `WHERE` clause | Keep measure filters with the measure source when locality is needed |
+| Trust the shortest textual join path | Prefer safe relationship paths and reject disconnected sources |
+| Let dimension grain differ across facts | Raise an error when asymmetric dimensions would fan out another measure |
+
+<div
+  className="not-prose my-8 overflow-hidden rounded-lg border border-fd-border bg-fd-card shadow-sm"
+  aria-label="Fan-out safe execution shape"
+>
+  <div className="grid gap-0 md:grid-cols-2">
+    <section className="border-b border-fd-border bg-fd-background p-4 md:border-b-0 md:border-r">
+      <p className="mb-3 text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+        {"Unsafe shape"}
+      </p>
+      <pre className="overflow-x-auto rounded-md bg-fd-muted p-3 text-xs leading-5 text-fd-foreground">
+{`select customers.segment, sum(orders.amount)
+from orders
+  join order_items using (order_id)
+  join customers using (customer_id)
+group by customers.segment`}
+      </pre>
+      <p className="mt-3 text-sm text-fd-muted-foreground">
+        {"The order measure is exposed to line-item fan-out before aggregation."}
+      </p>
+    </section>
+    <section className="bg-fd-background p-4">
+      <p className="mb-3 text-[11px] font-semibold uppercase tracking-wide text-fd-muted-foreground">
+        {"KTX shape"}
+      </p>
+      <pre className="overflow-x-auto rounded-md border border-fd-border bg-fd-muted p-3 text-xs leading-5 text-fd-foreground">
+{`with orders_agg as (
+  select customer_id, sum(amount) as revenue
+  from orders
+  group by customer_id
+)
+select customers.segment, sum(orders_agg.revenue) as revenue
+from orders_agg join customers using (customer_id)
+group by customers.segment`}
+      </pre>
+      <p className="mt-3 text-sm text-fd-muted-foreground">
+        {"KTX pre-aggregates fact measures at their own grain before joining dimensions."}
+      </p>
+    </section>
+  </div>
+</div>
+
+The result is not magic. It is structured planning: validated sources, typed
+relationships, graph search, fan-out detection, aggregate locality, and final
+dialect transpilation.
+
+## What this means for agents
+
+KTX gives agents a semantic surface they can inspect and improve, not just a
+folder of notes.
+
+- Search semantic sources and related wiki pages before writing SQL.
+- Compile SQL through `ktx sl query` instead of guessing joins.
+- Validate semantic-layer changes before review.
+- Patch YAML and Markdown files in git.
+- Explain metric meaning and provenance from the same accepted context.
+
+Next, read [Writing Context](/docs/guides/writing-context) for the YAML editing
+workflow or [ktx sl](/docs/cli-reference/ktx-sl) for the command reference.
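
The fan-out failure that the new page describes can be reproduced with a few in-memory rows. The sketch below is illustrative TypeScript and not KTX code: the `Relationship` type, the sample rows, and every function name are assumptions made for this example. It compares the "unsafe shape" (join `orders` to `order_items`, then sum) with the "KTX shape" (aggregate `orders` at its own grain, then join to `customers`).

```typescript
// Hypothetical illustration only: none of these names are KTX APIs.
type Relationship = "many_to_one" | "one_to_many" | "one_to_one";

interface Order { orderId: number; customerId: number; amount: number; }
interface OrderItem { orderId: number; lineId: number; }
interface Customer { customerId: number; segment: string; }

// Traversing an edge backwards flips its cardinality; one_to_one is symmetric.
function reverse(rel: Relationship): Relationship {
  if (rel === "many_to_one") return "one_to_many";
  if (rel === "one_to_many") return "many_to_one";
  return "one_to_one";
}

// Sample rows (assumed data): order 1 has three line items, order 2 has one.
const orders: Order[] = [
  { orderId: 1, customerId: 10, amount: 100 },
  { orderId: 2, customerId: 11, amount: 50 },
];
const orderItems: OrderItem[] = [
  { orderId: 1, lineId: 1 },
  { orderId: 1, lineId: 2 },
  { orderId: 1, lineId: 3 },
  { orderId: 2, lineId: 1 },
];
const customers: Customer[] = [
  { customerId: 10, segment: "enterprise" },
  { customerId: 11, segment: "self_serve" },
];

// many_to_one lookup from an order's customer_id to the customer's segment.
function segmentOf(customerId: number): string {
  for (const c of customers) {
    if (c.customerId === customerId) return c.segment;
  }
  throw new Error("unknown customer " + customerId);
}

// Unsafe shape: join orders to order_items (one_to_many), then sum the
// order-level amount. Each order is counted once per line item.
function naiveRevenueBySegment(): Record<string, number> {
  const totals: Record<string, number> = {};
  for (const order of orders) {
    for (const item of orderItems) {
      if (item.orderId !== order.orderId) continue;
      const seg = segmentOf(order.customerId);
      totals[seg] = (totals[seg] || 0) + order.amount;
    }
  }
  return totals;
}

// Safe shape: aggregate orders per customer first, then join to customers.
function preAggregatedRevenueBySegment(): Record<string, number> {
  const perCustomer: Record<number, number> = {};
  for (const order of orders) {
    perCustomer[order.customerId] =
      (perCustomer[order.customerId] || 0) + order.amount;
  }
  const totals: Record<string, number> = {};
  for (const key of Object.keys(perCustomer)) {
    const customerId = Number(key);
    const seg = segmentOf(customerId);
    totals[seg] = (totals[seg] || 0) + perCustomer[customerId];
  }
  return totals;
}
```

With these sample rows, the naive function reports 300 for `enterprise` because order 1 is repeated across its three line items, while the pre-aggregated function reports the true 100. Both agree on 50 for `self_serve`, which has a single line item, so fan-out bugs of this kind can stay hidden until an order gains a second row on the `one_to_many` side.
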
--- a/docs-site/content/docs/concepts/the-context-layer.mdx
+++ b/docs-site/content/docs/concepts/the-context-layer.mdx
@@ -191,7 +191,18 @@ KTX organizes context into four pillars:

 Each pillar covers a different kind of context agents need before they can safely write SQL, update semantic definitions, or explain an analytics result.

-**Semantic sources** are YAML definitions that describe your data in terms agents can reason about. Each source maps to a table or SQL query, declares its grain, defines typed columns, specifies valid joins, and exposes named measures with optional filters. This is where "revenue means `sum(amount)` excluding refunds" lives.
+**Semantic sources** are YAML definitions that describe your data in terms
+agents can reason about:
+
+- source tables or SQL queries;
+- row grain;
+- typed columns;
+- valid joins;
+- named measures, filters, and segments.
+
+This is where "revenue means `sum(amount)` excluding refunds" lives. For the
+join graph, fan-out protections, and execution mechanics, read
+[Semantic Layer Internals](/docs/concepts/semantic-layer-internals).

 ```yaml
 name: orders
--- a/docs/superpowers/plans/2026-05-15-semantic-layer-docs.md
+++ b/docs/superpowers/plans/2026-05-15-semantic-layer-docs.md
@@ -0,0 +1,328 @@
+# Semantic Layer Docs Implementation Plan
+
+> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [x]`) syntax for tracking.
+
+**Goal:** Add a standalone, scannable Concepts page that explains the semantic-layer internals while positioning KTX as a broader context layer.
+
+**Architecture:** Implement this as docs-only MDX content inside the existing Fumadocs tree. The new page uses inline MDX diagrams and Fumadocs color tokens, matching the custom diagram pattern already used in `the-context-layer.mdx`.
+
+**Tech Stack:** MDX, Fumadocs content, Next.js docs site, pnpm workspace commands.
+
+---
+
+### Task 1: Add Concepts Navigation Entry
+
+**Files:**
+- Modify: `docs-site/content/docs/concepts/meta.json`
+
+- [x] **Step 1: Update the Concepts page order**
+
+Replace the `pages` array with:
+
+```json
+{
+  "title": "Concepts",
+  "defaultOpen": true,
+  "pages": ["the-context-layer", "semantic-layer-internals", "context-as-code"]
+}
+```
+
+- [x] **Step 2: Verify JSON parses**
+
+Run:
+
+```bash
+node -e "JSON.parse(require('node:fs').readFileSync('docs-site/content/docs/concepts/meta.json', 'utf8')); console.log('concepts meta ok')"
+```
+
+Expected output:
+
+```text
+concepts meta ok
+```
+
+### Task 2: Create the Semantic Layer Internals Page
+
+**Files:**
+- Create: `docs-site/content/docs/concepts/semantic-layer-internals.mdx`
+
+- [x] **Step 1: Add frontmatter and opening positioning**
+
+Create the page with this frontmatter and opening section:
+
+```mdx
+---
+title: Semantic Layer Internals
+description: How KTX uses join graphs, grain, and relationship metadata to turn context into safe SQL.
+---
+
+KTX is a context layer for agents. Its semantic layer is the query-planning core
+that turns reviewed context into safe SQL.
+
+Use this page to understand the mechanics behind KTX's semantic execution:
+the join graph, how KTX builds and maintains it, and how that graph prevents
+classic analytics errors like fan-out and ambiguous join paths.
+
+| KTX is | KTX is not just |
+|---|---|
+| A context layer for agents | A metric definition store |
+| A system for ingesting, reviewing, and serving analytics context | A markdown saver |
+| A semantic execution layer plus wiki pages, scans, provenance, and agent workflows | A replacement for every BI semantic layer |
+```
+
+- [x] **Step 2: Add the system-fit diagram**
+
+Add a `Where the semantic layer fits` section with a custom `not-prose` diagram.
+The diagram must show:
+
+```text
+Context inputs -> Semantic layer engine -> Agent workflows
+```
+
+The semantic-layer box must be visually prominent and list:
+
+```text
+join graph
+grain
+measures
+relationships
+safe query planning
+```
+
+- [x] **Step 3: Add the join graph section**
+
+Add `## The join graph` with:
+
+- one short paragraph defining nodes and edges;
+- bullets for why the graph matters;
+- an inline diagram using `orders`, `customers`, `order_items`, and `refunds`.
+
+The section must include this claim in plain language:
+
+```text
+The graph lets KTX choose valid paths, reject unsafe paths, and reason about
+whether a join preserves or multiplies rows before SQL is generated.
+```
+
+- [x] **Step 4: Add build and maintenance sections**
+
+Add `## How KTX builds it` and `## How KTX maintains it`.
+
+`How KTX builds it` must cover these inputs:
+
+```text
+declared primary keys
+declared foreign keys
+inferred relationships
+dbt, MetricFlow, and LookML imports
+query history
+analyst review
+```
+
+`How KTX maintains it` must show this loop:
+
+```text
+ingest evidence -> YAML diff -> validation -> analyst review -> agent use -> corrections
+```
+
+- [x] **Step 5: Add the fan-out and safe execution sections**
+
+Add `## Why grain and relationships matter` with a fan-out example comparing
+orders joined to order items. Include a compact table with columns:
+
+```text
+Problem
+What happens
+How KTX avoids it
+```
+
+Add `## How the execution engine uses the graph` with a before/after table:
+
+```text
+Naive SQL shape
+Semantic-layer SQL shape
+```
+
+The safe path must mention:
+
+```text
+pre-aggregates fact measures at their own grain before joining dimensions
+```
+
+- [x] **Step 6: Add agent outcome links**
+
+Add a closing `## What this means for agents` section with bullets explaining
+that agents can:
+
+```text
+search semantic sources
+compile SQL through ktx sl query
+validate changes before review
+patch YAML and Markdown files in git
+explain provenance and metric meaning
+```
+
+End with links to:
+
+```mdx
+[Writing Context](/docs/guides/writing-context)
+[ktx sl](/docs/cli-reference/ktx-sl)
+```
+
+### Task 3: Add the Cross-Link from The Context Layer
+
+**Files:**
+- Modify: `docs-site/content/docs/concepts/the-context-layer.mdx`
+
+- [x] **Step 1: Replace the semantic sources paragraph with a scannable block**
+
+Find the `**Semantic sources**` paragraph under `KTX organizes context into four pillars`.
+Replace the long paragraph with:
+
+```mdx
+**Semantic sources** are YAML definitions that describe your data in terms
+agents can reason about:
+
+- source tables or SQL queries;
+- row grain;
+- typed columns;
+- valid joins;
+- named measures, filters, and segments.
+
+This is where "revenue means `sum(amount)` excluding refunds" lives. For the
+join graph, fan-out protections, and execution mechanics, read
+[Semantic Layer Internals](/docs/concepts/semantic-layer-internals).
+```
+
+- [x] **Step 2: Confirm the page still owns the product positioning**
+
+Search the edited file:
+
+```bash
+rg -n "context layer|Semantic Layer Internals|semantic layer - that's a critical component" docs-site/content/docs/concepts/the-context-layer.mdx
+```
+
+Expected: output includes the existing context-layer framing and the new internals link.
+
+### Task 4: Fix Mobile Docs Header Overflow
+
+**Files:**
+- Modify: `docs-site/app/docs/[[...slug]]/page.tsx`
+
+- [x] **Step 1: Stack title actions on narrow screens**
+
+Replace the non-hero page header wrapper:
+
+```tsx
+<div className="flex items-start justify-between gap-4">
+```
+
+with:
+
+```tsx
+<div className="flex flex-col gap-3 sm:flex-row sm:items-start sm:justify-between sm:gap-4">
+```
+
+This keeps desktop layout unchanged while preventing the action buttons from
+forcing horizontal overflow on mobile.
+
+- [x] **Step 2: Allow the docs article to shrink in the layout grid**
+
+Update the `DocsPage` and `DocsBody` wrappers:
+
+```tsx
+<DocsPage
+  toc={page.data.toc}
+  className="!mx-0 min-w-0 !max-w-[calc(100vw-2rem)] md:!mx-auto md:!max-w-[900px]"
+>
+```
+
+```tsx
+<DocsBody className="min-w-0 max-w-full">
+```
+
+This prevents tables, code blocks, and custom diagrams from forcing the
+Fumadocs main article column wider than the mobile viewport, overrides the
+library's built-in max-width rule on mobile, aligns the article to the left on
+mobile, and preserves the normal centered desktop max width.
+
+If long words still clip under mobile viewport capture, add the same wrapping
+behavior used by the Fumadocs sidebar:
+
+```tsx
+<DocsDescription className="wrap-anywhere">
+  {page.data.description}
+</DocsDescription>
+```
+
+```tsx
+<DocsBody className="min-w-0 max-w-full wrap-anywhere">
+```
+
+- [x] **Step 3: Recheck mobile render**
+
+Capture or inspect a 390px-wide render of:
+
+```text
+http://127.0.0.1:3000/docs/concepts/semantic-layer-internals
+```
+
+Expected: the title, description, action buttons, and positioning block stay
+within the viewport.
+
+### Task 5: Verify Docs Content and Build
+
+**Files:**
+- Check: `docs-site/content/docs/concepts/semantic-layer-internals.mdx`
+- Check: `docs-site/content/docs/concepts/the-context-layer.mdx`
+- Check: `docs-site/content/docs/concepts/meta.json`
+- Check: `docs-site/app/docs/[[...slug]]/page.tsx`
+
+- [x] **Step 1: Run content checks**
+
+Run:
+
+```bash
+rg -n "KTX is a context layer|markdown saver|fan-out|join graph|pre-aggregates|Semantic Layer Internals" docs-site/content/docs/concepts
+```
+
+Expected: matches appear in the new page and the cross-link appears in
+`the-context-layer.mdx`.
+
+- [x] **Step 2: Build the docs site**
+
+Run:
+
+```bash
+pnpm --filter ktx-docs build
+```
+
+Expected: build exits 0.
+
+- [x] **Step 3: Preview locally**
+
+Run:
+
+```bash
+pnpm --filter ktx-docs dev
+```
+
+Open:
+
+```text
+http://localhost:3000/docs/concepts/semantic-layer-internals
+```
+
+Inspect desktop and mobile widths. The opening should clearly position KTX as a
+context layer, the Concepts navigation should list the new page, and diagrams
+should not overlap or produce unreadable text.
+
+- [x] **Step 4: Commit implementation**
+
+Run:
+
+```bash
+git status --short
+git add docs-site/content/docs/concepts/meta.json docs-site/content/docs/concepts/semantic-layer-internals.mdx docs-site/content/docs/concepts/the-context-layer.mdx docs-site/app/docs/[[...slug]]/page.tsx docs/superpowers/plans/2026-05-15-semantic-layer-docs.md
+git commit -m "docs: add semantic layer internals concept"
+```
--- a/docs/superpowers/specs/2026-05-15-semantic-layer-docs-design.md
+++ b/docs/superpowers/specs/2026-05-15-semantic-layer-docs-design.md
@@ -0,0 +1,166 @@
+# Semantic Layer Docs Design
+
+**Date:** 2026-05-15
+**Status:** Design - pending implementation plan
+
+## Goal
+
+Add a concise Concepts page that explains the semantic layer as the query
+planning engine inside KTX's broader context layer.
+
+The page should make the technical depth visible to skeptical data users
+without positioning KTX as only a semantic-layer product. Success means a reader
+understands:
+
+- KTX is a context layer for agents.
+- The semantic layer is one core subsystem inside that context layer.
+- The join graph, grain declarations, and relationship metadata are what make
+  generated SQL safer than schema-only or markdown-only approaches.
+- KTX maintains this semantic layer through ingest, validation, analyst edits,
+  and reviewable files.
+
+## Current State
+
+The docs currently explain semantic sources in two places:
+
+- `docs-site/content/docs/concepts/the-context-layer.mdx` describes semantic
+  sources as one pillar of KTX context.
+- `docs-site/content/docs/guides/writing-context.mdx` documents the YAML fields
+  for sources, measures, joins, grain, validation, and common errors.
+
+That content is useful, but the differentiator is not visually obvious. The
+semantic layer is embedded in longer narrative pages, so readers can miss the
+hard parts: join graph construction, fan-out prevention, chasm traps, and query
+planning.
+
+## Positioning
+
+Create a standalone Concepts page with a qualified title such as
+`Semantic Layer Internals` or `The Semantic Engine Inside KTX`.
+
+The first screen must frame the product clearly:
+
+> KTX is a context layer. Its semantic layer is the query-planning core that
+> turns reviewed context into safe SQL.
+
+The page should avoid a title like `Semantic Layer` by itself because that can
+make KTX look like a narrow semantic-layer tool. The page should repeatedly show
+the semantic layer between the broader context inputs and the agent workflows it
+supports.
+
+Add a short cross-link from `the-context-layer.mdx` so the existing overview
+keeps owning the product category. The linking section should say the semantic
+layer is one critical pillar, then link to the internals page for readers who
+want the mechanics.
+
+## Page Structure
+
+Add `docs-site/content/docs/concepts/semantic-layer-internals.mdx` and include
+it in `docs-site/content/docs/concepts/meta.json` after `the-context-layer`.
+
+Recommended sections:
+
+1. `What this page explains`
+   - One short paragraph.
+   - A two-column `KTX is / KTX is not just` table.
+
+2. `Where the semantic layer fits`
+   - A visual block showing:
+     `context inputs -> semantic layer engine -> agent workflows`.
+   - Inputs include semantic YAML, wiki pages, scans, and provenance.
+   - Outputs include search, SQL generation, explanations, edits, and review.
+
+3. `The join graph`
+   - Explain nodes as semantic sources and edges as validated joins.
+   - Show a small graph with `orders`, `customers`, `order_items`, and
+     `refunds`.
+   - Keep text to one or two short paragraphs plus bullets.
+
+4. `How KTX builds it`
+   - Show a pipeline from database evidence and imported modeling tools to
+     reviewable YAML.
+   - Mention declared keys, inferred relationships, dbt/MetricFlow/LookML
+     imports, query history, validation, and analyst review.
+
+5. `How KTX maintains it`
+   - Show a feedback loop:
+     ingest evidence -> YAML diff -> validation -> analyst review -> agent use
+     -> corrections.
+   - Emphasize that files remain the source of truth.
+
+6. `Why grain and relationships matter`
+   - Use the fan-out problem as the central example.
+   - Compare a naive join against a safe semantic-layer plan.
+   - Explain many-to-one, one-to-many, many-to-many, chasm traps, and ambiguous
+     paths in compact bullets.
+
+7. `How the execution engine uses the graph`
+   - Explain path selection, unsafe path rejection, pre-aggregation into CTEs,
+     filter placement, and dialect transpilation.
+   - Include a small before/after SQL-shape diagram or table.
+
+8. `What this means for agents`
+   - Summarize why this is more than saving markdown:
+     agents can inspect, query, validate, edit, and review the same semantic
+     files.
+   - Link to `Writing Context` and `ktx sl`.
+
+## Scannability Rules
+
+The implementation should shorten long prose blocks across the touched pages.
+
+- Keep most text blocks to one or two paragraphs.
+- Prefer bullets, tables, diagrams, and compact callout blocks between prose.
+- Avoid four-paragraph narrative runs.
+- Use diagrams before dense explanations when the concept is spatial.
+- Keep examples concrete and copy-pasteable.
+
+## Visual Direction
+
+Use the existing docs-site MDX style rather than a new design system. The current
+`the-context-layer.mdx` page already uses custom `not-prose` MDX diagrams with
+Fumadocs color tokens; the new page should follow that pattern.
+
+The diagrams should feel like technical product documentation:
+
+- restrained, dense, and readable;
+- high contrast for the semantic-layer engine box;
+- visible arrows or adjacency that make flow obvious;
+- tables for classification and comparison;
+- no marketing hero, decorative gradients, or generic card-heavy layout.
+
+## Non-goals
+
+- Do not redesign the whole docs site.
+- Do not rename KTX concepts, packages, commands, or directories.
+- Do not claim KTX replaces every BI or semantic-layer system.
+- Do not add implementation details that are not true in the current codebase.
+- Do not expand the page into a long reference for every YAML field; keep that
+  in `Writing Context`.
+
+## Verification
+
+Because this is docs-only work, verification should focus on the docs site:
+
+- Run the docs build or the narrowest available docs-site type/build check.
+- Run formatting or lint checks if the docs package exposes them.
+- Preview the page locally and inspect desktop and mobile widths.
+- Confirm the page is listed in Concepts navigation.
+- Confirm the opening section clearly says KTX is a context layer, not just a
+  semantic-layer tool.
+
+If implementation changes only MDX and metadata, TypeScript workspace tests are
+not required unless the page introduces shared components.
+
+## Acceptance Criteria
+
+- A standalone Concepts page explains the semantic-layer internals.
+- The Context Layer page links to the new internals page without making the
+  overview longer.
+- The new page includes diagrams for the system fit, join graph, maintenance
+  loop, and fan-out-safe execution path.
+- Long prose is broken into scannable sections with bullets, tables, and visual
+  interruptions.
+- The positioning consistently says KTX is a context layer with a semantic
+  execution core.
+- Docs-site verification passes or any skipped check is reported with a reason.