apunkt/ktx

mirror of https://github.com/Kaelio/ktx.git synced 2026-06-10 08:05:14 +02:00

Align docs with current KTX behavior (#106 )

* docs: align docs with current KTX behavior

* fix: generate valid agent sl query command

* docs: clarify KTX product mechanics

* fix: use <ol> for runtime pipeline steps in product mechanics

The PipelineStep component renders <li> elements, but the RuntimeDiagram
wrapper was a plain <div> instead of a list element. This produced invalid
HTML and accessibility warnings. IngestionDiagram already used <ol>.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* Add docs favicon

* docs: add semantic layer internals concept

* docs: refine documentation source label

* docs: clarify company documentation examples

---------

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-05-15 15:31:51 -04:00

6.5 KiB

Raw Permalink Blame History

Semantic Layer Docs Design

Date: 2026-05-15 Status: Design - pending implementation plan

Goal

Add a concise Concepts page that explains the semantic layer as the query planning engine inside KTX's broader context layer.

The page should make the technical depth visible to skeptical data users without positioning KTX as only a semantic-layer product. Success means a reader understands:

KTX is a context layer for agents.
The semantic layer is one core subsystem inside that context layer.
The join graph, grain declarations, and relationship metadata are what make generated SQL safer than schema-only or markdown-only approaches.
KTX maintains this semantic layer through ingest, validation, analyst edits, and reviewable files.

Current State

The docs currently explain semantic sources in two places:

docs-site/content/docs/concepts/the-context-layer.mdx describes semantic sources as one pillar of KTX context.
docs-site/content/docs/guides/writing-context.mdx documents the YAML fields for sources, measures, joins, grain, validation, and common errors.

That content is useful, but the differentiator is not visually obvious. The semantic layer is embedded in longer narrative pages, so readers can miss the hard parts: join graph construction, fan-out prevention, chasm traps, and query planning.

Positioning

Create a standalone Concepts page with a guarded title such as Semantic Layer Internals or The Semantic Engine Inside KTX.

The first screen must frame the product clearly:

KTX is a context layer. Its semantic layer is the query-planning core that turns reviewed context into safe SQL.

The page should avoid a title like Semantic Layer by itself because that can make KTX look like a narrow semantic-layer tool. The page should repeatedly show the semantic layer between the broader context inputs and the agent workflows it supports.

Add a short cross-link from the-context-layer.mdx so the existing overview keeps owning the product category. That section should say the semantic layer is one critical pillar, then link to the internals page for readers who want the mechanics.

Page Structure

Add docs-site/content/docs/concepts/semantic-layer-internals.mdx and include it in docs-site/content/docs/concepts/meta.json after the-context-layer.

Recommended sections:

What this page explains
- One short paragraph.
- A two-column KTX is / KTX is not just table.
Where the semantic layer fits
- A visual block showing: context inputs -> semantic layer engine -> agent workflows.
- Inputs include semantic YAML, wiki pages, scans, and provenance.
- Outputs include search, SQL generation, explanations, edits, and review.
The join graph
- Explain nodes as semantic sources and edges as validated joins.
- Show a small graph with orders, customers, order_items, and refunds.
- Keep text to one or two short paragraphs plus bullets.
How KTX builds it
- Show a pipeline from database evidence and imported modeling tools to reviewable YAML.
- Mention declared keys, inferred relationships, dbt/MetricFlow/LookML imports, query history, validation, and analyst review.
How KTX maintains it
- Show a feedback loop: ingest evidence -> YAML diff -> validation -> analyst review -> agent use -> corrections.
- Emphasize that files remain the source of truth.
Why grain and relationships matter
- Use the fan-out problem as the central example.
- Compare a naive join against a safe semantic-layer plan.
- Explain many-to-one, one-to-many, many-to-many, chasm traps, and ambiguous paths in compact bullets.
How the execution engine uses the graph
- Explain path selection, unsafe path rejection, pre-aggregation into CTEs, filter placement, and dialect transpilation.
- Include a small before/after SQL-shape diagram or table.
What this means for agents
- Summarize why this is more than saving markdown: agents can inspect, query, validate, edit, and review the same semantic files.
- Link to Writing Context and ktx sl.

Scannability Rules

The implementation should shorten long prose blocks across the touched pages.

Keep most text blocks to one or two paragraphs.
Prefer bullets, tables, diagrams, and compact callout blocks between prose.
Avoid four-paragraph narrative runs.
Use diagrams before dense explanations when the concept is spatial.
Keep examples concrete and copy-pasteable.

Visual Direction

Use the existing docs-site MDX style rather than a new design system. The current the-context-layer.mdx page already uses custom not-prose MDX diagrams with Fumadocs color tokens; the new page should follow that pattern.

The diagrams should feel like technical product documentation:

restrained, dense, and readable;
high contrast for the semantic-layer engine box;
visible arrows or adjacency that make flow obvious;
tables for classification and comparison;
no marketing hero, decorative gradients, or generic card-heavy layout.

Non-goals

Do not redesign the whole docs site.
Do not rename KTX concepts, packages, commands, or directories.
Do not claim KTX replaces every BI or semantic-layer system.
Do not add implementation details that are not true in the current codebase.
Do not expand the page into a long reference for every YAML field; keep that in Writing Context.

Verification

Because this is docs-only work, verification should focus on the docs site:

Run the docs build or the narrowest available docs-site type/build check.
Run formatting or lint checks if the docs package exposes them.
Preview the page locally and inspect desktop and mobile widths.
Confirm the page is listed in Concepts navigation.
Confirm the opening section clearly says KTX is a context layer, not just a semantic-layer tool.

If implementation changes only MDX and metadata, TypeScript workspace tests are not required unless the page introduces shared components.

Acceptance Criteria

A standalone Concepts page explains the semantic-layer internals.
The Context Layer page links to the new internals page without making the overview longer.
The new page includes diagrams for the system fit, join graph, maintenance loop, and fan-out-safe execution path.
Long prose is broken into scannable sections with bullets, tables, and visual interruptions.
The positioning consistently says KTX is a context layer with a semantic execution core.
Docs-site verification passes or any skipped check is reported with a reason.

6.5 KiB Raw Permalink Blame History