ktx is the context layer for analytics agents https://docs.kaelio.com/ktx
Find a file
Andrey Avtomonov da108e556c
Merge pull request #22 from Kaelio/andreybavt/fix-metabase-readiness
fix(cli): report metabase ingest readiness
2026-05-12 13:01:19 +02:00
.github/workflows ci: parallelize KTX CI checks 2026-05-12 01:44:15 +02:00
assets docs: replace README logo with KTX lockup 2026-05-12 12:56:46 +02:00
docs/superpowers Merge pull request #26 from Kaelio/luca-martial/demo-data-wizard-ux 2026-05-12 01:06:45 -04:00
docs-site docs: standardize env variable examples 2026-05-12 12:24:25 +02:00
examples fix: standardize KTX environment variables 2026-05-12 11:21:37 +02:00
packages Merge pull request #22 from Kaelio/andreybavt/fix-metabase-readiness 2026-05-12 13:01:19 +02:00
python Fix historic SQL ingest setup and progress 2026-05-11 22:35:07 +02:00
scripts Merge pull request #31 from Kaelio/andreybavt/pasted-text-attachment 2026-05-12 12:26:33 +02:00
website feat(docs): add Fumadocs site workspace 2026-05-11 01:08:31 -07:00
.gitignore feat(llm): add local AI SDK DevTools tracing 2026-05-12 11:13:14 +02:00
AGENTS.md rename klo to ktx 2026-05-10 23:51:24 +02:00
CLAUDE.md Initial open-source release 2026-05-10 23:12:26 +02:00
conductor.json [codex] Add Conductor workspace scripts (#2) 2026-05-11 09:55:42 +02:00
GEMINI.md Initial open-source release 2026-05-10 23:12:26 +02:00
LICENSE Initial open-source release 2026-05-10 23:12:26 +02:00
package.json feat: npm-managed Python runtime for @kaelio/ktx (#7) 2026-05-11 15:50:34 +02:00
pnpm-lock.yaml feat(llm): add local AI SDK DevTools tracing 2026-05-12 11:13:14 +02:00
pnpm-workspace.yaml chore: move docs site workspace 2026-05-11 16:53:42 +02:00
pyproject.toml rename klo to ktx 2026-05-10 23:51:24 +02:00
README.md Merge pull request #22 from Kaelio/andreybavt/fix-metabase-readiness 2026-05-12 13:01:19 +02:00
release-policy.json feat: npm-managed Python runtime for @kaelio/ktx (#7) 2026-05-11 15:50:34 +02:00
tsconfig.base.json Initial open-source release 2026-05-10 23:12:26 +02:00
uv.lock feat: npm-managed Python runtime for @kaelio/ktx (#7) 2026-05-11 15:50:34 +02:00

KTX

The context layer for analytics agents

npm version License GitHub stars


KTX turns warehouse metadata, semantic definitions, and business knowledge into reviewable project files that agents can use while planning, querying, and updating analytics work.

A KTX project is a directory of plain files — YAML semantic sources, Markdown knowledge pages, and SQLite state — that you commit to git and review in PRs, just like dbt models.

Who KTX is for

KTX is built for analytics engineers and data teams who want data agents to work on real analytics systems — not just generate one-off SQL.

Use KTX when you want agents to:

  • Generate SQL from approved measures and joins
  • Repair semantic definitions through reviewable diffs
  • Explain metric provenance with warehouse evidence
  • Work alongside dbt, LookML, MetricFlow, Looker, Metabase, and modern BI platforms

Works with PostgreSQL, Snowflake, BigQuery, ClickHouse, MySQL, SQL Server, and SQLite.

Quick start

Install the CLI and run the setup wizard:

npm install @kaelio/ktx
npm install -g @kaelio/ktx
ktx setup

The wizard walks through six steps: configuring your LLM provider, setting up embeddings, connecting your database, adding context sources (dbt, LookML, Metabase, Looker, Notion), building context, and installing agent integration.

If it exits before completion, rerun ktx setup to resume where you left off.

Check your project status:

ktx status
KTX project: /home/user/analytics
Project ready: yes
LLM ready: yes (claude-sonnet-4-6)
Embeddings ready: yes (text-embedding-3-small)
Primary sources configured: yes (postgres-warehouse)
Context sources configured: yes (dbt-main)
KTX context built: yes
Agent integration ready: yes (claude-code:project)

Run the packaged demo without installing globally:

npx @kaelio/ktx setup demo --no-input
npx @kaelio/ktx setup demo inspect

The default demo uses packaged sample data and prebuilt context. It does not require API keys, network access, or an LLM provider.

Generate SQL from a semantic-layer source:

npx @kaelio/ktx sl query --project-dir "$PROJECT_DIR" \
  --connection-id warehouse \
  --measure accounts.account_count \
  --dimension accounts.segment \
  --format sql

List and test a configured warehouse connection:

ktx connection list --project-dir "$PROJECT_DIR"
ktx connection test warehouse --project-dir "$PROJECT_DIR"

The connection test prints the configured driver and discovered table count:

Driver: sqlite
Tables: 1

What's in a project

my-project/
├── ktx.yaml                     # Project configuration
├── semantic-layer/
│   └── warehouse/
│       ├── orders.yaml           # Semantic source definitions
│       ├── customers.yaml
│       └── order_items.yaml
├── knowledge/
│   ├── global/
│   │   ├── revenue.md            # Business definitions and rules
│   │   └── segment-classification.md
│   └── user/
│       └── local/
├── raw-sources/
│   └── warehouse/
│       └── live-database/        # Scan artifacts and reports
└── .ktx/
    └── db.sqlite                 # Local state (git-ignored)

Semantic sources and knowledge pages are committed to git. The .ktx/ directory holds ephemeral state and is git-ignored — delete it and KTX rebuilds on the next run.

Scan the demo warehouse

Scan artifacts are written under raw-sources/warehouse/live-database/<syncId>/ in the project directory.

SCAN_OUTPUT="$(ktx scan warehouse --project-dir "$PROJECT_DIR")"
printf '%s\n' "$SCAN_OUTPUT"
SCAN_RUN_ID="$(printf '%s\n' "$SCAN_OUTPUT" | awk '/^Run: / { print $2 }')"
ktx scan status --project-dir "$PROJECT_DIR" "$SCAN_RUN_ID"
ktx scan report --project-dir "$PROJECT_DIR" "$SCAN_RUN_ID"

For non-SQLite drivers, prefer credential references such as --url env:NAME or --url file:PATH over literal credential URLs.

Managed Python runtime

KTX installs its Python runtime only when a Python-backed command needs it. The runtime lives outside the npm cache, is versioned by the installed CLI version, and is managed by ktx runtime commands.

KTX requires uv on PATH to create the managed runtime. Install uv with your system package manager or the official installer before running Python- backed KTX commands. KTX doesn't download uv automatically; run ktx runtime doctor if runtime installation fails:

ktx runtime install --yes
ktx runtime status
ktx runtime doctor
ktx runtime start
ktx runtime stop
ktx runtime prune --dry-run
ktx runtime prune --yes

The release artifact manifest contains the public npm tarball and the bundled kaelio-ktx runtime wheel. The python/ktx-sl and python/ktx-daemon directories remain source packages for development, not public release artifacts.

Serve agents

KTX integrates with coding agents through CLI skills, an MCP server, or both. The setup wizard configures this automatically — here's what each mode looks like.

CLI skills — the agent calls ktx commands directly through a skill file installed in your agent's config (e.g., .claude/skills/ktx/SKILL.md):

ktx sl query --measure orders.revenue --dimension orders.status --format sql
ktx wiki search "revenue definition"
ktx sl validate orders

MCP server — the agent calls KTX tools over the Model Context Protocol:

ktx serve --mcp stdio \
  --user-id local \
  --semantic-compute \
  --execute-queries \
  --yes

This exposes tools for connections, knowledge search, semantic-layer sources, validation, queries, ingestion, and replay. The --semantic-compute flag starts the managed Python runtime for query planning automatically.

The standalone MCP server exposes connection_list, knowledge_search, knowledge_read, knowledge_write, sl_list_sources, sl_read_source, sl_write_source, sl_validate, sl_query, ingest_trigger, ingest_status, ingest_report, and ingest_replay.

Supported agents: Claude Code, Codex, Cursor, OpenCode, and any agent that reads .agents/ skills or MCP configuration.

Workspace packages

Package Purpose
packages/cli CLI entry point
packages/context Core context engine
packages/llm LLM and embedding providers
packages/connector-bigquery BigQuery scan connector
packages/connector-clickhouse ClickHouse scan connector
packages/connector-mysql MySQL scan connector
packages/connector-postgres Postgres scan connector
packages/connector-snowflake Snowflake scan connector
packages/connector-sqlite SQLite scan connector
packages/connector-sqlserver SQL Server scan connector
python/ktx-sl Semantic-layer query planning
python/ktx-daemon Portable compute service

Development

git clone https://github.com/kaelio/ktx.git
cd ktx
pnpm install
uv sync --all-groups
pnpm run build
pnpm run check

Use the development CLI for local testing:

pnpm run setup:dev
pnpm run link:dev
ktx-dev --help

Debug LLM traces

KTX can capture local AI SDK DevTools traces for LLM calls that run through the KTX provider. Enable it with an environment flag when running an LLM-backed command:

KTX_AI_DEVTOOLS_ENABLED=true ktx dev ingest run \
  --connection-id warehouse \
  --adapter metabase

Traces are written to .devtools/generations.json under the current working directory. To inspect them, run:

pnpm dlx @ai-sdk/devtools

Then open http://localhost:4983. These traces are local-development-only and store prompts, model outputs, tool arguments/results, and raw provider payloads in plain text. Do not enable this in production or for sensitive runs.

The repository uses pnpm for TypeScript packages and uv for Python packages. See Contributing for full development setup, testing, and PR guidelines.

License

KTX is licensed under the Apache License, Version 2.0. See LICENSE.