ktx is the context layer for analytics agents https://docs.kaelio.com/ktx
Find a file
2026-05-10 23:12:26 +02:00
.github/workflows Initial open-source release 2026-05-10 23:12:26 +02:00
examples Initial open-source release 2026-05-10 23:12:26 +02:00
packages Initial open-source release 2026-05-10 23:12:26 +02:00
python Initial open-source release 2026-05-10 23:12:26 +02:00
scripts Initial open-source release 2026-05-10 23:12:26 +02:00
.gitignore Initial open-source release 2026-05-10 23:12:26 +02:00
AGENTS.md Initial open-source release 2026-05-10 23:12:26 +02:00
CLAUDE.md Initial open-source release 2026-05-10 23:12:26 +02:00
GEMINI.md Initial open-source release 2026-05-10 23:12:26 +02:00
LICENSE Initial open-source release 2026-05-10 23:12:26 +02:00
package.json Initial open-source release 2026-05-10 23:12:26 +02:00
pnpm-lock.yaml Initial open-source release 2026-05-10 23:12:26 +02:00
pnpm-workspace.yaml Initial open-source release 2026-05-10 23:12:26 +02:00
pyproject.toml Initial open-source release 2026-05-10 23:12:26 +02:00
README.md Initial open-source release 2026-05-10 23:12:26 +02:00
release-policy.json Initial open-source release 2026-05-10 23:12:26 +02:00
tsconfig.base.json Initial open-source release 2026-05-10 23:12:26 +02:00
uv.lock Initial open-source release 2026-05-10 23:12:26 +02:00

KLO

KLO is a workspace-first context layer for database agents. It stores warehouse memory in a project directory, generates and validates semantic-layer YAML, indexes knowledge, scans database schemas, and exposes the result through a CLI and MCP server.

KLO projects are plain files: YAML, Markdown, SQLite state, and generated artifacts. You can inspect them, commit them, and serve them to any MCP client.

What KLO provides

  • Durable warehouse memory with semantic-layer sources and knowledge pages.
  • Native scan connectors for SQLite, Postgres, MySQL, ClickHouse, SQL Server, BigQuery, Snowflake, and PostHog.
  • Agentic ingest with provenance links, tool transcripts, and replay metadata.
  • Local semantic-layer query planning and optional query execution.
  • A stdio MCP server with tools for connections, knowledge, semantic-layer sources, ingest reports, and replay.

Quick start

Run the pre-seeded demo from the repository root:

pnpm install
pnpm run setup:dev
pnpm run klo -- setup demo --no-input
pnpm run klo -- setup demo inspect

The default demo uses packaged sample data and prebuilt context. It does not require API keys, network access, or an LLM provider.

To replay the packaged ingest run, use:

pnpm run klo -- setup demo --mode replay --no-input

To run the full agentic demo with an LLM provider, set a provider key for the current process:

ANTHROPIC_API_KEY=$YOUR_ANTHROPIC_API_KEY \
  pnpm run klo -- setup demo --mode full --no-input

Interactive full-demo setup can prompt for a provider key without writing the key to klo.yaml.

Build a local project

Create a project from the repository root:

uv sync --all-packages
source .venv/bin/activate

PROJECT_DIR="$(mktemp -d)/klo-demo"
pnpm run klo -- init "$PROJECT_DIR" --name klo-demo

Create a SQLite warehouse:

python - "$PROJECT_DIR/demo.db" <<'PY'
import sqlite3
import sys

conn = sqlite3.connect(sys.argv[1])
conn.executescript("""
DROP TABLE IF EXISTS accounts;
CREATE TABLE accounts (
  account_id INTEGER PRIMARY KEY,
  account_name TEXT NOT NULL,
  segment TEXT NOT NULL,
  region TEXT NOT NULL
);
INSERT INTO accounts VALUES
  (1, 'Acme Analytics', 'Mid-Market', 'NA'),
  (2, 'Beacon Bank', 'Enterprise', 'EMEA'),
  (3, 'Cobalt Coffee', 'SMB', 'NA'),
  (4, 'Delta Devices', 'Mid-Market', 'APAC'),
  (5, 'Evergreen Energy', 'Enterprise', 'NA');
""")
conn.close()
PY

Replace the generated klo.yaml:

cat > "$PROJECT_DIR/klo.yaml" <<YAML
project: klo-demo
connections:
  warehouse:
    driver: sqlite
    path: $PROJECT_DIR/demo.db
    readonly: true
storage:
  state: sqlite
  search: sqlite-fts5
  git:
    auto_commit: true
    author: "klo <klo@example.com>"
memory:
  auto_commit: true
YAML

Write and validate a semantic-layer source:

pnpm run klo -- sl write accounts --project-dir "$PROJECT_DIR" \
  --connection-id warehouse --yaml 'name: accounts
table: accounts
description: CRM accounts with segmentation attributes.
grain:
  - account_id
columns:
  - name: account_id
    type: number
  - name: account_name
    type: string
  - name: segment
    type: string
  - name: region
    type: string
measures:
  - name: account_count
    expr: count(account_id)
joins: []
'

pnpm run klo -- sl validate accounts --project-dir "$PROJECT_DIR" \
  --connection-id warehouse

Generate SQL and execute the query:

pnpm run klo -- sl query --project-dir "$PROJECT_DIR" \
  --connection-id warehouse \
  --measure accounts.account_count \
  --dimension accounts.segment \
  --order-by accounts.account_count:desc \
  --limit 5 \
  --format sql

pnpm run klo -- sl query --project-dir "$PROJECT_DIR" \
  --connection-id warehouse \
  --measure accounts.account_count \
  --dimension accounts.segment \
  --order-by accounts.account_count:desc \
  --limit 5 \
  --execute \
  --max-rows 5

List and test the warehouse connection:

pnpm run klo -- connection list --project-dir "$PROJECT_DIR"
pnpm run klo -- connection test warehouse --project-dir "$PROJECT_DIR"

The connection test prints the configured driver and discovered table count:

Driver: sqlite
Tables: 1

Scan the demo warehouse

Scan artifacts are written under raw-sources/warehouse/live-database/<syncId>/ in the project directory.


SCAN_OUTPUT="$(pnpm run klo -- scan warehouse --project-dir "$PROJECT_DIR")"
printf '%s\n' "$SCAN_OUTPUT"
SCAN_RUN_ID="$(printf '%s\n' "$SCAN_OUTPUT" | awk '/^Run: / { print $2 }')"
pnpm run klo -- scan status --project-dir "$PROJECT_DIR" "$SCAN_RUN_ID"
pnpm run klo -- scan report --project-dir "$PROJECT_DIR" "$SCAN_RUN_ID"

For non-SQLite drivers, prefer credential references such as --url env:NAME or --url file:PATH over literal credential URLs.

Serve MCP

Start the Python compute daemon in one terminal:

source .venv/bin/activate
uv run klo-daemon serve-http --host 127.0.0.1 --port 8765

Start the stdio MCP server in another terminal:

pnpm run klo -- serve --mcp stdio --project-dir "$PROJECT_DIR" \
  --user-id local \
  --semantic-compute-url http://127.0.0.1:8765 \
  --execute-queries

The MCP server exposes connection_list, knowledge_search, knowledge_read, knowledge_write, sl_list_sources, sl_read_source, sl_write_source, sl_validate, sl_query, ingest_trigger, ingest_status, ingest_report, and ingest_replay.

Workspace packages

  • packages/context: core TypeScript context library.
  • packages/cli: CLI wrapper over the context package.
  • packages/llm: LLM and embedding provider helpers.
  • packages/connector-bigquery: BigQuery scan connector.
  • packages/connector-clickhouse: ClickHouse scan connector.
  • packages/connector-mysql: MySQL scan connector.
  • packages/connector-postgres: Postgres scan connector.
  • packages/connector-posthog: PostHog scan connector.
  • packages/connector-snowflake: Snowflake scan connector.
  • packages/connector-sqlite: SQLite scan connector.
  • packages/connector-sqlserver: SQL Server scan connector.
  • python/klo-sl: semantic-layer engine.
  • python/klo-daemon: portable compute service for semantic-layer operations.

Development

Install dependencies and run checks:

pnpm install
pnpm run check
uv sync --all-packages
source .venv/bin/activate
uv run pytest

Use the optional development binary when you want a local klo-dev command:

pnpm run link:dev
klo-dev --help

The repository uses pnpm for TypeScript packages and uv for Python packages.

Release status

This repository is prepared for source publication. Package publishing is still disabled by release-policy.json; registry names, public versions, package visibility, and provenance policy must be chosen before publishing artifacts to npm or Python package indexes.

Build local package artifacts with:

source .venv/bin/activate
pnpm run artifacts:check
pnpm run release:readiness

License

KLO is licensed under the Apache License, Version 2.0. See LICENSE.