ktx/docs-site/content/docs/guides/llm-configuration.mdx

---
title: LLM configuration
description: Configure KTX LLM providers, model roles, and prompt caching.
---

KTX uses the top-level `llm` block in `ktx.yaml` for text generation,
structured extraction, and ingest or memory agent loops.

## Backends

Set `llm.provider.backend` to one of these values:

- `anthropic`: Use the Anthropic API through `ANTHROPIC_API_KEY` or the
  configured `api_key` reference.
- `vertex`: Use Vertex AI Anthropic models through Google Cloud credentials.
- `gateway`: Use AI Gateway-compatible Anthropic model IDs.
- `claude-code`: Use your local Claude Code session through the Claude Agent
  SDK. KTX removes provider-routing environment variables from Claude Code
  child processes, so this backend doesn't silently fall back to
  `ANTHROPIC_API_KEY`, Vertex, Gateway, or Bedrock credentials.
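
The environment scrubbing described for `claude-code` can be pictured with a small sketch. This is not KTX's actual implementation: the function name `scrubProviderEnv` and the exact variable list are assumptions for illustration, and the real set of variables KTX removes may differ.

```typescript
// Illustrative list only (assumption): variables that could route Claude Code
// to a different provider. The set KTX actually removes may differ.
const PROVIDER_ROUTING_VARS: readonly string[] = [
  "ANTHROPIC_API_KEY",
  "ANTHROPIC_AUTH_TOKEN",
  "ANTHROPIC_BASE_URL",
  "CLAUDE_CODE_USE_VERTEX",
  "CLAUDE_CODE_USE_BEDROCK",
];

// Hypothetical helper: returns a copy of `env` without provider-routing
// variables and without undefined entries. The input object is not mutated.
function scrubProviderEnv(
  env: Record<string, string | undefined>,
): Record<string, string> {
  const blocked = new Set(PROVIDER_ROUTING_VARS);
  const cleaned: Record<string, string> = {};
  for (const [key, value] of Object.entries(env)) {
    if (value !== undefined && !blocked.has(key)) {
      cleaned[key] = value;
    }
  }
  return cleaned;
}
```

A caller would pass the result as the child process environment, so the Claude Code session can only authenticate with its own local login.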

## Claude Code

Use aliases or full Claude model IDs in `llm.models`:

```yaml
llm:
  provider:
    backend: claude-code
  models:
    default: sonnet
    triage: haiku
    candidateExtraction: sonnet
    curator: sonnet
    reconcile: sonnet
    repair: sonnet
```

During setup, choose the Claude Code backend interactively, or pass the backend
and model as flags in automation:

```bash
ktx setup --llm-backend claude-code --llm-model opus --no-input
```

For Claude Code, `sonnet`, `opus`, and `haiku` map to the current KTX defaults.
You can also pass a full Claude model ID, such as `claude-opus-4-7`.

`claude-code` keeps KTX tool boundaries intact. KTX exposes only the MCP tools
needed for the current KTX agent loop, disables Claude Code built-in tools,
keeps plugins empty, and denies every non-KTX tool request through
`canUseTool`. The Claude Agent SDK may still report host-discovered slash
commands, skills, and subagent names in init metadata; that metadata is not an
execution grant for KTX agent loops.
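
The deny-by-default policy above can be sketched as a permission callback. This is a minimal illustration, not KTX's code: the `PermissionResult` type is a local stand-in for the SDK's result shape, and the `mcp__ktx__` prefix and tool names are hypothetical. Check the callback signature against the Claude Agent SDK version you have installed.

```typescript
// Local stand-in (assumption) for the SDK's permission result shape.
type PermissionResult =
  | { behavior: "allow"; updatedInput: Record<string, unknown> }
  | { behavior: "deny"; message: string };

// Hypothetical prefix: MCP tools are assumed to be named mcp__<server>__<tool>.
const KTX_TOOL_PREFIX = "mcp__ktx__";

// Allow a tool only if it is a KTX MCP tool that the current agent loop
// registered; deny everything else, including Claude Code built-in tools.
function decideTool(
  toolName: string,
  input: Record<string, unknown>,
  allowedTools: ReadonlySet<string>,
): PermissionResult {
  if (toolName.startsWith(KTX_TOOL_PREFIX) && allowedTools.has(toolName)) {
    return { behavior: "allow", updatedInput: input };
  }
  return {
    behavior: "deny",
    message: `Tool ${toolName} is not part of the current KTX agent loop.`,
  };
}

// Async wrapper in the shape a canUseTool option would expect (assumed).
function makeCanUseTool(allowedTools: ReadonlySet<string>) {
  return async (
    toolName: string,
    input: Record<string, unknown>,
  ): Promise<PermissionResult> => decideTool(toolName, input, allowedTools);
}
```

Because the decision depends only on the explicit allow set, a slash command or skill name that appears in init metadata never reaches the allow branch.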

## Prompt caching

`llm.promptCaching` is only partially supported on `claude-code`. KTX doesn't
pass Anthropic cache-control markers to the Claude Agent SDK, so status and
doctor warn when you configure prompt-cache TTL, tool, or history fields that
the Claude Agent SDK backend ignores.
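
The warning behavior can be sketched as a check over the configured fields. The key names `ttl`, `tools`, and `history` and the function name `promptCachingWarnings` are assumptions for illustration; the real `llm.promptCaching` schema and warning text may differ.

```typescript
// Hypothetical shape of llm.promptCaching; real key names may differ.
interface PromptCachingConfig {
  enabled?: boolean;
  ttl?: string;
  tools?: boolean;
  history?: boolean;
}

// Returns one warning per configured field that the claude-code backend
// would ignore. Other backends produce no warnings in this sketch.
function promptCachingWarnings(
  backend: string,
  caching: PromptCachingConfig | undefined,
): string[] {
  if (backend !== "claude-code" || caching === undefined) {
    return [];
  }
  const ignored: Array<keyof PromptCachingConfig> = ["ttl", "tools", "history"];
  return ignored
    .filter((field) => caching[field] !== undefined)
    .map(
      (field) =>
        `llm.promptCaching.${field} is ignored by the claude-code backend`,
    );
}
```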