ktx/scripts
Andrey Avtomonov 494618ab14
feat: add codex llm backend for ktx runtime work (#253)
* feat: add codex sdk runner foundation

* feat: parse codex runtime events

* feat: expose codex runtime mcp tools

* feat: add codex llm runtime

* feat: wire codex llm backend

* test: avoid Array.fromAsync in codex runner test

* docs: document codex llm backend

* fix: tighten codex runtime config ownership

* fix: use codex sdk env and thread options

* fix: parse codex sdk event shapes

* test: add codex backend live smoke

* docs: clarify codex backend isolation

* fix: drive codex loop metrics from mcp events

* fix: enforce codex local step budget

* docs: disclose codex isolation limits

* fix: count all codex agent steps and stream step callbacks live

The agent-loop step budget only counted completed mcp_tool_call items, so
built-in command_execution steps (which the public Codex SDK/CLI surface can
still expose) never decremented the budget, letting ingest/reconciliation run
past stepBudget until Codex stopped on its own. onStepFinish was also replayed
only after the whole stream drained, so live work_unit_step / reconciliation
progress appeared stuck until the Codex process exited.

collectEvents is now the single live step accumulator: it counts every
completed agent-action item via a shared isCompletedAgentStep predicate
(command_execution, mcp_tool_call, file_change, web_search), fires onStepFinish
as each step completes, and enforces the budget on that broader count. A
no-tool turn still counts as one step. toolFailures stays MCP-specific, since a
non-zero command exit is normal agent exploration, not a loop failure.
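
The accumulator described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from the PR: the event shape (`{ type: 'item.completed', item }`), the `createStepAccumulator` helper, and its `record`/`total` methods are hypothetical. Only `collectEvents`, `isCompletedAgentStep`, `onStepFinish`, `stepBudget`, and the four item types come from the message.

```javascript
// Item types that count as an agent step, as listed in the commit message.
const AGENT_STEP_TYPES = new Set([
  'command_execution',
  'mcp_tool_call',
  'file_change',
  'web_search',
]);

// Assumed event shape: { type: 'item.completed', item: { type, ... } }.
function isCompletedAgentStep(event) {
  return event.type === 'item.completed' && AGENT_STEP_TYPES.has(event.item?.type);
}

// Hypothetical helper: counts steps as events arrive and reports budget state.
function createStepAccumulator({ stepBudget, onStepFinish }) {
  let steps = 0;
  return {
    // Returns false once the budget is spent, so the caller can stop reading.
    record(event) {
      if (isCompletedAgentStep(event)) {
        steps += 1;
        onStepFinish?.(event.item, steps); // fired live, not after the stream drains
      }
      return steps < stepBudget;
    },
    // A turn with no tool activity still counts as one step.
    total() {
      return Math.max(steps, 1);
    },
  };
}

// Reads an async event stream and stops as soon as the budget is exhausted.
async function collectEvents(stream, options) {
  const events = [];
  const accumulator = createStepAccumulator(options);
  for await (const event of stream) {
    events.push(event);
    if (!accumulator.record(event)) break;
  }
  return { events, steps: accumulator.total() };
}
```

Keeping the counting in one place means the budget check and the progress callback cannot disagree about what a step is. Whether the real implementation also fires `onStepFinish` for the no-tool turn is not stated in the message, so the sketch leaves that out.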

* test: align ingest llm-guard assertions with codex backend

The skip-llm ingest guard message now lists codex as a valid backend and
mentions a Claude Code/Codex session plus a codex setup hint, but this slow
suite test still asserted the pre-codex wording. Update it to match the
production message (already covered by the local-bundle-runtime unit test) and
add the codex setup-line assertion.

* fix: treat codex error:null tool calls as success

The Codex SDK serializes error: null on successful mcp_tool_call items, so
the failure check (item.error !== undefined) flagged every successful tool
call as failed with the empty-payload default "Codex turn failed". This
killed every ingest work unit under the codex backend before it could
produce a patch.

Key on status === 'failed' (authoritative, always set) and only treat a
populated error object as a failure. Add a regression test built from a
verbatim real-SDK event capture.
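
A minimal sketch of the corrected check, assuming an item object with the `status` and `error` fields named above. The helper names `isFailedToolCall` and `oldFailureCheck` are hypothetical and exist only to contrast the two behaviors.

```javascript
// The pre-fix check from the message: flags error: null as a failure.
function oldFailureCheck(item) {
  return item.error !== undefined;
}

// Hypothetical helper reflecting the fix: status is authoritative, and
// only a populated error object counts as a failure on its own.
function isFailedToolCall(item) {
  if (item.status === 'failed') return true;
  const error = item.error;
  return (
    error !== null &&
    error !== undefined &&
    typeof error === 'object' &&
    Object.keys(error).length > 0
  );
}

// A successful call as the message describes it: error serialized as null.
const successfulItem = { type: 'mcp_tool_call', status: 'completed', error: null };
console.log(oldFailureCheck(successfulItem)); // true (the bug)
console.log(isFailedToolCall(successfulItem)); // false
```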

* fix: default codex backend to gpt-5.5 and report real probe errors

The previous default gpt-5.3-codex is an API-key-only model that the OpenAI
API rejects under ChatGPT-account (subscription) auth, so codex status/setup
failed with a misleading "authentication is not usable" message even though
auth was fine.

- Default codex model is now gpt-5.5 (works on both subscription and API-key
  auth); the curated setup picker offers gpt-5.5 / gpt-5.4 / gpt-5.4-mini and
  keeps free-form entry for account-specific ids (e.g. gpt-5.3-codex-spark).
- runCodexAuthProbe now distinguishes "model not available" from an auth
  failure and surfaces the real API error: collectEvents retains stream
  events when the SDK throws on a non-zero exit, and the API error JSON
  envelope is unwrapped to its human-readable message.
- The Codex isolation warning now renders inside the clack setup frame.
- Docs updated to gpt-5.5 with a note that *-codex ids require API-key auth.
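
The envelope unwrapping mentioned in the second bullet might look something like this. It is a sketch only: `unwrapApiErrorMessage` is a hypothetical name, and the `{ "error": { "message": ... } }` shape is assumed from the usual OpenAI API error format rather than taken from the PR.

```javascript
// Hypothetical helper: pulls the human-readable message out of an API error
// payload, falling back to the raw text when it is not a JSON envelope.
function unwrapApiErrorMessage(raw) {
  const start = raw.indexOf('{');
  if (start === -1) return raw;
  try {
    const parsed = JSON.parse(raw.slice(start));
    // Assumed envelope: { error: { message } }, with a flat { message } fallback.
    const message = parsed?.error?.message ?? parsed?.message;
    return typeof message === 'string' && message.length > 0 ? message : raw;
  } catch {
    // Not valid JSON after all: keep the original text so nothing is lost.
    return raw;
  }
}
```

Falling back to the raw string in every failure path keeps the probe from replacing a real error with an empty or generic one.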

* fix: require llm.models.default in status and match codex probe remediation

Status reported a project as ready when a non-none LLM backend was configured
without llm.models.default, but the runtime (resolveModelSlots) hard-requires
it, so ingest/scan/memory threw after `ktx status` said the project was usable.
buildLlmStatus now fails for any non-none backend missing models.default and no
longer invents a fallback model for claude-code/codex.
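
The stricter status rule can be illustrated with a small sketch. The function name `checkLlmStatus`, the `llm.backend` key, the result shape, and treating a missing backend as `none` are all assumptions made for the example. Only the `llm.models.default` requirement and the `none` backend come from the message.

```javascript
// Hypothetical stand-in for the status check described above.
// Assumed config shape: { llm: { backend, models: { default } } }.
function checkLlmStatus(config) {
  const backend = config?.llm?.backend ?? 'none';
  if (backend === 'none') return { ok: true, backend };

  const model = config?.llm?.models?.default;
  if (typeof model !== 'string' || model.trim() === '') {
    // No fallback model is invented: the runtime would reject this config.
    return { ok: false, backend, fix: 'Set llm.models.default in the project config.' };
  }
  return { ok: true, backend, model };
}

console.log(checkLlmStatus({ llm: { backend: 'codex' } }).ok); // false
console.log(checkLlmStatus({ llm: { backend: 'codex', models: { default: 'gpt-5.5' } } }).ok); // true
```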

Codex probe failures now carry a category-matched fix: a model-access failure
points the user to llm.models.default instead of the auth/install remediation.
runCodexAuthProbe returns the fix and status consumes it; the message stays
self-sufficient so setup output is unchanged.

Docs: README now lists the codex backend and local Codex auth; ktx-setup.mdx
states --llm-model only accepts codex/default or gpt-*/codex-* ids.

Repaired four doctor fixtures that configured a backend without models.default
(the now-correctly-blocked config) and added coverage for the new behavior.
2026-06-02 13:57:11 +02:00
acquire-public-benchmark-fixtures.mjs Initial open-source release 2026-05-10 23:12:26 +02:00
acquire-public-benchmark-fixtures.test.mjs rename klo to ktx 2026-05-10 23:51:24 +02:00
adventureworks-oltp-source.json Initial open-source release 2026-05-10 23:12:26 +02:00
adventureworks-oltp-source.test.mjs Initial open-source release 2026-05-10 23:12:26 +02:00
anti-fixture-conditional.test.mjs test: split cli tests from source tree (#216) 2026-05-26 08:49:05 +02:00
build-adventureworks-oltp-fixture.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
build-benchmark-snapshot.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
build-benchmark-snapshot.test.mjs test: split cli tests from source tree (#216) 2026-05-26 08:49:05 +02:00
build-evidence-fusion-adversarial-fixtures.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
build-python-runtime-wheel.mjs fix: improve setup wizard behavior (#127) 2026-05-17 19:15:09 +02:00
build-python-runtime-wheel.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
check-boundaries.mjs fix: update ktx CI boundary checks (#223) 2026-05-26 23:03:47 +02:00
check-boundaries.test.mjs fix: update ktx CI boundary checks (#223) 2026-05-26 23:03:47 +02:00
ci-artifact-upload.test.mjs ci: parallelize KTX CI checks 2026-05-12 01:44:15 +02:00
codex-backend-live-smoke.mjs feat: add codex llm backend for ktx runtime work (#253) 2026-06-02 13:57:11 +02:00
codex-backend-live-smoke.test.mjs feat: add codex llm backend for ktx runtime work (#253) 2026-06-02 13:57:11 +02:00
conductor-run.sh [codex] Add Conductor workspace scripts (#2) 2026-05-11 09:55:42 +02:00
conductor-scripts.test.mjs perf(setup): speed up conductor setup and make it rerun-safe (#107) 2026-05-15 12:06:37 +02:00
conductor-setup.sh chore: preserve superpowers docs symlink in worktrees (#158) 2026-05-20 00:11:38 +02:00
examples-docs.test.mjs feat(cli)!: remove fast mode; ktx ingest always builds enriched context (KLO-721) (#237) 2026-05-29 17:41:04 +02:00
installed-live-database-smoke.mjs feat(cli)!: remove fast mode; ktx ingest always builds enriched context (KLO-721) (#237) 2026-05-29 17:41:04 +02:00
installed-live-database-smoke.test.mjs feat(cli)!: remove fast mode; ktx ingest always builds enriched context (KLO-721) (#237) 2026-05-29 17:41:04 +02:00
ktx-reset.sh fix(snowflake): unblock multi-schema ingest and relationship discovery (#204) 2026-05-23 10:41:30 +02:00
link-dev-cli.mjs rename klo to ktx 2026-05-10 23:51:24 +02:00
link-dev-cli.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
local-embeddings-runtime-smoke.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
local-embeddings-runtime-smoke.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
normalize-lcov-paths.mjs ci: configure Codecov coverage uploads (#150) 2026-05-19 16:56:48 +02:00
normalize-lcov-paths.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
package-artifacts.mjs feat(cli): stream plain ktx ingest progress to stderr (KLO-726) (#251) 2026-06-01 23:31:31 +02:00
package-artifacts.test.mjs feat(cli): stream plain ktx ingest progress to stderr (KLO-726) (#251) 2026-06-01 23:31:31 +02:00
pglite-hybrid-search-spike.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
pglite-owner-process-prototype.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
pglite-sl-search-prototype.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
prepare-cli-bin.mjs rename klo to ktx 2026-05-10 23:51:24 +02:00
public-benchmark-manifest.json ci: run pre-commit checks in CI (#74) 2026-05-13 19:49:25 +02:00
public-npm-release-metadata.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
public-npm-release-metadata.test.mjs feat(release): one version everywhere via @semantic-release/git (#186) 2026-05-20 17:01:26 +02:00
published-package-smoke-config.mjs fix(cli): remove ktx setup subcommands (#42) 2026-05-13 00:38:26 +02:00
published-package-smoke.mjs fix(release): repair next npm release workflow (#122) 2026-05-17 01:41:07 +02:00
published-package-smoke.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
relationship-benchmark-report.mjs test: split cli tests from source tree (#216) 2026-05-26 08:49:05 +02:00
relationship-orbit-verification.mjs feat: merge ingest and scan 2026-05-14 01:43:06 +02:00
relationship-orbit-verification.test.mjs feat: merge ingest and scan 2026-05-14 01:43:06 +02:00
release-readiness.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
release-readiness.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
release-workflow.test.mjs ci(release): restore RELEASE_PAT for branch push (#189) 2026-05-20 17:57:35 +02:00
run-ktx.mjs perf(cli): cache pnpm run ktx builds against a stamp file (#113) 2026-05-15 15:49:39 +02:00
run-ktx.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
semantic-release-config.cjs chore: revert repo references to Kaelio/ktx and remove rename-resilience (#252) 2026-06-02 00:14:43 +02:00
semantic-release-config.test.mjs chore: revert repo references to Kaelio/ktx and remove rename-resilience (#252) 2026-06-02 00:14:43 +02:00
setup-dev.mjs fix(cli): build runtime assets during dev setup (#121) 2026-05-17 01:04:44 +02:00
setup-dev.test.mjs fix(cli): build runtime assets during dev setup (#121) 2026-05-17 01:04:44 +02:00
standalone-ci-workflow.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
test-tiering.test.mjs test: split cli tests from source tree (#216) 2026-05-26 08:49:05 +02:00
update-public-release-version.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
update-public-release-version.test.mjs chore(workspace): gate dead-code with knip production mode (#196) 2026-05-21 15:28:58 +02:00
upgrade-dependencies.mjs chore: upgrade dependencies and tooling (#232) 2026-05-29 11:56:55 +02:00
upgrade-dependencies.test.mjs chore: upgrade dependencies and tooling (#232) 2026-05-29 11:56:55 +02:00
validate-llm-debug-jsonl.mjs rename klo to ktx 2026-05-10 23:51:24 +02:00
validate-llm-debug-jsonl.test.mjs rename klo to ktx 2026-05-10 23:51:24 +02:00