ktx/packages/llm at revamp-cli-setup-docs - apunkt/ktx - bitfreedom.net: free all bits, everywhere

apunkt/ktx

mirror of https://github.com/Kaelio/ktx.git synced 2026-07-22 11:51:01 +02:00

Andrey Avtomonov 49f1e2720e fix(llm): wire prompt caching through all Anthropic call sites (#90 ) * fix(llm): wire prompt caching through all Anthropic call sites - page-triage classifier + light-extraction now put the static skill prompt in `system:` so the per-document caches hit instead of re-sending boilerplate in the user message every call. - Description generation builders return `{ system, user }` with instruction text + word limit moved into the cacheable system. - Relationship-LLM proposal framing moved to `system:`. - `KtxMessageBuilder.wrapSimple` skips the history breakpoint for single-message calls (cache write that could never be reused). - Gateway backend now sets `anthropic-beta: extended-cache-ttl-2025-04-11` so 1h TTLs don't silently downgrade to 5m on Gateway routes. * fix(llm): keep wrapSimple history breakpoint so multi-step agent loops cache Reverts the wrapSimple `messages.length > 1` guard from the prior commit. agent-runner uses wrapSimple with a single user message, but generateText runs a multi-step tool loop inside it — the cache marker on the first user message is reused by every subsequent step, so it isn't waste. The release validator (scripts/validate-llm-debug-jsonl.mjs) also requires a `message-part` marker target in captured debug JSONL.		2026-05-14 15:36:27 +02:00
..
src	fix(llm): wire prompt caching through all Anthropic call sites (#90 )	2026-05-14 15:36:27 +02:00
package.json	ci: add codecov coverage reporting (#82 )	2026-05-14 01:13:31 +02:00
tsconfig.json	Initial open-source release	2026-05-10 23:12:26 +02:00
vitest.config.ts	Initial open-source release	2026-05-10 23:12:26 +02:00