ktx/packages/cli/test/context at c3d8cedb0bbeb8695eebf2c7e6a2dcd203efe5e4 - apunkt/ktx

apunkt/ktx

mirror of https://github.com/Kaelio/ktx.git synced 2026-07-25 12:01:03 +02:00

Andrey Avtomonov c3d8cedb0b 2026-06-05 12:10:27 +02:00

feat(cli): add ingest LLM rate-limit governor with paced retries (#261)

* feat(cli): add ingest rate limit governor
* feat(cli): wire ingest rate-limit config
* feat(cli): report provider rate-limit signals
* feat(cli): show ingest rate-limit waits
* fix(cli): complete rate-limit event coverage
* fix(cli): abort ingest provider calls cleanly
* fix(cli): propagate ingest cancellation
* fix(cli): reject pre-aborted ingest rate-limit waits
* fix(cli): honor Claude rate-limit reset waits
* fix(cli): retry thrown Codex rate-limit failures
* fix(cli): type Claude rate-limit result details
* fix(cli): emit ingest rate-limit countdowns from rejected signals
* fix(cli): report ai sdk rate-limit header utilization
* fix(cli): gate LLM rate-limit retries on the governor budget

The AI SDK and Codex runtimes retried 429 / opaque rate-limit failures up to 6-7 times with no backoff when constructed without a RateLimitGovernor (scan, memory, setup) or with pacing disabled, ignoring Retry-After and worsening the limit. The outer retry loop only cooperates with the governor's pause, so without active pacing there is no backoff to apply.

Route the retry bound through a single source: RateLimitGovernor.maxRetryAttempts(), which returns retry.maxAttempts when enabled and 1 (no outer retry) when absent or disabled. All three runtimes (ai-sdk, codex, claude-code) now use it, so ingest.rateLimit.retry.maxAttempts genuinely controls attempts and the hard-coded 6 (plus Codex's off-by-one extra attempt) is gone. Backend-native retry (e.g. the AI SDK's maxRetries) still handles transient 429s.

Also correct the ktx.yaml docs for maxWaitMs (caps each wait, not the whole run) and maxAttempts, and sync uv.lock ktx-sl/ktx-daemon to 0.9.0.
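The retry-bound rule described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the actual ktx code: the `RateLimitConfig` shape, the free function `maxRetryAttempts`, the `isRateLimitFailure` check, and `callWithRetries` are all hypothetical names invented here. The only behaviour taken from the commit message is that the bound equals `retry.maxAttempts` when a governor is present and enabled, and 1 (no outer retry) when it is absent or disabled.

```typescript
// Hypothetical config shape, loosely mirroring ingest.rateLimit in ktx.yaml.
interface RateLimitConfig {
  enabled: boolean;
  retry: { maxAttempts: number };
}

// Stand-in for the governor; the real class also handles pacing and waits.
class RateLimitGovernor {
  constructor(readonly config: RateLimitConfig) {}
}

// Single source for the outer retry bound (assumed signature).
// Absent or disabled governor => 1 attempt, i.e. no outer retry.
function maxRetryAttempts(governor?: RateLimitGovernor): number {
  if (!governor || !governor.config.enabled) return 1;
  return governor.config.retry.maxAttempts;
}

// Assumed classification: treat an error carrying status 429 as a rate limit.
function isRateLimitFailure(error: unknown): boolean {
  return (
    typeof error === "object" &&
    error !== null &&
    (error as { status?: number }).status === 429
  );
}

// Outer retry loop bounded by the governor budget.
async function callWithRetries<T>(
  call: () => Promise<T>,
  governor?: RateLimitGovernor,
): Promise<T> {
  const attempts = maxRetryAttempts(governor);
  let lastError: unknown = new Error("no attempts were made");
  for (let attempt = 1; attempt <= attempts; attempt++) {
    try {
      return await call();
    } catch (error) {
      // Non-rate-limit failures are not retried by this loop.
      if (!isRateLimitFailure(error)) throw error;
      lastError = error;
      // A real governor would pause here (e.g. honouring Retry-After)
      // before the next attempt; the sketch omits the wait.
    }
  }
  throw lastError;
}
```

With this shape, a runtime constructed without a governor makes exactly one outer attempt and leaves transient 429 handling to the backend's own retry mechanism, which matches the behaviour the commit message describes.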
connections	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
core	feat(cli): add ingest LLM rate-limit governor with paced retries (#261)	2026-06-05 12:10:27 +02:00
daemon	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
index-sync	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
ingest	feat(cli): add ingest LLM rate-limit governor with paced retries (#261)	2026-06-05 12:10:27 +02:00
llm	feat(cli): add ingest LLM rate-limit governor with paced retries (#261)	2026-06-05 12:10:27 +02:00
mcp	feat: report MCP client telemetry (#242)	2026-05-30 18:00:25 +02:00
memory	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
project	feat(cli): add ingest LLM rate-limit governor with paced retries (#261)	2026-06-05 12:10:27 +02:00
prompts	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
scan	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
search	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
skills	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
sl	feat(cli): shell completion for commands, flags, and entity names (#244)	2026-05-31 23:44:33 +02:00
sql-analysis	feat(query-history): scope mining to modeled schemas by default (#258)	2026-06-03 17:19:42 +02:00
test	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
tools	test: split cli tests from source tree (#216)	2026-05-26 08:49:05 +02:00
wiki	feat(cli): shell completion for commands, flags, and entity names (#244)	2026-05-31 23:44:33 +02:00