plano

mirror of https://github.com/katanemo/plano.git synced 2026-04-25 00:36:34 +02:00

Author	SHA1	Message	Date
Adil Hafeez	42b7927122	feat: head+tail trim with ellipsis and 16-turn cap for routing prompt Replaces the previous head-only truncation of oversized user messages with a middle-trim (head + ellipsis + tail) that preserves both the task framing (start of message) and the actual ask (end of message) — a common shape for long pasted content like code dumps or specs. The unicode ellipsis also signals to the router model that content was dropped, which can improve classification accuracy on truncated prompts. Also adds an outer guardrail: only the last `MAX_ROUTING_TURNS` (16) filtered messages are considered when building the routing prompt. This bounds prompt growth for long conversations before the token-budget loop runs, matching the approach HuggingFace chat-ui takes in its arch-router client. Tests: - test_huge_single_user_message_is_middle_trimmed: regression test for the 500KB user message scenario. Verifies the prompt stays bounded, head + tail markers both survive, and the ellipsis is present. - test_turn_cap_limits_routing_history: builds a 32-turn conversation and verifies only the last 16 make it into the prompt. - test_trim_middle_utf8_helper: unit test for the helper covering the no-op path, the 60/40 split, the too-small-for-marker fallback, and UTF-8 boundary safety for multi-byte characters. - Updated test_conversation_trim_upto_user_message to reflect the new middle-trim behavior.	2026-04-17 19:18:30 -07:00
Adil Hafeez	c90b699c90	fix: surface real upstream error messages from orchestrator HTTP client `post_and_extract_content` was unconditionally deserializing the upstream response body as a `ChatCompletionsResponse`, which meant 4xx/5xx error bodies (OpenAI-style `{"error": {...}}` envelopes) failed with confusing messages like `missing field 'id' at line 1 column 391`. The real upstream message (e.g. "This model's maximum context length is 32768 tokens...") only appeared once as a warn log and then got buried in the generic "Failed to parse JSON response" path. Now we: - Check the HTTP status before attempting to parse the success body. - On non-2xx, extract a human-readable message from the OpenAI-style error envelope (or fall back to a UTF-8-safe truncated raw body). - Return a dedicated `HttpError::Upstream { status, message }` variant so callers can log / surface / retry based on the real status code. - Truncate raw bodies in warn logs to 512 bytes (UTF-8-safe) to avoid flooding logs with oversized JSON or HTML error pages.	2026-04-17 18:41:15 -07:00
Adil Hafeez	321c28da37	fix: truncate oversized user messages in orchestrator routing prompt The orchestrator trimmer had a bypass that kept the latest user message whole even when it alone exceeded the configured token budget. This caused brightstaff to send a ~500KB prompt to the Plano-Orchestrator model, which rejected it with a 400 "context length exceeded" from the upstream 32K-token window. Brightstaff then surfaced a confusing "missing field id" parse error instead of the real upstream message. Fix the bypass by trimming the overflowing user message from the end toward the beginning until it fits in the remaining token budget. The beginning of the message (where user intent usually lives) is preserved and the tail is dropped. Added a UTF-8-safe byte-truncation helper and a regression test that mirrors the production payload (a single ~500KB user message with a small budget).	2026-04-17 18:00:02 -07:00
Adil Hafeez	37600fd07a	fix: passthrough_auth accepts Anthropic x-api-key and normalizes to upstream format (#892 ) Some checks are pending CI / pre-commit (push) Waiting to run Details CI / plano-tools-tests (push) Waiting to run Details CI / native-smoke-test (push) Waiting to run Details CI / docker-build (push) Waiting to run Details CI / validate-config (push) Waiting to run Details CI / security-scan (push) Blocked by required conditions Details CI / test-prompt-gateway (push) Blocked by required conditions Details CI / test-model-alias-routing (push) Blocked by required conditions Details CI / test-responses-api-with-state (push) Blocked by required conditions Details CI / e2e-plano-tests (3.10) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.11) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.12) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.13) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.14) (push) Blocked by required conditions Details CI / e2e-demo-preference (push) Blocked by required conditions Details CI / e2e-demo-currency (push) Blocked by required conditions Details Publish docker image (latest) / build-arm64 (push) Waiting to run Details Publish docker image (latest) / build-amd64 (push) Waiting to run Details Publish docker image (latest) / create-manifest (push) Blocked by required conditions Details Build and Deploy Documentation / build (push) Waiting to run Details	2026-04-17 17:23:05 -07:00
Adil Hafeez	0f67b2c806	planoai obs: live LLM observability TUI (#891 )	2026-04-17 14:03:47 -07:00
Adil Hafeez	1f701258cb	Zero-config planoai up: pass-through proxy with auto-detected providers (#890 )	2026-04-17 13:11:12 -07:00
Adil Hafeez	711e4dd07d	Add DigitalOcean as a first-class LLM provider (#889 )	2026-04-17 12:25:34 -07:00
Musa	743d074184	add Plano agent skills framework and rule set (#797 ) Some checks are pending CI / pre-commit (push) Waiting to run Details CI / plano-tools-tests (push) Waiting to run Details CI / native-smoke-test (push) Waiting to run Details CI / docker-build (push) Waiting to run Details CI / validate-config (push) Waiting to run Details CI / security-scan (push) Blocked by required conditions Details CI / test-prompt-gateway (push) Blocked by required conditions Details CI / test-model-alias-routing (push) Blocked by required conditions Details CI / test-responses-api-with-state (push) Blocked by required conditions Details CI / e2e-plano-tests (3.10) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.11) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.12) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.13) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.14) (push) Blocked by required conditions Details CI / e2e-demo-preference (push) Blocked by required conditions Details CI / e2e-demo-currency (push) Blocked by required conditions Details Publish docker image (latest) / build-arm64 (push) Waiting to run Details Publish docker image (latest) / build-amd64 (push) Waiting to run Details Publish docker image (latest) / create-manifest (push) Blocked by required conditions Details Build and Deploy Documentation / build (push) Waiting to run Details * feat: add initial documentation for Plano Agent Skills * feat: readme with examples * feat: add detailed skills documentation and examples for Plano --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com>	2026-04-16 13:16:51 -07:00
Adil Hafeez	d39d7ddd1c	release 0.4.19 (#887 ) Some checks are pending CI / pre-commit (push) Waiting to run Details CI / plano-tools-tests (push) Waiting to run Details CI / native-smoke-test (push) Waiting to run Details CI / docker-build (push) Waiting to run Details CI / validate-config (push) Waiting to run Details CI / security-scan (push) Blocked by required conditions Details CI / test-prompt-gateway (push) Blocked by required conditions Details CI / test-model-alias-routing (push) Blocked by required conditions Details CI / test-responses-api-with-state (push) Blocked by required conditions Details CI / e2e-plano-tests (3.10) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.11) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.12) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.13) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.14) (push) Blocked by required conditions Details CI / e2e-demo-preference (push) Blocked by required conditions Details CI / e2e-demo-currency (push) Blocked by required conditions Details Publish docker image (latest) / build-arm64 (push) Waiting to run Details Publish docker image (latest) / build-amd64 (push) Waiting to run Details Publish docker image (latest) / create-manifest (push) Blocked by required conditions Details Build and Deploy Documentation / build (push) Waiting to run Details	2026-04-15 16:49:50 -07:00
Adil Hafeez	90b926c2ce	use plano-orchestrator for LLM routing, remove arch-router (#886 )	2026-04-15 16:41:42 -07:00
Musa	980faef6be	Redis-backed session cache for cross-replica model affinity (#879 ) Some checks failed CI / pre-commit (push) Has been cancelled Details CI / plano-tools-tests (push) Has been cancelled Details CI / native-smoke-test (push) Has been cancelled Details CI / docker-build (push) Has been cancelled Details CI / validate-config (push) Has been cancelled Details Publish docker image (latest) / build-arm64 (push) Has been cancelled Details Publish docker image (latest) / build-amd64 (push) Has been cancelled Details Build and Deploy Documentation / build (push) Has been cancelled Details CI / security-scan (push) Has been cancelled Details CI / test-prompt-gateway (push) Has been cancelled Details CI / test-model-alias-routing (push) Has been cancelled Details CI / test-responses-api-with-state (push) Has been cancelled Details CI / e2e-plano-tests (3.10) (push) Has been cancelled Details CI / e2e-plano-tests (3.11) (push) Has been cancelled Details CI / e2e-plano-tests (3.12) (push) Has been cancelled Details CI / e2e-plano-tests (3.13) (push) Has been cancelled Details CI / e2e-plano-tests (3.14) (push) Has been cancelled Details CI / e2e-demo-preference (push) Has been cancelled Details CI / e2e-demo-currency (push) Has been cancelled Details Publish docker image (latest) / create-manifest (push) Has been cancelled Details * add pluggable session cache with Redis backend * add Redis session affinity demos (Docker Compose and Kubernetes) * address PR review feedback on session cache * document Redis session cache backend for model affinity * sync rendered config reference with session_cache addition * add tenant-scoped Redis session cache keys and remove dead log_affinity_hit - Add tenant_header to SessionCacheConfig; when set, cache keys are scoped as plano:affinity:{tenant_id}:{session_id} for multi-tenant isolation - Thread tenant_id through RouterService, routing_service, and llm handlers - Use Cow<'_, str> in session_key to avoid allocation when no tenant is set - Remove unused log_affinity_hit (logging was already inlined at call sites) * remove session_affinity_redis and session_affinity_redis_k8s demos	2026-04-13 19:30:47 -07:00
Musa	128059e7c1	release 0.4.18 (#878 ) Some checks failed CI / security-scan (push) Blocked by required conditions Details CI / test-prompt-gateway (push) Blocked by required conditions Details CI / test-model-alias-routing (push) Blocked by required conditions Details CI / test-responses-api-with-state (push) Blocked by required conditions Details CI / e2e-plano-tests (3.10) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.11) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.12) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.13) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.14) (push) Blocked by required conditions Details CI / e2e-demo-preference (push) Blocked by required conditions Details CI / e2e-demo-currency (push) Blocked by required conditions Details CI / pre-commit (push) Has been cancelled Details CI / plano-tools-tests (push) Has been cancelled Details CI / native-smoke-test (push) Has been cancelled Details CI / docker-build (push) Has been cancelled Details CI / validate-config (push) Has been cancelled Details Publish docker image (latest) / build-arm64 (push) Has been cancelled Details Publish docker image (latest) / build-amd64 (push) Has been cancelled Details Publish docker image (latest) / create-manifest (push) Has been cancelled Details Build and Deploy Documentation / build (push) Has been cancelled Details	2026-04-09 13:12:45 -07:00
Adil Hafeez	8dedf0bec1	Model affinity for consistent model selection in agentic loops (#827 ) Some checks are pending CI / pre-commit (push) Waiting to run Details CI / plano-tools-tests (push) Waiting to run Details CI / native-smoke-test (push) Waiting to run Details CI / docker-build (push) Waiting to run Details CI / validate-config (push) Waiting to run Details CI / security-scan (push) Blocked by required conditions Details CI / test-prompt-gateway (push) Blocked by required conditions Details CI / test-model-alias-routing (push) Blocked by required conditions Details CI / test-responses-api-with-state (push) Blocked by required conditions Details CI / e2e-plano-tests (3.10) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.11) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.12) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.13) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.14) (push) Blocked by required conditions Details CI / e2e-demo-preference (push) Blocked by required conditions Details CI / e2e-demo-currency (push) Blocked by required conditions Details Publish docker image (latest) / build-arm64 (push) Waiting to run Details Publish docker image (latest) / build-amd64 (push) Waiting to run Details Publish docker image (latest) / create-manifest (push) Blocked by required conditions Details Build and Deploy Documentation / build (push) Waiting to run Details	2026-04-08 17:32:02 -07:00
Musa	978b1ea722	Add first-class Xiaomi provider support (#863 ) Some checks failed CI / pre-commit (push) Has been cancelled Details CI / plano-tools-tests (push) Has been cancelled Details CI / native-smoke-test (push) Has been cancelled Details CI / docker-build (push) Has been cancelled Details CI / validate-config (push) Has been cancelled Details CI / security-scan (push) Has been cancelled Details CI / test-prompt-gateway (push) Has been cancelled Details CI / test-model-alias-routing (push) Has been cancelled Details CI / test-responses-api-with-state (push) Has been cancelled Details CI / e2e-plano-tests (3.10) (push) Has been cancelled Details CI / e2e-plano-tests (3.11) (push) Has been cancelled Details CI / e2e-plano-tests (3.12) (push) Has been cancelled Details CI / e2e-plano-tests (3.13) (push) Has been cancelled Details CI / e2e-plano-tests (3.14) (push) Has been cancelled Details CI / e2e-demo-preference (push) Has been cancelled Details CI / e2e-demo-currency (push) Has been cancelled Details Publish docker image (latest) / build-arm64 (push) Has been cancelled Details Publish docker image (latest) / build-amd64 (push) Has been cancelled Details Publish docker image (latest) / create-manifest (push) Has been cancelled Details Build and Deploy Documentation / build (push) Has been cancelled Details * feat(provider): add xiaomi as first-class provider * feat(demos): add xiaomi mimo integration demo * refactor(demos): remove Xiaomi MiMo integration demo and update documentation * updating model list and adding the xiamoi models --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-389.local>	2026-04-04 09:58:36 -07:00
Adil Hafeez	9406af3a09	release 0.4.17 (#869 ) Some checks are pending CI / pre-commit (push) Waiting to run Details CI / plano-tools-tests (push) Waiting to run Details CI / native-smoke-test (push) Waiting to run Details CI / docker-build (push) Waiting to run Details CI / validate-config (push) Waiting to run Details CI / security-scan (push) Blocked by required conditions Details CI / test-prompt-gateway (push) Blocked by required conditions Details CI / test-model-alias-routing (push) Blocked by required conditions Details CI / test-responses-api-with-state (push) Blocked by required conditions Details CI / e2e-plano-tests (3.10) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.11) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.12) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.13) (push) Blocked by required conditions Details CI / e2e-plano-tests (3.14) (push) Blocked by required conditions Details CI / e2e-demo-preference (push) Blocked by required conditions Details CI / e2e-demo-currency (push) Blocked by required conditions Details Publish docker image (latest) / build-arm64 (push) Waiting to run Details Publish docker image (latest) / build-amd64 (push) Waiting to run Details Publish docker image (latest) / create-manifest (push) Blocked by required conditions Details Build and Deploy Documentation / build (push) Waiting to run Details	2026-04-03 10:05:33 -07:00
Musa	aa16a6dc4b	ci(e2e): stabilize preference demo test execution (#865 )	2026-04-02 21:32:20 -04:00
Adil Hafeez	7606c55b4b	support developer role in chat completions API (#867 )	2026-04-02 18:10:32 -07:00
Adil Hafeez	1d3f4d6c05	Publish docker images to DigitalOcean Container Registry (#868 )	2026-04-02 18:08:49 -07:00
Adil Hafeez	5d79e7a7d4	fix: resolve all open Dependabot security alerts (#866 )	2026-04-02 18:00:28 -07:00
Musa	76ff353c1e	fix(web): refresh blog content and featured post selection (#862 )	2026-04-02 06:18:19 -07:00
Musa	39b430d74b	feat(web): merge DigitalOcean release announcement updates (#860 ) * feat(web): announce DigitalOcean acquisition across sites * fix(web): make blog routes resilient without Sanity config * fix(web): add mobile arrow cue to announcement banner * fix(web): point acquisition links to announcement post	2026-04-02 06:03:52 -07:00
Musa	0857cfafbf	release 0.4.16 (#859 )	2026-03-31 17:45:28 -07:00
Musa	f68c21f8df	Handle null prefer in inline routing policy (#856 ) * Handle null prefer in inline routing policy * Use serde defaulting for null selection preference * Add tests for default selection policy behavior in routing preferences	2026-03-31 17:41:25 -07:00
Musa	3dbda9741e	fix: route Perplexity OpenAI endpoints without /v1 (#854 ) * fix: route Perplexity OpenAI paths without /v1 * add tests for Perplexity provider handling in LLM module * refactor: use constant for Perplexity provider prefix in LLM module * moving const to top of file	2026-03-31 17:40:42 -07:00
Adil Hafeez	d8f4fd76e3	replace production panics with graceful error handling in common crate (#844 )	2026-03-31 14:28:11 -07:00
Musa	36fa42b364	Improve planoai up/down CLI progress output (#858 )	2026-03-31 14:26:32 -07:00
Musa	82f34f82f2	Update black hook for Python 3.14 (#857 ) * Update pre-commit black to latest release * Reformat Python files for new black version	2026-03-31 13:18:45 -07:00
Adil Hafeez	f019f05738	release 0.4.15 (#853 )	2026-03-30 17:33:40 -07:00
Adil Hafeez	af98c11a6d	restructure model_metrics_sources to type + provider (#855 )	2026-03-30 17:12:20 -07:00
Adil Hafeez	e5751d6b13	model routing: cost/latency ranking with ranked fallback list (#849 )	2026-03-30 13:46:52 -07:00
Musa	3a531ce22a	expand configuration reference with missing fields (#851 )	2026-03-30 12:25:05 -07:00
Adil Hafeez	406fa92802	release 0.4.14 (#840 )	2026-03-20 00:51:37 -07:00
Salman Paracha	69df124c47	the orchestrator had a bug where it was setting the wrong headers for archfc.katanemo.dev (#839 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-389.local>	2026-03-20 00:40:47 -07:00
Adil Hafeez	180a9cb748	separate config generation from process startup in supervisord (#838 )	2026-03-19 22:37:56 -07:00
Adil Hafeez	cdad02c5ee	release 0.4.13 (#837 )	2026-03-19 19:51:58 -07:00
Adil Hafeez	1ad3e0f64e	refactor brightstaff (#736 )	2026-03-19 17:58:33 -07:00
Adil Hafeez	1f23c573bf	add output filter chain (#822 )	2026-03-18 17:58:20 -07:00
Adil Hafeez	de2d8847f3	fix CVE-2026-0861: upgrade glibc via apt-get upgrade in Dockerfile (#832 )	2026-03-16 13:27:37 -07:00
Adil Hafeez	5388c6777f	add k8s deployment manifests and docs for self-hosted Arch-Router (#831 )	2026-03-16 12:05:30 -07:00
Adil Hafeez	f1b8c03e2f	release 0.4.12 (#830 )	2026-03-15 13:03:32 -07:00
Salman Paracha	4bb5c6404f	adding new supported models to plano (#829 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-389.local>	2026-03-15 12:37:20 -07:00
Adil Hafeez	5d85829bc2	Improve config validation error messages (#825 ) * improve config validation error messages and update getting started demo * fix black formatting	2026-03-15 09:36:58 -07:00
Adil Hafeez	bc059aed4d	Unified overrides for custom router and orchestrator models (#820 ) * support configurable orchestrator model via orchestration config section * add self-hosting docs and demo for Plano-Orchestrator * list all Plano-Orchestrator model variants in docs * use overrides for custom routing and orchestration model * update docs * update orchestrator model name * rename arch provider to plano, use llm_routing_model and agent_orchestration_model * regenerate rendered config reference	2026-03-15 09:36:11 -07:00
Adil Hafeez	785bf7e021	add build-cli and build-brightstaff skills (#824 )	2026-03-13 00:28:35 -07:00
Adil Hafeez	2f52774c0e	Add Claude Code skills and streamline CLAUDE.md (#823 ) * add claude code skills and streamline CLAUDE.md * remove claude code attribution from PR skill * update pr skill	2026-03-13 00:18:41 -07:00
Adil Hafeez	5400b0a2fa	add instructions on hosting arch-router locally (#819 )	2026-03-11 15:28:50 -07:00
Adil Hafeez	b4313d93a4	Run demos without Docker (#809 )	2026-03-11 12:49:36 -07:00
Musa	6610097659	Support for Codex via Plano (#808 ) * Add Codex CLI support; xAI response improvements * Add native Plano running check and update CLI agent error handling * adding PR suggestions for transformations and code quality * message extraction logic in ResponsesAPIRequest * xAI support for Responses API by routing to native endpoint + refactor code	2026-03-10 20:54:14 -07:00
Adil Hafeez	5189f7907a	add k8s deploy guide (#816 )	2026-03-10 12:27:31 -07:00
Adil Hafeez	97b7a390ef	support inline routing_policy in request body (#811 ) (#815 )	2026-03-10 12:23:18 -07:00

1 2 3 4 5 ...

677 commits