plano

mirror of https://github.com/katanemo/plano.git synced 2026-06-08 14:55:14 +02:00

Author	SHA1	Message	Date
Musa	b5ebb1beea	Document model_providers headers in configuration reference (#950 ) * Document model_providers headers in configuration reference Co-authored-by: Musa <musa@spherrrical.dev> * ci: retrigger workflow Co-authored-by: Musa <musa@spherrrical.dev> * fix(llm_gateway): buffer non-streaming response body until end_of_stream Wait for the full upstream body before JSON parsing to avoid truncated responses on chunked replies. Retry currency_exchange demo tests on flake. Co-authored-by: Musa <musa@spherrrical.dev> * fix(llm_gateway): read full non-streaming body when final chunk is empty Co-authored-by: Musa <musa@spherrrical.dev> * fix(llm_gateway): read full non-streaming body with usize::MAX at end_of_stream Co-authored-by: Musa <musa@spherrrical.dev> * fix(llm_gateway): use envoy body_size for response body replacement Co-authored-by: Musa <musa@spherrrical.dev> * docs: explain model_providers headers in configuration reference Revert unrelated llm_gateway and demo test runner changes. Co-authored-by: Musa <musa@spherrrical.dev> * chore: drop unrelated changes, keep docs-only diff Co-authored-by: Musa <musa@spherrrical.dev> --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com>	2026-06-03 13:38:39 -07:00
Musa	0297b10163	Bump version to 0.4.22 (#917 )	2026-04-24 16:43:19 -07:00
Musa	473ec70b5c	Bump version to 0.4.21 (#911 ) Some checks are pending CI / pre-commit (push) Waiting to run CI / plano-tools-tests (push) Waiting to run CI / native-smoke-test (push) Waiting to run CI / docker-build (push) Waiting to run CI / validate-config (push) Waiting to run CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions Publish docker image (latest) / build-arm64 (push) Waiting to run Publish docker image (latest) / build-amd64 (push) Waiting to run Publish docker image (latest) / create-manifest (push) Blocked by required conditions Build and Deploy Documentation / build (push) Waiting to run	2026-04-24 14:27:32 -07:00
Musa	897fda2deb	fix(routing): auto-migrate v0.3.0 inline routing_preferences to v0.4.0 top-level (#912 ) * fix(routing): auto-migrate v0.3.0 inline routing_preferences to v0.4.0 top-level Lift inline routing_preferences under each model_provider into the top-level routing_preferences list with merged models[] and bump version to v0.4.0, with a deprecation warning. Existing v0.3.0 demo configs (Claude Code, Codex, preference_based_routing, etc.) keep working unchanged. Schema flags the inline shape as deprecated but still accepts it. Docs and skills updated to canonical top-level multi-model form. * test(common): bump reference config assertion to v0.4.0 The rendered reference config was bumped to v0.4.0 when its inline routing_preferences were lifted to the top level; align the configuration deserialization test with that change. * fix(config_generator): bump version to v0.4.0 up front in migration Move the v0.3.0 -> v0.4.0 version bump to the top of migrate_inline_routing_preferences so it runs unconditionally, including for configs that already declare top-level routing_preferences at v0.3.0. Previously the bump only fired when inline migration produced entries, leaving top-level v0.3.0 configs rejected by brightstaff's v0.4.0 gate. Tests updated to cover the new behavior and to confirm we never downgrade newer versions. * fix(config_generator): gate routing_preferences migration on version < v0.4.0 Short-circuit the migration when the config already declares v0.4.0 or newer. Anything at v0.4.0+ is assumed to be on the canonical top-level shape and is passed through untouched, including stray inline preferences (which are the author's bug to fix). Only v0.3.0 and older configs are rewritten and bumped.	2026-04-24 12:31:44 -07:00
Adil Hafeez	6701195a5d	add overrides.disable_signals to skip CPU-heavy signal analysis (#906 )	2026-04-23 11:38:29 -07:00
Adil Hafeez	254d2b03bc	release: bump version to 0.4.20 (#897 ) Some checks are pending CI / pre-commit (push) Waiting to run CI / plano-tools-tests (push) Waiting to run CI / native-smoke-test (push) Waiting to run CI / docker-build (push) Waiting to run CI / validate-config (push) Waiting to run CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions Publish docker image (latest) / build-arm64 (push) Waiting to run Publish docker image (latest) / build-amd64 (push) Waiting to run Publish docker image (latest) / create-manifest (push) Blocked by required conditions Build and Deploy Documentation / build (push) Waiting to run	2026-04-17 21:16:12 -07:00
Adil Hafeez	d39d7ddd1c	release 0.4.19 (#887 ) Some checks are pending CI / pre-commit (push) Waiting to run CI / plano-tools-tests (push) Waiting to run CI / native-smoke-test (push) Waiting to run CI / docker-build (push) Waiting to run CI / validate-config (push) Waiting to run CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions Publish docker image (latest) / build-arm64 (push) Waiting to run Publish docker image (latest) / build-amd64 (push) Waiting to run Publish docker image (latest) / create-manifest (push) Blocked by required conditions Build and Deploy Documentation / build (push) Waiting to run	2026-04-15 16:49:50 -07:00
Adil Hafeez	90b926c2ce	use plano-orchestrator for LLM routing, remove arch-router (#886 )	2026-04-15 16:41:42 -07:00
Musa	980faef6be	Redis-backed session cache for cross-replica model affinity (#879 ) Some checks failed CI / pre-commit (push) Has been cancelled CI / plano-tools-tests (push) Has been cancelled CI / native-smoke-test (push) Has been cancelled CI / docker-build (push) Has been cancelled CI / validate-config (push) Has been cancelled Publish docker image (latest) / build-arm64 (push) Has been cancelled Publish docker image (latest) / build-amd64 (push) Has been cancelled Build and Deploy Documentation / build (push) Has been cancelled CI / security-scan (push) Has been cancelled CI / test-prompt-gateway (push) Has been cancelled CI / test-model-alias-routing (push) Has been cancelled CI / test-responses-api-with-state (push) Has been cancelled CI / e2e-plano-tests (3.10) (push) Has been cancelled CI / e2e-plano-tests (3.11) (push) Has been cancelled CI / e2e-plano-tests (3.12) (push) Has been cancelled CI / e2e-plano-tests (3.13) (push) Has been cancelled CI / e2e-plano-tests (3.14) (push) Has been cancelled CI / e2e-demo-preference (push) Has been cancelled CI / e2e-demo-currency (push) Has been cancelled Publish docker image (latest) / create-manifest (push) Has been cancelled * add pluggable session cache with Redis backend * add Redis session affinity demos (Docker Compose and Kubernetes) * address PR review feedback on session cache * document Redis session cache backend for model affinity * sync rendered config reference with session_cache addition * add tenant-scoped Redis session cache keys and remove dead log_affinity_hit - Add tenant_header to SessionCacheConfig; when set, cache keys are scoped as plano:affinity:{tenant_id}:{session_id} for multi-tenant isolation - Thread tenant_id through RouterService, routing_service, and llm handlers - Use Cow<'_, str> in session_key to avoid allocation when no tenant is set - Remove unused log_affinity_hit (logging was already inlined at call sites) * remove session_affinity_redis and session_affinity_redis_k8s demos	2026-04-13 19:30:47 -07:00
Musa	128059e7c1	release 0.4.18 (#878 ) Some checks failed CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions CI / pre-commit (push) Has been cancelled CI / plano-tools-tests (push) Has been cancelled CI / native-smoke-test (push) Has been cancelled CI / docker-build (push) Has been cancelled CI / validate-config (push) Has been cancelled Publish docker image (latest) / build-arm64 (push) Has been cancelled Publish docker image (latest) / build-amd64 (push) Has been cancelled Publish docker image (latest) / create-manifest (push) Has been cancelled Build and Deploy Documentation / build (push) Has been cancelled	2026-04-09 13:12:45 -07:00
Adil Hafeez	8dedf0bec1	Model affinity for consistent model selection in agentic loops (#827 ) Some checks are pending CI / pre-commit (push) Waiting to run CI / plano-tools-tests (push) Waiting to run CI / native-smoke-test (push) Waiting to run CI / docker-build (push) Waiting to run CI / validate-config (push) Waiting to run CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions Publish docker image (latest) / build-arm64 (push) Waiting to run Publish docker image (latest) / build-amd64 (push) Waiting to run Publish docker image (latest) / create-manifest (push) Blocked by required conditions Build and Deploy Documentation / build (push) Waiting to run	2026-04-08 17:32:02 -07:00
Adil Hafeez	9406af3a09	release 0.4.17 (#869 ) Some checks are pending CI / pre-commit (push) Waiting to run CI / plano-tools-tests (push) Waiting to run CI / native-smoke-test (push) Waiting to run CI / docker-build (push) Waiting to run CI / validate-config (push) Waiting to run CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions Publish docker image (latest) / build-arm64 (push) Waiting to run Publish docker image (latest) / build-amd64 (push) Waiting to run Publish docker image (latest) / create-manifest (push) Blocked by required conditions Build and Deploy Documentation / build (push) Waiting to run	2026-04-03 10:05:33 -07:00
Musa	0857cfafbf	release 0.4.16 (#859 )	2026-03-31 17:45:28 -07:00
Musa	82f34f82f2	Update black hook for Python 3.14 (#857 ) * Update pre-commit black to latest release * Reformat Python files for new black version	2026-03-31 13:18:45 -07:00
Adil Hafeez	f019f05738	release 0.4.15 (#853 )	2026-03-30 17:33:40 -07:00
Adil Hafeez	e5751d6b13	model routing: cost/latency ranking with ranked fallback list (#849 )	2026-03-30 13:46:52 -07:00
Musa	3a531ce22a	expand configuration reference with missing fields (#851 )	2026-03-30 12:25:05 -07:00
Adil Hafeez	406fa92802	release 0.4.14 (#840 )	2026-03-20 00:51:37 -07:00
Adil Hafeez	cdad02c5ee	release 0.4.13 (#837 )	2026-03-19 19:51:58 -07:00
Adil Hafeez	1f23c573bf	add output filter chain (#822 )	2026-03-18 17:58:20 -07:00
Adil Hafeez	f1b8c03e2f	release 0.4.12 (#830 )	2026-03-15 13:03:32 -07:00
Adil Hafeez	bc059aed4d	Unified overrides for custom router and orchestrator models (#820 ) * support configurable orchestrator model via orchestration config section * add self-hosting docs and demo for Plano-Orchestrator * list all Plano-Orchestrator model variants in docs * use overrides for custom routing and orchestration model * update docs * update orchestrator model name * rename arch provider to plano, use llm_routing_model and agent_orchestration_model * regenerate rendered config reference	2026-03-15 09:36:11 -07:00
Adil Hafeez	5189f7907a	add k8s deploy guide (#816 )	2026-03-10 12:27:31 -07:00
Adil Hafeez	065328e11c	release 0.4.11 (#806 )	2026-03-05 13:58:19 -08:00
Adil Hafeez	c13ce19293	release 0.4.10 (#802 )	2026-03-05 12:17:45 -08:00
Adil Hafeez	f63d5de02c	Run plano natively by default (#744 )	2026-03-05 07:35:25 -08:00
Adil Hafeez	d9404afa4d	release 0.4.9 (#785 )	2026-02-26 16:18:02 -08:00
Adil Hafeez	70ad56a258	remove exposed example passwords from documentation (#779 ) * remove exposed example passwords from documentation Replace hardcoded example password (MyPass#123/MyPass%23123) and project-specific Supabase references (postgres.myproject) with generic placeholders in docs. https://claude.ai/code/session_01H5wj3VH1Jh28kzepEwdDCx * remove hardcoded FlightAware AeroAPI key from flights.py https://claude.ai/code/session_01H5wj3VH1Jh28kzepEwdDCx --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-02-25 13:14:36 -08:00
Musa	ed64230833	add support for background trace collection and tracing output (#749 ) * feat: add trace listener process management and foreground mode * docs: add CLI reference documentation and update index * fix: test coverage failing * refactor: simplify trace listener initialization and remove debug mode handling * docs: add CLI command screenshots to reference documentation * fix: update trace listener PID file path * refactor: integrate trace listener management into runtime module and streamline PID handling * adjusting trace command for feedback on PR	2026-02-24 19:17:33 -08:00
Salman Paracha	69d650a4e5	updating architecture diagram (#774 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2026-02-21 16:00:02 -08:00
Adil Hafeez	7b5f1549a5	release 0.4.8 (#767 ) Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 01:52:55 -08:00
Adil Hafeez	bfbf838b19	release 0.4.7 (#752 )	2026-02-17 05:45:44 -08:00
Adil Hafeez	ba651aaf71	Rename all arch references to plano (#745 ) * Rename all arch references to plano across the codebase Complete rebrand from "Arch"/"archgw" to "Plano" including: - Config files: arch_config_schema.yaml, workflow, demo configs - Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_* - Python CLI: variables, functions, file paths, docker mounts - Rust crates: config paths, log messages, metadata keys - Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore - Docker Compose: volume mounts and env vars across all demos/tests - GitHub workflows: job/step names - Shell scripts: log messages - Demos: Python code, READMEs, VS Code configs, Grafana dashboard - Docs: RST includes, code comments, config references - Package metadata: package.json, pyproject.toml, uv.lock External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Update remaining arch references in docs - Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_* - Update label references in request_lifecycle.rst - Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml - Update config YAML comments: "Arch creates/uses" → "Plano creates/uses" - Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst - Update arch_config_schema.yaml reference in provider_models.py - Rename arch_agent_router → plano_agent_router in config example Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix remaining arch references found in second pass - config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE, arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs - config/test_passthrough.yaml: container mount path - tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml) - cli/planoai/core.py: comment and log message - crates/brightstaff/src/tracing/constants.rs: doc comment - tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages, arch_state/arch_messages variables renamed - tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages - demos/shared/test_runner/{common,test_demos}.py: same renames - tests/e2e/test_model_alias_routing.py: docstring - .dockerignore: archgw_modelserver → plano_modelserver - demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name Note: x-arch-* HTTP header values and Rust constant names intentionally preserved for backwards compatibility with existing deployments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 15:16:56 -08:00
Adil Hafeez	b9b91ddc74	release 0.4.6 (#740 )	2026-02-10 21:00:29 -08:00
Adil Hafeez	25693c36ee	release 0.4.5 (#737 )	2026-02-10 13:37:04 -08:00
Adil Hafeez	46de89590b	use standard tracing and logging in brightstaff (#721 )	2026-02-09 13:33:27 -08:00
Adil Hafeez	d8b4c800e6	release 0.4.4 (#713 )	2026-01-28 20:45:10 -08:00
Adil Hafeez	062825f26e	add envoy retries (#712 ) * add envoy retries * add missing file * fix tests --------- Co-authored-by: Adil Hafeez <adil.hafeez10@t-mobile.com>	2026-01-28 20:31:01 -08:00
Adil Hafeez	da5cbc29b7	release 0.4.3 (#701 )	2026-01-18 00:07:46 -08:00
Tang Quoc Thai	4d53297c17	feat: add passthrough_auth option for forwarding client Authorization header (#687 ) * feat: add passthrough_auth option for forwarding client Authorization header * fix tests * Update comment to reflect upstream forwarding * Apply suggestions from code review --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com> Co-authored-by: Adil Hafeez <adil@katanemo.com>	2026-01-14 15:06:28 -08:00
Adil Hafeez	ab391f96c7	don't include internal models in /v1/models endpoint (#685 )	2026-01-09 16:57:41 -08:00
Adil Hafeez	b7fba7a97f	release 0.4.2 (#679 )	2026-01-07 13:02:06 -08:00
Adil Hafeez	41aa4abaeb	release 0.4.1 (#670 )	2026-01-01 23:39:18 -08:00
Adil Hafeez	77cdc7f6ef	Revert "release 0.4.1 (#666 )" (#669 ) This reverts commit `77df5160d8`.	2025-12-30 15:28:30 -08:00
Adil Hafeez	77df5160d8	release 0.4.1 (#666 )	2025-12-28 14:29:19 -08:00
Salman Paracha	e224cba3e3	Update docs to Plano (#639 )	2025-12-23 17:14:50 -08:00
Adil Hafeez	15fbb6c3af	plano orchestration using plano orchestration 4b model (#637 )	2025-12-22 18:05:49 -08:00
Salman Paracha	d5a273f740	enable state management for v1/responses (#631 ) * first commit with tests to enable state mamangement via memory * fixed logs to follow the conversational flow a bit better * added support for supabase * added the state_storage_v1_responses flag, and use that to store state appropriately * cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo * fixed mixed inputs from openai v1/responses api (#632) * fixed mixed inputs from openai v1/responses api * removing tracing from model-alias-rouing * handling additional input types from openairs --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local> * resolving PR comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2025-12-17 12:18:38 -08:00
Adil Hafeez	8adb9795d8	release 0.3.22 (#629 )	2025-12-11 11:20:19 -08:00
Adil Hafeez	09c0b999b2	release 0.3.21 (#626 )	2025-12-03 17:12:34 -08:00

1 2

69 commits