plano

mirror of https://github.com/katanemo/plano.git synced 2026-05-18 13:45:15 +02:00

Author	SHA1	Message	Date
Musa	897fda2deb	fix(routing): auto-migrate v0.3.0 inline routing_preferences to v0.4.0 top-level (#912 ) * fix(routing): auto-migrate v0.3.0 inline routing_preferences to v0.4.0 top-level Lift inline routing_preferences under each model_provider into the top-level routing_preferences list with merged models[] and bump version to v0.4.0, with a deprecation warning. Existing v0.3.0 demo configs (Claude Code, Codex, preference_based_routing, etc.) keep working unchanged. Schema flags the inline shape as deprecated but still accepts it. Docs and skills updated to canonical top-level multi-model form. * test(common): bump reference config assertion to v0.4.0 The rendered reference config was bumped to v0.4.0 when its inline routing_preferences were lifted to the top level; align the configuration deserialization test with that change. * fix(config_generator): bump version to v0.4.0 up front in migration Move the v0.3.0 -> v0.4.0 version bump to the top of migrate_inline_routing_preferences so it runs unconditionally, including for configs that already declare top-level routing_preferences at v0.3.0. Previously the bump only fired when inline migration produced entries, leaving top-level v0.3.0 configs rejected by brightstaff's v0.4.0 gate. Tests updated to cover the new behavior and to confirm we never downgrade newer versions. * fix(config_generator): gate routing_preferences migration on version < v0.4.0 Short-circuit the migration when the config already declares v0.4.0 or newer. Anything at v0.4.0+ is assumed to be on the canonical top-level shape and is passed through untouched, including stray inline preferences (which are the author's bug to fix). Only v0.3.0 and older configs are rewritten and bumped.	2026-04-24 12:31:44 -07:00
Musa	b81eb7266c	feat(providers): add Vercel AI Gateway and OpenRouter support (#902 ) Some checks are pending CI / pre-commit (push) Waiting to run CI / plano-tools-tests (push) Waiting to run CI / native-smoke-test (push) Waiting to run CI / docker-build (push) Waiting to run CI / validate-config (push) Waiting to run CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions Publish docker image (latest) / build-arm64 (push) Waiting to run Publish docker image (latest) / build-amd64 (push) Waiting to run Publish docker image (latest) / create-manifest (push) Blocked by required conditions Build and Deploy Documentation / build (push) Waiting to run * add Vercel and OpenRouter as OpenAI-compatible LLM providers * fix(fmt): fix cargo fmt line length issues in provider id tests Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * style(hermesllm): fix rustfmt formatting in provider id tests * Add Vercel and OpenRouter to zero-config planoai up defaults Wires `vercel/` and `openrouter/` into the synthesized default config so `planoai up` with no user config exposes both providers out of the box (env-keyed via AI_GATEWAY_API_KEY / OPENROUTER_API_KEY, pass-through otherwise). Registers both in SUPPORTED_PROVIDERS_WITHOUT_BASE_URL so wildcard model entries validate without an explicit provider_interface. --------- Co-authored-by: Musa Malik <musam@uw.edu> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-23 15:54:39 -07:00
Musa	78dc4edad9	Add first-class ChatGPT subscription provider support (#881 ) * Add first-class ChatGPT subscription provider support * Address PR feedback: move uuid import to top, reuse parsed config in up() * Add ChatGPT token watchdog for seamless long-lived sessions * Address PR feedback: error on stream=false for ChatGPT, fix auth file permissions * Replace ChatGPT watchdog/restart with passthrough_auth --------- Co-authored-by: Musa Malik <musam@uw.edu>	2026-04-23 15:34:44 -07:00
Adil Hafeez	6701195a5d	add overrides.disable_signals to skip CPU-heavy signal analysis (#906 )	2026-04-23 11:38:29 -07:00
Adil Hafeez	22f332f62d	Add Prometheus metrics endpoint and Grafana dashboard for brightstaff (#904 ) Some checks are pending CI / pre-commit (push) Waiting to run CI / plano-tools-tests (push) Waiting to run CI / native-smoke-test (push) Waiting to run CI / docker-build (push) Waiting to run CI / validate-config (push) Waiting to run CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions Publish docker image (latest) / build-arm64 (push) Waiting to run Publish docker image (latest) / build-amd64 (push) Waiting to run Publish docker image (latest) / create-manifest (push) Blocked by required conditions Build and Deploy Documentation / build (push) Waiting to run	2026-04-22 11:19:10 -07:00
Adil Hafeez	1f701258cb	Zero-config planoai up: pass-through proxy with auto-detected providers (#890 )	2026-04-17 13:11:12 -07:00
Adil Hafeez	711e4dd07d	Add DigitalOcean as a first-class LLM provider (#889 )	2026-04-17 12:25:34 -07:00
Adil Hafeez	90b926c2ce	use plano-orchestrator for LLM routing, remove arch-router (#886 )	2026-04-15 16:41:42 -07:00
Musa	980faef6be	Redis-backed session cache for cross-replica model affinity (#879 ) Some checks failed CI / pre-commit (push) Has been cancelled CI / plano-tools-tests (push) Has been cancelled CI / native-smoke-test (push) Has been cancelled CI / docker-build (push) Has been cancelled CI / validate-config (push) Has been cancelled Publish docker image (latest) / build-arm64 (push) Has been cancelled Publish docker image (latest) / build-amd64 (push) Has been cancelled Build and Deploy Documentation / build (push) Has been cancelled CI / security-scan (push) Has been cancelled CI / test-prompt-gateway (push) Has been cancelled CI / test-model-alias-routing (push) Has been cancelled CI / test-responses-api-with-state (push) Has been cancelled CI / e2e-plano-tests (3.10) (push) Has been cancelled CI / e2e-plano-tests (3.11) (push) Has been cancelled CI / e2e-plano-tests (3.12) (push) Has been cancelled CI / e2e-plano-tests (3.13) (push) Has been cancelled CI / e2e-plano-tests (3.14) (push) Has been cancelled CI / e2e-demo-preference (push) Has been cancelled CI / e2e-demo-currency (push) Has been cancelled Publish docker image (latest) / create-manifest (push) Has been cancelled * add pluggable session cache with Redis backend * add Redis session affinity demos (Docker Compose and Kubernetes) * address PR review feedback on session cache * document Redis session cache backend for model affinity * sync rendered config reference with session_cache addition * add tenant-scoped Redis session cache keys and remove dead log_affinity_hit - Add tenant_header to SessionCacheConfig; when set, cache keys are scoped as plano:affinity:{tenant_id}:{session_id} for multi-tenant isolation - Thread tenant_id through RouterService, routing_service, and llm handlers - Use Cow<'_, str> in session_key to avoid allocation when no tenant is set - Remove unused log_affinity_hit (logging was already inlined at call sites) * remove session_affinity_redis and session_affinity_redis_k8s demos	2026-04-13 19:30:47 -07:00
Adil Hafeez	8dedf0bec1	Model affinity for consistent model selection in agentic loops (#827 ) Some checks are pending CI / pre-commit (push) Waiting to run CI / plano-tools-tests (push) Waiting to run CI / native-smoke-test (push) Waiting to run CI / docker-build (push) Waiting to run CI / validate-config (push) Waiting to run CI / security-scan (push) Blocked by required conditions CI / test-prompt-gateway (push) Blocked by required conditions CI / test-model-alias-routing (push) Blocked by required conditions CI / test-responses-api-with-state (push) Blocked by required conditions CI / e2e-plano-tests (3.10) (push) Blocked by required conditions CI / e2e-plano-tests (3.11) (push) Blocked by required conditions CI / e2e-plano-tests (3.12) (push) Blocked by required conditions CI / e2e-plano-tests (3.13) (push) Blocked by required conditions CI / e2e-plano-tests (3.14) (push) Blocked by required conditions CI / e2e-demo-preference (push) Blocked by required conditions CI / e2e-demo-currency (push) Blocked by required conditions Publish docker image (latest) / build-arm64 (push) Waiting to run Publish docker image (latest) / build-amd64 (push) Waiting to run Publish docker image (latest) / create-manifest (push) Blocked by required conditions Build and Deploy Documentation / build (push) Waiting to run	2026-04-08 17:32:02 -07:00
Musa	978b1ea722	Add first-class Xiaomi provider support (#863 ) Some checks failed CI / pre-commit (push) Has been cancelled CI / plano-tools-tests (push) Has been cancelled CI / native-smoke-test (push) Has been cancelled CI / docker-build (push) Has been cancelled CI / validate-config (push) Has been cancelled CI / security-scan (push) Has been cancelled CI / test-prompt-gateway (push) Has been cancelled CI / test-model-alias-routing (push) Has been cancelled CI / test-responses-api-with-state (push) Has been cancelled CI / e2e-plano-tests (3.10) (push) Has been cancelled CI / e2e-plano-tests (3.11) (push) Has been cancelled CI / e2e-plano-tests (3.12) (push) Has been cancelled CI / e2e-plano-tests (3.13) (push) Has been cancelled CI / e2e-plano-tests (3.14) (push) Has been cancelled CI / e2e-demo-preference (push) Has been cancelled CI / e2e-demo-currency (push) Has been cancelled Publish docker image (latest) / build-arm64 (push) Has been cancelled Publish docker image (latest) / build-amd64 (push) Has been cancelled Publish docker image (latest) / create-manifest (push) Has been cancelled Build and Deploy Documentation / build (push) Has been cancelled * feat(provider): add xiaomi as first-class provider * feat(demos): add xiaomi mimo integration demo * refactor(demos): remove Xiaomi MiMo integration demo and update documentation * updating model list and adding the xiamoi models --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-389.local>	2026-04-04 09:58:36 -07:00
Adil Hafeez	af98c11a6d	restructure model_metrics_sources to type + provider (#855 )	2026-03-30 17:12:20 -07:00
Adil Hafeez	e5751d6b13	model routing: cost/latency ranking with ranked fallback list (#849 )	2026-03-30 13:46:52 -07:00
Adil Hafeez	180a9cb748	separate config generation from process startup in supervisord (#838 )	2026-03-19 22:37:56 -07:00
Adil Hafeez	1f23c573bf	add output filter chain (#822 )	2026-03-18 17:58:20 -07:00
Adil Hafeez	bc059aed4d	Unified overrides for custom router and orchestrator models (#820 ) * support configurable orchestrator model via orchestration config section * add self-hosting docs and demo for Plano-Orchestrator * list all Plano-Orchestrator model variants in docs * use overrides for custom routing and orchestration model * update docs * update orchestrator model name * rename arch provider to plano, use llm_routing_model and agent_orchestration_model * regenerate rendered config reference	2026-03-15 09:36:11 -07:00
Adil Hafeez	f63d5de02c	Run plano natively by default (#744 )	2026-03-05 07:35:25 -08:00
Adil Hafeez	198c912202	allow otel collector endpoint to be set from config (#794 ) Co-authored-by: Adil Hafeez <adil.hafeez10@t-mobile.com>	2026-03-01 04:05:45 -08:00
Adil Hafeez	d9404afa4d	release 0.4.9 (#785 )	2026-02-26 16:18:02 -08:00
Musa	2bde21ff57	add Custom Trace Attributes to extend observability (#708 ) * add custom trace attributes * refactor: prefix custom trace attributes and update schema handlers tests configs * refactor: rename custom_attribute_prefixes to span_attribute_header_prefixes in configuration and related handlers * docs: add section on custom span attributes * refactor: update tracing configuration to use span attributes and adjust related handlers * docs: custom span attributes section to include static attributes and clarify configuration * add custom trace attributes * refactor: prefix custom trace attributes and update schema handlers tests configs * refactor: rename custom_attribute_prefixes to span_attribute_header_prefixes in configuration and related handlers * docs: add section on custom span attributes * refactor: update tracing configuration to use span attributes and adjust related handlers * docs: custom span attributes section to include static attributes and clarify configuration * refactor: remove TraceCollector usage and enhance logging with structured attributes * refactor: custom trace attribute extraction to improve clarity --------- Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-25 16:27:20 -08:00
Adil Hafeez	7b5f1549a5	release 0.4.8 (#767 ) Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 01:52:55 -08:00
Adil Hafeez	98b979ce54	Upstream TLS validation and configurable connect timeout (#766 )	2026-02-18 01:19:20 -08:00
Adil Hafeez	bfbf838b19	release 0.4.7 (#752 )	2026-02-17 05:45:44 -08:00
Adil Hafeez	473996d35d	Overhaul demos directory: cleanup, restructure, and standardize configs (#760 )	2026-02-17 03:09:28 -08:00
Adil Hafeez	ba651aaf71	Rename all arch references to plano (#745 ) * Rename all arch references to plano across the codebase Complete rebrand from "Arch"/"archgw" to "Plano" including: - Config files: arch_config_schema.yaml, workflow, demo configs - Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_* - Python CLI: variables, functions, file paths, docker mounts - Rust crates: config paths, log messages, metadata keys - Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore - Docker Compose: volume mounts and env vars across all demos/tests - GitHub workflows: job/step names - Shell scripts: log messages - Demos: Python code, READMEs, VS Code configs, Grafana dashboard - Docs: RST includes, code comments, config references - Package metadata: package.json, pyproject.toml, uv.lock External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Update remaining arch references in docs - Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_* - Update label references in request_lifecycle.rst - Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml - Update config YAML comments: "Arch creates/uses" → "Plano creates/uses" - Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst - Update arch_config_schema.yaml reference in provider_models.py - Rename arch_agent_router → plano_agent_router in config example Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix remaining arch references found in second pass - config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE, arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs - config/test_passthrough.yaml: container mount path - tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml) - cli/planoai/core.py: comment and log message - crates/brightstaff/src/tracing/constants.rs: doc comment - tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages, arch_state/arch_messages variables renamed - tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages - demos/shared/test_runner/{common,test_demos}.py: same renames - tests/e2e/test_model_alias_routing.py: docstring - .dockerignore: archgw_modelserver → plano_modelserver - demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name Note: x-arch-* HTTP header values and Rust constant names intentionally preserved for backwards compatibility with existing deployments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 15:16:56 -08:00
Adil Hafeez	b9b91ddc74	release 0.4.6 (#740 )	2026-02-10 21:00:29 -08:00
Adil Hafeez	25693c36ee	release 0.4.5 (#737 )	2026-02-10 13:37:04 -08:00
Adil Hafeez	46de89590b	use standard tracing and logging in brightstaff (#721 )	2026-02-09 13:33:27 -08:00
Adil Hafeez	e056ddbcd3	add log_level env var (#728 )	2026-02-09 09:25:43 -08:00
Adil Hafeez	d8b4c800e6	release 0.4.4 (#713 )	2026-01-28 20:45:10 -08:00
Adil Hafeez	062825f26e	add envoy retries (#712 ) * add envoy retries * add missing file * fix tests --------- Co-authored-by: Adil Hafeez <adil.hafeez10@t-mobile.com>	2026-01-28 20:31:01 -08:00
Salman Paracha	2941392ed1	Adding support for wildcard models in the model_providers config (#696 ) * cleaning up plano cli commands * adding support for wildcard model providers * fixing compile errors * fixing bugs related to default model provider, provider hint and duplicates in the model provider list * fixed cargo fmt issues * updating tests to always include the model id * using default for the prompt_gateway path * fixed the model name, as gpt-5-mini-2025-08-07 wasn't in the config * making sure that all aliases and models match the config * fixed the config generator to allow for base_url providers LLMs to include wildcard models * re-ran the models list utility and added a shell script to run it * updating docs to mention wildcard model providers * updated provider_models.json to yaml, added that file to our docs for reference * updating the build docs to use the new root-based build --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2026-01-28 17:47:33 -08:00
Adil Hafeez	8428b06e22	add ability to set agent timeout (#710 ) Co-authored-by: Adil Hafeez <adil.hafeez10@t-mobile.com>	2026-01-28 17:18:20 -08:00
Adil Hafeez	43bdd0bfcf	add default agent schema enforcement (#702 )	2026-01-24 12:00:49 -08:00
Adil Hafeez	da5cbc29b7	release 0.4.3 (#701 )	2026-01-18 00:07:46 -08:00
Adil Hafeez	a4ccbda8fb	improve supervisord so its readable (#700 )	2026-01-17 15:29:03 -08:00
Tang Quoc Thai	4d53297c17	feat: add passthrough_auth option for forwarding client Authorization header (#687 ) * feat: add passthrough_auth option for forwarding client Authorization header * fix tests * Update comment to reflect upstream forwarding * Apply suggestions from code review --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com> Co-authored-by: Adil Hafeez <adil@katanemo.com>	2026-01-14 15:06:28 -08:00
Adil Hafeez	b7fba7a97f	release 0.4.2 (#679 )	2026-01-07 13:02:06 -08:00
Adil Hafeez	57327ba667	ensure that request id is consistent (#677 ) * ensure that request id is consistent * remove test debug/info statements	2026-01-07 08:44:41 -08:00
Adil Hafeez	41aa4abaeb	release 0.4.1 (#670 )	2026-01-01 23:39:18 -08:00
Adil Hafeez	77cdc7f6ef	Revert "release 0.4.1 (#666 )" (#669 ) This reverts commit `77df5160d8`.	2025-12-30 15:28:30 -08:00
Adil Hafeez	77df5160d8	release 0.4.1 (#666 )	2025-12-28 14:29:19 -08:00
Adil Hafeez	053e2b3a74	use uv instead of poetry (#663 )	2025-12-26 11:21:42 -08:00
Adil Hafeez	88d14a205b	restructure cli (#656 )	2025-12-25 14:55:29 -08:00

44 commits