Adil Hafeez
90b926c2ce
use plano-orchestrator for LLM routing, remove arch-router ( #886 )
2026-04-15 16:41:42 -07:00
Musa
980faef6be
Redis-backed session cache for cross-replica model affinity ( #879 )
...
CI / pre-commit (push) Has been cancelled
CI / plano-tools-tests (push) Has been cancelled
CI / native-smoke-test (push) Has been cancelled
CI / docker-build (push) Has been cancelled
CI / validate-config (push) Has been cancelled
Publish docker image (latest) / build-arm64 (push) Has been cancelled
Publish docker image (latest) / build-amd64 (push) Has been cancelled
Build and Deploy Documentation / build (push) Has been cancelled
CI / security-scan (push) Has been cancelled
CI / test-prompt-gateway (push) Has been cancelled
CI / test-model-alias-routing (push) Has been cancelled
CI / test-responses-api-with-state (push) Has been cancelled
CI / e2e-plano-tests (3.10) (push) Has been cancelled
CI / e2e-plano-tests (3.11) (push) Has been cancelled
CI / e2e-plano-tests (3.12) (push) Has been cancelled
CI / e2e-plano-tests (3.13) (push) Has been cancelled
CI / e2e-plano-tests (3.14) (push) Has been cancelled
CI / e2e-demo-preference (push) Has been cancelled
CI / e2e-demo-currency (push) Has been cancelled
Publish docker image (latest) / create-manifest (push) Has been cancelled
* add pluggable session cache with Redis backend
* add Redis session affinity demos (Docker Compose and Kubernetes)
* address PR review feedback on session cache
* document Redis session cache backend for model affinity
* sync rendered config reference with session_cache addition
* add tenant-scoped Redis session cache keys and remove dead log_affinity_hit
- Add tenant_header to SessionCacheConfig; when set, cache keys are scoped
as plano:affinity:{tenant_id}:{session_id} for multi-tenant isolation
- Thread tenant_id through RouterService, routing_service, and llm handlers
- Use Cow<'_, str> in session_key to avoid allocation when no tenant is set
- Remove unused log_affinity_hit (logging was already inlined at call sites)
* remove session_affinity_redis and session_affinity_redis_k8s demos
2026-04-13 19:30:47 -07:00
Musa
128059e7c1
release 0.4.18 ( #878 )
CI / security-scan (push) Blocked by required conditions
CI / test-prompt-gateway (push) Blocked by required conditions
CI / test-model-alias-routing (push) Blocked by required conditions
CI / test-responses-api-with-state (push) Blocked by required conditions
CI / e2e-plano-tests (3.10) (push) Blocked by required conditions
CI / e2e-plano-tests (3.11) (push) Blocked by required conditions
CI / e2e-plano-tests (3.12) (push) Blocked by required conditions
CI / e2e-plano-tests (3.13) (push) Blocked by required conditions
CI / e2e-plano-tests (3.14) (push) Blocked by required conditions
CI / e2e-demo-preference (push) Blocked by required conditions
CI / e2e-demo-currency (push) Blocked by required conditions
CI / pre-commit (push) Has been cancelled
CI / plano-tools-tests (push) Has been cancelled
CI / native-smoke-test (push) Has been cancelled
CI / docker-build (push) Has been cancelled
CI / validate-config (push) Has been cancelled
Publish docker image (latest) / build-arm64 (push) Has been cancelled
Publish docker image (latest) / build-amd64 (push) Has been cancelled
Publish docker image (latest) / create-manifest (push) Has been cancelled
Build and Deploy Documentation / build (push) Has been cancelled
2026-04-09 13:12:45 -07:00
Adil Hafeez
8dedf0bec1
Model affinity for consistent model selection in agentic loops ( #827 )
CI / pre-commit (push) Waiting to run
CI / plano-tools-tests (push) Waiting to run
CI / native-smoke-test (push) Waiting to run
CI / docker-build (push) Waiting to run
CI / validate-config (push) Waiting to run
CI / security-scan (push) Blocked by required conditions
CI / test-prompt-gateway (push) Blocked by required conditions
CI / test-model-alias-routing (push) Blocked by required conditions
CI / test-responses-api-with-state (push) Blocked by required conditions
CI / e2e-plano-tests (3.10) (push) Blocked by required conditions
CI / e2e-plano-tests (3.11) (push) Blocked by required conditions
CI / e2e-plano-tests (3.12) (push) Blocked by required conditions
CI / e2e-plano-tests (3.13) (push) Blocked by required conditions
CI / e2e-plano-tests (3.14) (push) Blocked by required conditions
CI / e2e-demo-preference (push) Blocked by required conditions
CI / e2e-demo-currency (push) Blocked by required conditions
Publish docker image (latest) / build-arm64 (push) Waiting to run
Publish docker image (latest) / build-amd64 (push) Waiting to run
Publish docker image (latest) / create-manifest (push) Blocked by required conditions
Build and Deploy Documentation / build (push) Waiting to run
2026-04-08 17:32:02 -07:00
Musa
978b1ea722
Add first-class Xiaomi provider support ( #863 )
...
CI / pre-commit (push) Has been cancelled
CI / plano-tools-tests (push) Has been cancelled
CI / native-smoke-test (push) Has been cancelled
CI / docker-build (push) Has been cancelled
CI / validate-config (push) Has been cancelled
CI / security-scan (push) Has been cancelled
CI / test-prompt-gateway (push) Has been cancelled
CI / test-model-alias-routing (push) Has been cancelled
CI / test-responses-api-with-state (push) Has been cancelled
CI / e2e-plano-tests (3.10) (push) Has been cancelled
CI / e2e-plano-tests (3.11) (push) Has been cancelled
CI / e2e-plano-tests (3.12) (push) Has been cancelled
CI / e2e-plano-tests (3.13) (push) Has been cancelled
CI / e2e-plano-tests (3.14) (push) Has been cancelled
CI / e2e-demo-preference (push) Has been cancelled
CI / e2e-demo-currency (push) Has been cancelled
Publish docker image (latest) / build-arm64 (push) Has been cancelled
Publish docker image (latest) / build-amd64 (push) Has been cancelled
Publish docker image (latest) / create-manifest (push) Has been cancelled
Build and Deploy Documentation / build (push) Has been cancelled
* feat(provider): add xiaomi as first-class provider
* feat(demos): add xiaomi mimo integration demo
* refactor(demos): remove Xiaomi MiMo integration demo and update documentation
* updating model list and adding the xiamoi models
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-389.local>
2026-04-04 09:58:36 -07:00
Adil Hafeez
9406af3a09
release 0.4.17 ( #869 )
CI / pre-commit (push) Waiting to run
CI / plano-tools-tests (push) Waiting to run
CI / native-smoke-test (push) Waiting to run
CI / docker-build (push) Waiting to run
CI / validate-config (push) Waiting to run
CI / security-scan (push) Blocked by required conditions
CI / test-prompt-gateway (push) Blocked by required conditions
CI / test-model-alias-routing (push) Blocked by required conditions
CI / test-responses-api-with-state (push) Blocked by required conditions
CI / e2e-plano-tests (3.10) (push) Blocked by required conditions
CI / e2e-plano-tests (3.11) (push) Blocked by required conditions
CI / e2e-plano-tests (3.12) (push) Blocked by required conditions
CI / e2e-plano-tests (3.13) (push) Blocked by required conditions
CI / e2e-plano-tests (3.14) (push) Blocked by required conditions
CI / e2e-demo-preference (push) Blocked by required conditions
CI / e2e-demo-currency (push) Blocked by required conditions
Publish docker image (latest) / build-arm64 (push) Waiting to run
Publish docker image (latest) / build-amd64 (push) Waiting to run
Publish docker image (latest) / create-manifest (push) Blocked by required conditions
Build and Deploy Documentation / build (push) Waiting to run
2026-04-03 10:05:33 -07:00
Musa
39b430d74b
feat(web): merge DigitalOcean release announcement updates ( #860 )
...
* feat(web): announce DigitalOcean acquisition across sites
* fix(web): make blog routes resilient without Sanity config
* fix(web): add mobile arrow cue to announcement banner
* fix(web): point acquisition links to announcement post
2026-04-02 06:03:52 -07:00
Musa
0857cfafbf
release 0.4.16 ( #859 )
2026-03-31 17:45:28 -07:00
Musa
82f34f82f2
Update black hook for Python 3.14 ( #857 )
...
* Update pre-commit black to latest release
* Reformat Python files for new black version
2026-03-31 13:18:45 -07:00
Adil Hafeez
f019f05738
release 0.4.15 ( #853 )
2026-03-30 17:33:40 -07:00
Adil Hafeez
af98c11a6d
restructure model_metrics_sources to type + provider ( #855 )
2026-03-30 17:12:20 -07:00
Adil Hafeez
e5751d6b13
model routing: cost/latency ranking with ranked fallback list ( #849 )
2026-03-30 13:46:52 -07:00
Musa
3a531ce22a
expand configuration reference with missing fields ( #851 )
2026-03-30 12:25:05 -07:00
Adil Hafeez
406fa92802
release 0.4.14 ( #840 )
2026-03-20 00:51:37 -07:00
Adil Hafeez
cdad02c5ee
release 0.4.13 ( #837 )
2026-03-19 19:51:58 -07:00
Adil Hafeez
1f23c573bf
add output filter chain ( #822 )
2026-03-18 17:58:20 -07:00
Adil Hafeez
5388c6777f
add k8s deployment manifests and docs for self-hosted Arch-Router ( #831 )
2026-03-16 12:05:30 -07:00
Adil Hafeez
f1b8c03e2f
release 0.4.12 ( #830 )
2026-03-15 13:03:32 -07:00
Adil Hafeez
bc059aed4d
Unified overrides for custom router and orchestrator models ( #820 )
...
* support configurable orchestrator model via orchestration config section
* add self-hosting docs and demo for Plano-Orchestrator
* list all Plano-Orchestrator model variants in docs
* use overrides for custom routing and orchestration model
* update docs
* update orchestrator model name
* rename arch provider to plano, use llm_routing_model and agent_orchestration_model
* regenerate rendered config reference
2026-03-15 09:36:11 -07:00
Adil Hafeez
5400b0a2fa
add instructions on hosting arch-router locally ( #819 )
2026-03-11 15:28:50 -07:00
Adil Hafeez
5189f7907a
add k8s deploy guide ( #816 )
2026-03-10 12:27:31 -07:00
Adil Hafeez
065328e11c
release 0.4.11 ( #806 )
2026-03-05 13:58:19 -08:00
Adil Hafeez
c13ce19293
release 0.4.10 ( #802 )
2026-03-05 12:17:45 -08:00
Adil Hafeez
f63d5de02c
Run plano natively by default ( #744 )
2026-03-05 07:35:25 -08:00
Musa
15c6c62df0
fix: strip trailing 'Copy code' from clipboard when copying code blocks ( #788 )
...
Co-authored-by: Claude <noreply@anthropic.com>
2026-02-27 13:11:26 -08:00
Adil Hafeez
d9404afa4d
release 0.4.9 ( #785 )
2026-02-26 16:18:02 -08:00
Musa
2bde21ff57
add Custom Trace Attributes to extend observability ( #708 )
...
* add custom trace attributes
* refactor: prefix custom trace attributes and update schema handlers tests configs
* refactor: rename custom_attribute_prefixes to span_attribute_header_prefixes in configuration and related handlers
* docs: add section on custom span attributes
* refactor: update tracing configuration to use span attributes and adjust related handlers
* docs: custom span attributes section to include static attributes and clarify configuration
* add custom trace attributes
* refactor: prefix custom trace attributes and update schema handlers tests configs
* refactor: rename custom_attribute_prefixes to span_attribute_header_prefixes in configuration and related handlers
* docs: add section on custom span attributes
* refactor: update tracing configuration to use span attributes and adjust related handlers
* docs: custom span attributes section to include static attributes and clarify configuration
* refactor: remove TraceCollector usage and enhance logging with structured attributes
* refactor: custom trace attribute extraction to improve clarity
---------
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-25 16:27:20 -08:00
Adil Hafeez
70ad56a258
remove exposed example passwords from documentation ( #779 )
...
* remove exposed example passwords from documentation
Replace hardcoded example password (MyPass#123/MyPass%23123) and
project-specific Supabase references (postgres.myproject) with
generic placeholders in docs.
https://claude.ai/code/session_01H5wj3VH1Jh28kzepEwdDCx
* remove hardcoded FlightAware AeroAPI key from flights.py
https://claude.ai/code/session_01H5wj3VH1Jh28kzepEwdDCx
---------
Co-authored-by: Claude <noreply@anthropic.com>
2026-02-25 13:14:36 -08:00
Musa
ed64230833
add support for background trace collection and tracing output ( #749 )
...
* feat: add trace listener process management and foreground mode
* docs: add CLI reference documentation and update index
* fix: test coverage failing
* refactor: simplify trace listener initialization and remove debug mode handling
* docs: add CLI command screenshots to reference documentation
* fix: update trace listener PID file path
* refactor: integrate trace listener management into runtime module and streamline PID handling
* adjusting trace command for feedback on PR
2026-02-24 19:17:33 -08:00
Salman Paracha
69d650a4e5
updating architecture diagram ( #774 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2026-02-21 16:00:02 -08:00
Adil Hafeez
7b5f1549a5
release 0.4.8 ( #767 )
...
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-18 01:52:55 -08:00
Adil Hafeez
bfbf838b19
release 0.4.7 ( #752 )
2026-02-17 05:45:44 -08:00
Adil Hafeez
473996d35d
Overhaul demos directory: cleanup, restructure, and standardize configs ( #760 )
2026-02-17 03:09:28 -08:00
Adil Hafeez
ba651aaf71
Rename all arch references to plano ( #745 )
...
* Rename all arch references to plano across the codebase
Complete rebrand from "Arch"/"archgw" to "Plano" including:
- Config files: arch_config_schema.yaml, workflow, demo configs
- Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_*
- Python CLI: variables, functions, file paths, docker mounts
- Rust crates: config paths, log messages, metadata keys
- Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore
- Docker Compose: volume mounts and env vars across all demos/tests
- GitHub workflows: job/step names
- Shell scripts: log messages
- Demos: Python code, READMEs, VS Code configs, Grafana dashboard
- Docs: RST includes, code comments, config references
- Package metadata: package.json, pyproject.toml, uv.lock
External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Update remaining arch references in docs
- Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_*
- Update label references in request_lifecycle.rst
- Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml
- Update config YAML comments: "Arch creates/uses" → "Plano creates/uses"
- Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst
- Update arch_config_schema.yaml reference in provider_models.py
- Rename arch_agent_router → plano_agent_router in config example
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Fix remaining arch references found in second pass
- config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE,
arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs
- config/test_passthrough.yaml: container mount path
- tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml)
- cli/planoai/core.py: comment and log message
- crates/brightstaff/src/tracing/constants.rs: doc comment
- tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages,
arch_state/arch_messages variables renamed
- tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages
- demos/shared/test_runner/{common,test_demos}.py: same renames
- tests/e2e/test_model_alias_routing.py: docstring
- .dockerignore: archgw_modelserver → plano_modelserver
- demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name
Note: x-arch-* HTTP header values and Rust constant names intentionally
preserved for backwards compatibility with existing deployments.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 15:16:56 -08:00
Adil Hafeez
b9b91ddc74
release 0.4.6 ( #740 )
2026-02-10 21:00:29 -08:00
Adil Hafeez
25693c36ee
release 0.4.5 ( #737 )
2026-02-10 13:37:04 -08:00
Musa
e3bf2b7f71
Introduce brand new CLI experience with tracing and quickstart ( #724 )
...
Release hardens tracing and routing: clearer CLI, modular internals, updated demos/docs/tests, and improved multi-agent reliability.
Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com>
2026-02-10 13:17:43 -08:00
Adil Hafeez
46de89590b
use standard tracing and logging in brightstaff ( #721 )
2026-02-09 13:33:27 -08:00
Adil Hafeez
99077d83fb
fix build break ( #730 )
...
docs build was breaking because requirements file was getting ignored from .dockerignore
2026-02-09 10:08:20 -08:00
Adil Hafeez
d8b4c800e6
release 0.4.4 ( #713 )
2026-01-28 20:45:10 -08:00
Adil Hafeez
062825f26e
add envoy retries ( #712 )
...
* add envoy retries
* add missing file
* fix tests
---------
Co-authored-by: Adil Hafeez <adil.hafeez10@t-mobile.com>
2026-01-28 20:31:01 -08:00
Salman Paracha
2a36dd7376
fixing the build scripts for documentation ( #711 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2026-01-28 18:55:35 -08:00
Salman Paracha
2941392ed1
Adding support for wildcard models in the model_providers config ( #696 )
...
* cleaning up plano cli commands
* adding support for wildcard model providers
* fixing compile errors
* fixing bugs related to default model provider, provider hint and duplicates in the model provider list
* fixed cargo fmt issues
* updating tests to always include the model id
* using default for the prompt_gateway path
* fixed the model name, as gpt-5-mini-2025-08-07 wasn't in the config
* making sure that all aliases and models match the config
* fixed the config generator to allow for base_url providers LLMs to include wildcard models
* re-ran the models list utility and added a shell script to run it
* updating docs to mention wildcard model providers
* updated provider_models.json to yaml, added that file to our docs for reference
* updating the build docs to use the new root-based build
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2026-01-28 17:47:33 -08:00
Adil Hafeez
da5cbc29b7
release 0.4.3 ( #701 )
2026-01-18 00:07:46 -08:00
Tang Quoc Thai
4d53297c17
feat: add passthrough_auth option for forwarding client Authorization header ( #687 )
...
* feat: add passthrough_auth option for forwarding client Authorization header
* fix tests
* Update comment to reflect upstream forwarding
* Apply suggestions from code review
---------
Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2026-01-14 15:06:28 -08:00
Adil Hafeez
ab391f96c7
don't include internal models in /v1/models endpoint ( #685 )
2026-01-09 16:57:41 -08:00
Salman Paracha
40b9780774
tweaks to web and docs to align to 0.4.2 ( #680 )
...
* tweaks to web and docs to align to 0.4.2
* made our release banner clickable
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2026-01-07 13:51:40 -08:00
Adil Hafeez
b7fba7a97f
release 0.4.2 ( #679 )
2026-01-07 13:02:06 -08:00
Salman Paracha
b4543ba56c
Introduce signals change ( #655 )
...
* adding support for signals
* reducing false positives for signals like positive interaction
* adding docs. Still need to fix the messages list, but waiting on PR #621
* Improve frustration detection: normalize contractions and refine punctuation
* Further refine test cases with longer messages
* minor doc changes
* fixing echo statement for build
* fixing the messages construction and using the trait for signals
* update signals docs
* fixed some minor doc changes
* added more tests and fixed docuemtnation. PR 100% ready
* made fixes based on PR comments
* Optimize latency
1. replace sliding window approach with trigram containment check
2. add code to pre-compute ngrams for patterns
* removed some debug statements to make tests easier to read
* PR comments to make ObservableStreamProcessor accept optonal Vec<Messagges>
* fixed PR comments
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
Co-authored-by: MeiyuZhong <mariazhong9612@gmail.com>
Co-authored-by: nehcgs <54548843+nehcgs@users.noreply.github.com>
2026-01-07 11:20:44 -08:00
Salman Paracha
a764cac869
updated readme with a snippet of code to go along with the descriptio… ( #674 )
...
* updated readme with a snippet of code to go along with the description of the proejct
* updated readme
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2026-01-03 23:22:04 -08:00