2025-09-16 11:12:08 -07:00
|
|
|
# Model Alias Demo Suite
|
|
|
|
|
|
Rename all arch references to plano (#745)
* Rename all arch references to plano across the codebase
Complete rebrand from "Arch"/"archgw" to "Plano" including:
- Config files: arch_config_schema.yaml, workflow, demo configs
- Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_*
- Python CLI: variables, functions, file paths, docker mounts
- Rust crates: config paths, log messages, metadata keys
- Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore
- Docker Compose: volume mounts and env vars across all demos/tests
- GitHub workflows: job/step names
- Shell scripts: log messages
- Demos: Python code, READMEs, VS Code configs, Grafana dashboard
- Docs: RST includes, code comments, config references
- Package metadata: package.json, pyproject.toml, uv.lock
External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Update remaining arch references in docs
- Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_*
- Update label references in request_lifecycle.rst
- Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml
- Update config YAML comments: "Arch creates/uses" → "Plano creates/uses"
- Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst
- Update arch_config_schema.yaml reference in provider_models.py
- Rename arch_agent_router → plano_agent_router in config example
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Fix remaining arch references found in second pass
- config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE,
arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs
- config/test_passthrough.yaml: container mount path
- tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml)
- cli/planoai/core.py: comment and log message
- crates/brightstaff/src/tracing/constants.rs: doc comment
- tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages,
arch_state/arch_messages variables renamed
- tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages
- demos/shared/test_runner/{common,test_demos}.py: same renames
- tests/e2e/test_model_alias_routing.py: docstring
- .dockerignore: archgw_modelserver → plano_modelserver
- demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name
Note: x-arch-* HTTP header values and Rust constant names intentionally
preserved for backwards compatibility with existing deployments.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 15:16:56 -08:00
|
|
|
This directory contains demos for the model alias feature in Plano.
|
2025-09-16 11:12:08 -07:00
|
|
|
|
|
|
|
|
## Overview
|
|
|
|
|
|
|
|
|
|
Model aliases allow clients to use friendly, semantic names instead of provider-specific model names. For example:
|
|
|
|
|
- `arch.summarize.v1` → `4o-mini` (fast, cheap model for summaries)
|
|
|
|
|
- `arch.reasoning.v1` → `gpt-4o` (capable model for complex reasoning)
|
|
|
|
|
- `creative-model` → `claude-3-5-sonnet` (creative tasks)
|
|
|
|
|
|
|
|
|
|
## Configuration
|
|
|
|
|
|
|
|
|
|
The `arch_config_with_aliases.yaml` file defines several aliases:
|
|
|
|
|
|
|
|
|
|
```yaml
|
|
|
|
|
# Model aliases - friendly names that map to actual provider names
|
|
|
|
|
model_aliases:
|
|
|
|
|
# Alias for summarization tasks -> fast/cheap model
|
|
|
|
|
arch.summarize.v1:
|
|
|
|
|
target: gpt-4o-mini
|
|
|
|
|
|
|
|
|
|
# Alias for general purpose tasks -> latest model
|
|
|
|
|
arch.v1:
|
|
|
|
|
target: o3
|
|
|
|
|
|
|
|
|
|
# Alias for reasoning tasks -> capable model
|
|
|
|
|
arch.reasoning.v1:
|
|
|
|
|
target: gpt-4o
|
|
|
|
|
|
|
|
|
|
# Alias for creative tasks -> Claude model
|
|
|
|
|
arch.creative.v1:
|
|
|
|
|
target: claude-3-5-sonnet-20241022
|
|
|
|
|
|
|
|
|
|
# Alias for quick responses -> fast model
|
|
|
|
|
arch.fast.v1:
|
|
|
|
|
target: claude-3-haiku-20240307
|
|
|
|
|
|
|
|
|
|
# Semantic aliases
|
|
|
|
|
summary-model:
|
|
|
|
|
target: gpt-4o-mini
|
|
|
|
|
|
|
|
|
|
chat-model:
|
|
|
|
|
target: gpt-4o
|
|
|
|
|
|
|
|
|
|
creative-model:
|
|
|
|
|
target: claude-3-5-sonnet-20241022
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
## Prerequisites
|
Rename all arch references to plano (#745)
* Rename all arch references to plano across the codebase
Complete rebrand from "Arch"/"archgw" to "Plano" including:
- Config files: arch_config_schema.yaml, workflow, demo configs
- Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_*
- Python CLI: variables, functions, file paths, docker mounts
- Rust crates: config paths, log messages, metadata keys
- Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore
- Docker Compose: volume mounts and env vars across all demos/tests
- GitHub workflows: job/step names
- Shell scripts: log messages
- Demos: Python code, READMEs, VS Code configs, Grafana dashboard
- Docs: RST includes, code comments, config references
- Package metadata: package.json, pyproject.toml, uv.lock
External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Update remaining arch references in docs
- Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_*
- Update label references in request_lifecycle.rst
- Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml
- Update config YAML comments: "Arch creates/uses" → "Plano creates/uses"
- Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst
- Update arch_config_schema.yaml reference in provider_models.py
- Rename arch_agent_router → plano_agent_router in config example
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Fix remaining arch references found in second pass
- config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE,
arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs
- config/test_passthrough.yaml: container mount path
- tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml)
- cli/planoai/core.py: comment and log message
- crates/brightstaff/src/tracing/constants.rs: doc comment
- tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages,
arch_state/arch_messages variables renamed
- tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages
- demos/shared/test_runner/{common,test_demos}.py: same renames
- tests/e2e/test_model_alias_routing.py: docstring
- .dockerignore: archgw_modelserver → plano_modelserver
- demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name
Note: x-arch-* HTTP header values and Rust constant names intentionally
preserved for backwards compatibility with existing deployments.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 15:16:56 -08:00
|
|
|
- Install all dependencies as described in the main Plano README ([link](https://github.com/katanemo/plano/?tab=readme-ov-file#prerequisites))
|
2025-09-16 11:12:08 -07:00
|
|
|
- Set your API keys in your environment:
|
|
|
|
|
- `export OPENAI_API_KEY=your-openai-key`
|
|
|
|
|
- `export ANTHROPIC_API_KEY=your-anthropic-key` (optional, but recommended for Anthropic tests)
|
|
|
|
|
|
|
|
|
|
## How to Run
|
|
|
|
|
|
|
|
|
|
1. Start the demo:
|
|
|
|
|
```sh
|
|
|
|
|
sh run_demo.sh
|
|
|
|
|
```
|
|
|
|
|
- This will create a `.env` file with your API keys (if not present).
|
Rename all arch references to plano (#745)
* Rename all arch references to plano across the codebase
Complete rebrand from "Arch"/"archgw" to "Plano" including:
- Config files: arch_config_schema.yaml, workflow, demo configs
- Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_*
- Python CLI: variables, functions, file paths, docker mounts
- Rust crates: config paths, log messages, metadata keys
- Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore
- Docker Compose: volume mounts and env vars across all demos/tests
- GitHub workflows: job/step names
- Shell scripts: log messages
- Demos: Python code, READMEs, VS Code configs, Grafana dashboard
- Docs: RST includes, code comments, config references
- Package metadata: package.json, pyproject.toml, uv.lock
External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Update remaining arch references in docs
- Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_*
- Update label references in request_lifecycle.rst
- Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml
- Update config YAML comments: "Arch creates/uses" → "Plano creates/uses"
- Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst
- Update arch_config_schema.yaml reference in provider_models.py
- Rename arch_agent_router → plano_agent_router in config example
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Fix remaining arch references found in second pass
- config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE,
arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs
- config/test_passthrough.yaml: container mount path
- tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml)
- cli/planoai/core.py: comment and log message
- crates/brightstaff/src/tracing/constants.rs: doc comment
- tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages,
arch_state/arch_messages variables renamed
- tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages
- demos/shared/test_runner/{common,test_demos}.py: same renames
- tests/e2e/test_model_alias_routing.py: docstring
- .dockerignore: archgw_modelserver → plano_modelserver
- demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name
Note: x-arch-* HTTP header values and Rust constant names intentionally
preserved for backwards compatibility with existing deployments.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 15:16:56 -08:00
|
|
|
- Starts Plano gateway with model alias config (`arch_config_with_aliases.yaml`).
|
2025-09-16 11:12:08 -07:00
|
|
|
|
|
|
|
|
2. To stop the demo:
|
|
|
|
|
```sh
|
|
|
|
|
sh run_demo.sh down
|
|
|
|
|
```
|
Rename all arch references to plano (#745)
* Rename all arch references to plano across the codebase
Complete rebrand from "Arch"/"archgw" to "Plano" including:
- Config files: arch_config_schema.yaml, workflow, demo configs
- Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_*
- Python CLI: variables, functions, file paths, docker mounts
- Rust crates: config paths, log messages, metadata keys
- Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore
- Docker Compose: volume mounts and env vars across all demos/tests
- GitHub workflows: job/step names
- Shell scripts: log messages
- Demos: Python code, READMEs, VS Code configs, Grafana dashboard
- Docs: RST includes, code comments, config references
- Package metadata: package.json, pyproject.toml, uv.lock
External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Update remaining arch references in docs
- Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_*
- Update label references in request_lifecycle.rst
- Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml
- Update config YAML comments: "Arch creates/uses" → "Plano creates/uses"
- Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst
- Update arch_config_schema.yaml reference in provider_models.py
- Rename arch_agent_router → plano_agent_router in config example
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Fix remaining arch references found in second pass
- config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE,
arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs
- config/test_passthrough.yaml: container mount path
- tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml)
- cli/planoai/core.py: comment and log message
- crates/brightstaff/src/tracing/constants.rs: doc comment
- tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages,
arch_state/arch_messages variables renamed
- tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages
- demos/shared/test_runner/{common,test_demos}.py: same renames
- tests/e2e/test_model_alias_routing.py: docstring
- .dockerignore: archgw_modelserver → plano_modelserver
- demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name
Note: x-arch-* HTTP header values and Rust constant names intentionally
preserved for backwards compatibility with existing deployments.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 15:16:56 -08:00
|
|
|
- This will stop Plano gateway and any related services.
|
2025-09-16 11:12:08 -07:00
|
|
|
|
|
|
|
|
## Example Requests
|
|
|
|
|
|
|
|
|
|
### OpenAI client with alias `arch.summarize.v1`
|
|
|
|
|
```sh
|
|
|
|
|
curl -sS -X POST "http://localhost:12000/v1/chat/completions" \
|
|
|
|
|
-H "Authorization: Bearer test-key" \
|
|
|
|
|
-H "Content-Type: application/json" \
|
|
|
|
|
-d '{
|
|
|
|
|
"model": "arch.summarize.v1",
|
|
|
|
|
"max_tokens": 50,
|
|
|
|
|
"messages": [
|
|
|
|
|
{ "role": "user",
|
|
|
|
|
"content": "Hello, please respond with exactly: Hello from alias arch.summarize.v1!"
|
|
|
|
|
}
|
|
|
|
|
]
|
|
|
|
|
}' | jq .
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
### OpenAI client with alias `arch.v1`
|
|
|
|
|
```sh
|
|
|
|
|
curl -sS -X POST "http://localhost:12000/v1/chat/completions" \
|
|
|
|
|
-H "Authorization: Bearer test-key" \
|
|
|
|
|
-H "Content-Type: application/json" \
|
|
|
|
|
-d '{
|
|
|
|
|
"model": "arch.v1",
|
|
|
|
|
"max_tokens": 50,
|
|
|
|
|
"messages": [
|
|
|
|
|
{ "role": "user",
|
|
|
|
|
"content": "Hello, please respond with exactly: Hello from alias arch.v1!"
|
|
|
|
|
}
|
|
|
|
|
]
|
|
|
|
|
}' | jq .
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
### Anthropic client with alias `arch.summarize.v1`
|
|
|
|
|
```sh
|
|
|
|
|
curl -sS -X POST "http://localhost:12000/v1/messages" \
|
|
|
|
|
-H "x-api-key: test-key" \
|
|
|
|
|
-H "anthropic-version: 2023-06-01" \
|
|
|
|
|
-H "Content-Type: application/json" \
|
|
|
|
|
-d '{
|
|
|
|
|
"model": "arch.summarize.v1",
|
|
|
|
|
"max_tokens": 50,
|
|
|
|
|
"messages": [
|
|
|
|
|
{ "role": "user",
|
|
|
|
|
"content": "Hello, please respond with exactly: Hello from alias arch.summarize.v1 via Anthropic!"
|
|
|
|
|
}
|
|
|
|
|
]
|
|
|
|
|
}' | jq .
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
### Anthropic client with alias `arch.v1`
|
|
|
|
|
```sh
|
|
|
|
|
curl -sS -X POST "http://localhost:12000/v1/messages" \
|
|
|
|
|
-H "x-api-key: test-key" \
|
|
|
|
|
-H "anthropic-version: 2023-06-01" \
|
|
|
|
|
-H "Content-Type: application/json" \
|
|
|
|
|
-d '{
|
|
|
|
|
"model": "arch.summarize.v1",
|
|
|
|
|
"max_tokens": 50,
|
|
|
|
|
"messages": [
|
|
|
|
|
{ "role": "user",
|
|
|
|
|
"content": "Hello, please respond with exactly: Hello from alias arch.summarize.v1 via Anthropic!"
|
|
|
|
|
}
|
|
|
|
|
]
|
|
|
|
|
}' | jq .
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
## Notes
|
|
|
|
|
- The `.env` file will be created automatically if missing, with your API keys.
|
|
|
|
|
- If `ANTHROPIC_API_KEY` is not set, Anthropic requests will not work.
|
|
|
|
|
- You can add more aliases in `arch_config_with_aliases.yaml`.
|
|
|
|
|
- All curl examples use `jq .` for pretty-printing JSON responses.
|
|
|
|
|
|
|
|
|
|
## Troubleshooting
|
|
|
|
|
- Ensure your API keys are set in your environment before running the demo.
|
|
|
|
|
- If you see errors about missing keys, set them and re-run the script.
|
Rename all arch references to plano (#745)
* Rename all arch references to plano across the codebase
Complete rebrand from "Arch"/"archgw" to "Plano" including:
- Config files: arch_config_schema.yaml, workflow, demo configs
- Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_*
- Python CLI: variables, functions, file paths, docker mounts
- Rust crates: config paths, log messages, metadata keys
- Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore
- Docker Compose: volume mounts and env vars across all demos/tests
- GitHub workflows: job/step names
- Shell scripts: log messages
- Demos: Python code, READMEs, VS Code configs, Grafana dashboard
- Docs: RST includes, code comments, config references
- Package metadata: package.json, pyproject.toml, uv.lock
External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Update remaining arch references in docs
- Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_*
- Update label references in request_lifecycle.rst
- Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml
- Update config YAML comments: "Arch creates/uses" → "Plano creates/uses"
- Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst
- Update arch_config_schema.yaml reference in provider_models.py
- Rename arch_agent_router → plano_agent_router in config example
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Fix remaining arch references found in second pass
- config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE,
arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs
- config/test_passthrough.yaml: container mount path
- tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml)
- cli/planoai/core.py: comment and log message
- crates/brightstaff/src/tracing/constants.rs: doc comment
- tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages,
arch_state/arch_messages variables renamed
- tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages
- demos/shared/test_runner/{common,test_demos}.py: same renames
- tests/e2e/test_model_alias_routing.py: docstring
- .dockerignore: archgw_modelserver → plano_modelserver
- demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name
Note: x-arch-* HTTP header values and Rust constant names intentionally
preserved for backwards compatibility with existing deployments.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 15:16:56 -08:00
|
|
|
- For more details, see the main Plano documentation.
|