plano/docs/source/resources/includes/plano_config_full_reference.yaml

# Plano Gateway configuration version
version: v0.3.0

# External HTTP agents - API type is controlled by request path (/v1/responses, /v1/messages, /v1/chat/completions)
agents:
  - id: weather_agent # Example agent for weather
    url: http://localhost:10510

  - id: flight_agent # Example agent for flights
    url: http://localhost:10520

# MCP filters applied to requests/responses (e.g., input validation, query rewriting)
filters:
  - id: input_guards # Example filter for input validation
    url: http://localhost:10500
    # type: mcp (default)
    # transport: streamable-http (default)
    # tool: input_guards (default - same as filter id)

# LLM provider configurations with API keys and model routing
model_providers:
  - model: openai/gpt-4o
    access_key: $OPENAI_API_KEY
    default: true

  - model: openai/gpt-4o-mini
    access_key: $OPENAI_API_KEY

  - model: anthropic/claude-sonnet-4-0
    access_key: $ANTHROPIC_API_KEY

  - model: mistral/ministral-3b-latest
    access_key: $MISTRAL_API_KEY

  # Example: Passthrough authentication for LiteLLM or similar proxies
  # When passthrough_auth is true, client's Authorization header is forwarded
  # instead of using the configured access_key
  - model: openai/gpt-4o-litellm
    base_url: https://litellm.example.com
    passthrough_auth: true

# Model aliases - use friendly names instead of full provider model names
model_aliases:
  fast-llm:
    target: gpt-4o-mini

  smart-llm:
    target: gpt-4o

# HTTP listeners - entry points for agent routing, prompt targets, and direct LLM access
listeners:
  # Agent listener for routing requests to multiple agents
  - type: agent
    name: travel_booking_service
    port: 8001
    router: plano_orchestrator_v1
    address: 0.0.0.0
    agents:
      - id: rag_agent
        description: virtual assistant for retrieval augmented generation tasks
        input_filters:
          - input_guards

  # Model listener for direct LLM access
  - type: model
    name: model_1
    address: 0.0.0.0
    port: 12000
    # Optional: attach input filters for guardrails on direct LLM requests
    # input_filters:
    #   - input_guards

  # Prompt listener for function calling (for prompt_targets)
  - type: prompt
    name: prompt_function_listener
    address: 0.0.0.0
    port: 10000
    # This listener is used for prompt_targets and function calling

# Reusable service endpoints
endpoints:
  app_server:
    endpoint: 127.0.0.1:80
    connect_timeout: 0.005s

  mistral_local:
    endpoint: 127.0.0.1:8001

# Prompt targets for function calling and API orchestration
prompt_targets:
  - name: get_current_weather
    description: Get current weather at a location.
    parameters:
      - name: location
        description: The location to get the weather for
        required: true
        type: string
        format: City, State
      - name: days
        description: the number of days for the request
        required: true
        type: int
    endpoint:
      name: app_server
      path: /weather
      http_method: POST

# OpenTelemetry tracing configuration
tracing:
  # Random sampling percentage (1-100)
  random_sampling: 100
Rename all arch references to plano (#745) * Rename all arch references to plano across the codebase Complete rebrand from "Arch"/"archgw" to "Plano" including: - Config files: arch_config_schema.yaml, workflow, demo configs - Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_* - Python CLI: variables, functions, file paths, docker mounts - Rust crates: config paths, log messages, metadata keys - Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore - Docker Compose: volume mounts and env vars across all demos/tests - GitHub workflows: job/step names - Shell scripts: log messages - Demos: Python code, READMEs, VS Code configs, Grafana dashboard - Docs: RST includes, code comments, config references - Package metadata: package.json, pyproject.toml, uv.lock External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Update remaining arch references in docs - Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_* - Update label references in request_lifecycle.rst - Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml - Update config YAML comments: "Arch creates/uses" → "Plano creates/uses" - Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst - Update arch_config_schema.yaml reference in provider_models.py - Rename arch_agent_router → plano_agent_router in config example Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix remaining arch references found in second pass - config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE, arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs - config/test_passthrough.yaml: container mount path - tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml) - cli/planoai/core.py: comment and log message - crates/brightstaff/src/tracing/constants.rs: doc comment - tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages, arch_state/arch_messages variables renamed - tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages - demos/shared/test_runner/{common,test_demos}.py: same renames - tests/e2e/test_model_alias_routing.py: docstring - .dockerignore: archgw_modelserver → plano_modelserver - demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name Note: x-arch-* HTTP header values and Rust constant names intentionally preserved for backwards compatibility with existing deployments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> 2026-02-13 15:16:56 -08:00			`# Plano Gateway configuration version`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`version: v0.3.0`
Docs branch - v1 of our tech docs (#69) * added the first set of docs for our technical docs * more docuemtnation changes * added support for prompt processing and updated life of a request * updated docs to including getting help sections and updated life of a request * committing local changes for getting started guide, sample applications, and full reference spec for prompt-config * updated configuration reference, added sample app skeleton, updated favico * fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2024-09-20 17:08:42 -07:00
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`# External HTTP agents - API type is controlled by request path (/v1/responses, /v1/messages, /v1/chat/completions)`
			`agents:`
feat: add passthrough_auth option for forwarding client Authorization header (#687) * feat: add passthrough_auth option for forwarding client Authorization header * fix tests * Update comment to reflect upstream forwarding * Apply suggestions from code review --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2026-01-15 00:06:28 +01:00			`- id: weather_agent # Example agent for weather`
Run plano natively by default (#744) 2026-03-05 07:35:25 -08:00			`url: http://localhost:10510`
update config (#93) 2024-09-30 17:49:05 -07:00
feat: add passthrough_auth option for forwarding client Authorization header (#687) * feat: add passthrough_auth option for forwarding client Authorization header * fix tests * Update comment to reflect upstream forwarding * Apply suggestions from code review --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2026-01-15 00:06:28 +01:00			`- id: flight_agent # Example agent for flights`
Run plano natively by default (#744) 2026-03-05 07:35:25 -08:00			`url: http://localhost:10520`
update config (#93) 2024-09-30 17:49:05 -07:00
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`# MCP filters applied to requests/responses (e.g., input validation, query rewriting)`
			`filters:`
feat: add passthrough_auth option for forwarding client Authorization header (#687) * feat: add passthrough_auth option for forwarding client Authorization header * fix tests * Update comment to reflect upstream forwarding * Apply suggestions from code review --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2026-01-15 00:06:28 +01:00			`- id: input_guards # Example filter for input validation`
Run plano natively by default (#744) 2026-03-05 07:35:25 -08:00			`url: http://localhost:10500`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`# type: mcp (default)`
			`# transport: streamable-http (default)`
			`# tool: input_guards (default - same as filter id)`

			`# LLM provider configurations with API keys and model routing`
			`model_providers:`
			`- model: openai/gpt-4o`
Use large github action machine to run e2e tests (#230) 2024-10-30 17:54:51 -07:00			`access_key: $OPENAI_API_KEY`
Docs branch - v1 of our tech docs (#69) * added the first set of docs for our technical docs * more docuemtnation changes * added support for prompt processing and updated life of a request * updated docs to including getting help sections and updated life of a request * committing local changes for getting started guide, sample applications, and full reference spec for prompt-config * updated configuration reference, added sample app skeleton, updated favico * fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2024-09-20 17:08:42 -07:00			`default: true`
update config (#93) 2024-09-30 17:49:05 -07:00
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`- model: openai/gpt-4o-mini`
			`access_key: $OPENAI_API_KEY`

			`- model: anthropic/claude-sonnet-4-0`
			`access_key: $ANTHROPIC_API_KEY`
update config (#93) 2024-09-30 17:49:05 -07:00
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`- model: mistral/ministral-3b-latest`
			`access_key: $MISTRAL_API_KEY`
Docs branch - v1 of our tech docs (#69) * added the first set of docs for our technical docs * more docuemtnation changes * added support for prompt processing and updated life of a request * updated docs to including getting help sections and updated life of a request * committing local changes for getting started guide, sample applications, and full reference spec for prompt-config * updated configuration reference, added sample app skeleton, updated favico * fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2024-09-20 17:08:42 -07:00
feat: add passthrough_auth option for forwarding client Authorization header (#687) * feat: add passthrough_auth option for forwarding client Authorization header * fix tests * Update comment to reflect upstream forwarding * Apply suggestions from code review --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2026-01-15 00:06:28 +01:00			`# Example: Passthrough authentication for LiteLLM or similar proxies`
			`# When passthrough_auth is true, client's Authorization header is forwarded`
			`# instead of using the configured access_key`
			`- model: openai/gpt-4o-litellm`
			`base_url: https://litellm.example.com`
			`passthrough_auth: true`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00
			`# Model aliases - use friendly names instead of full provider model names`
draft commit to add support for xAI, TogehterAI, AzureOpenAI (#570) * draft commit to add support for xAI, LambdaAI, TogehterAI, AzureOpenAI * fixing failing tests and updating rederend config file * Update arch_config_with_aliases.yaml * adding the AZURE_API_KEY to the GH workflow for e2e * fixing GH secerts * adding valdiating for azure_openai --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local> 2025-09-18 18:36:30 -07:00			`model_aliases:`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`fast-llm:`
			`target: gpt-4o-mini`

			`smart-llm:`
draft commit to add support for xAI, TogehterAI, AzureOpenAI (#570) * draft commit to add support for xAI, LambdaAI, TogehterAI, AzureOpenAI * fixing failing tests and updating rederend config file * Update arch_config_with_aliases.yaml * adding the AZURE_API_KEY to the GH workflow for e2e * fixing GH secerts * adding valdiating for azure_openai --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local> 2025-09-18 18:36:30 -07:00			`target: gpt-4o`

Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`# HTTP listeners - entry points for agent routing, prompt targets, and direct LLM access`
			`listeners:`
			`# Agent listener for routing requests to multiple agents`
			`- type: agent`
			`name: travel_booking_service`
			`port: 8001`
			`router: plano_orchestrator_v1`
			`address: 0.0.0.0`
			`agents:`
			`- id: rag_agent`
			`description: virtual assistant for retrieval augmented generation tasks`
add output filter chain (#822) 2026-03-18 17:58:20 -07:00			`input_filters:`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`- input_guards`

			`# Model listener for direct LLM access`
			`- type: model`
			`name: model_1`
			`address: 0.0.0.0`
			`port: 12000`
add output filter chain (#822) 2026-03-18 17:58:20 -07:00			`# Optional: attach input filters for guardrails on direct LLM requests`
			`# input_filters:`
			`# - input_guards`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00
			`# Prompt listener for function calling (for prompt_targets)`
			`- type: prompt`
			`name: prompt_function_listener`
			`address: 0.0.0.0`
			`port: 10000`
			`# This listener is used for prompt_targets and function calling`
update config (#93) 2024-09-30 17:49:05 -07:00
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`# Reusable service endpoints`
			`endpoints:`
			`app_server:`
			`endpoint: 127.0.0.1:80`
			`connect_timeout: 0.005s`
Docs branch - v1 of our tech docs (#69) * added the first set of docs for our technical docs * more docuemtnation changes * added support for prompt processing and updated life of a request * updated docs to including getting help sections and updated life of a request * committing local changes for getting started guide, sample applications, and full reference spec for prompt-config * updated configuration reference, added sample app skeleton, updated favico * fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2024-09-20 17:08:42 -07:00
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`mistral_local:`
			`endpoint: 127.0.0.1:8001`

			`# Prompt targets for function calling and API orchestration`
Docs branch - v1 of our tech docs (#69) * added the first set of docs for our technical docs * more docuemtnation changes * added support for prompt processing and updated life of a request * updated docs to including getting help sections and updated life of a request * committing local changes for getting started guide, sample applications, and full reference spec for prompt-config * updated configuration reference, added sample app skeleton, updated favico * fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2024-09-20 17:08:42 -07:00			`prompt_targets:`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`- name: get_current_weather`
			`description: Get current weather at a location.`
			`parameters:`
			`- name: location`
			`description: The location to get the weather for`
			`required: true`
			`type: string`
			`format: City, State`
			`- name: days`
			`description: the number of days for the request`
			`required: true`
			`type: int`
update config (#93) 2024-09-30 17:49:05 -07:00			`endpoint:`
			`name: app_server`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`path: /weather`
update getting started guide and add llm gateway and prompt gateway samples (#330) 2024-12-06 14:37:33 -08:00			`http_method: POST`
Fix errors and improve Doc (#143) * Fix link issues and add icons * Improve Doc * fix test * making minor modifications to shuguangs' doc changes --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local> Co-authored-by: Adil Hafeez <adil@katanemo.com> 2024-10-08 13:18:34 -07:00
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`# OpenTelemetry tracing configuration`
Split listener (#141) 2024-10-08 16:24:08 -07:00			`tracing:`
Update docs to Plano (#639) 2025-12-23 17:14:50 -08:00			`# Random sampling percentage (1-100)`
			`random_sampling: 100`