plano/README.md

<div align="center">
  <img src="docs/source/_static/img/PlanoTagline.svg" alt="Plano Logo" width="75%" height=auto>
</div>
<div align="center">

 _Plano is a models-native proxy and data plane for agents._<br><br>
 Plano pulls out the rote plumbing work and decouples you from brittle framework abstractions, centralizing what shouldn’t be bespoke in every codebase - like agent routing and orchestration, rich agentic signals and traces for continuous improvement, guardrail filters for moderation, and smart LLM routing APIs for UX and DX agility. Use any language or AI framework, and deliver agents faster to production.


[Quickstart](#Quickstart) •
[Demos](#Demos) •
[Route LLMs](#use-plano-as-a-llm-router) •
[Build Agentic Apps with Plano](#Build-Agentic-Apps-with-Plano) •
[Documentation](https://docs.planoai.dev) •
[Contact](#Contact)

[![pre-commit](https://github.com/katanemo/plano/actions/workflows/pre-commit.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/pre-commit.yml)
[![rust tests (prompt and llm gateway)](https://github.com/katanemo/plano/actions/workflows/rust_tests.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/rust_tests.yml)
[![e2e tests](https://github.com/katanemo/plano/actions/workflows/e2e_tests.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/e2e_tests.yml)
[![Build and Deploy Documentation](https://github.com/katanemo/plano/actions/workflows/static.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/static.yml)

</div>

# Overview
Building agentic demos is easy. Shipping agentic applications safely, reliably, and repeatably to production is hard. After the thrill of a quick hack, you end up building the “hidden middleware” to reach production: routing logic to reach the right agent, guardrail hooks for safety and moderation, evaluation and observability glue for continuous learning, and model/provider quirks scattered across frameworks and application code.

Plano solves this by moving core delivery concerns into a unified, out-of-process dataplane.

- **🚦 Orchestration:** Low-latency orchestration between agents, and add new agents without changing app code
- **🔗 Model Agility:** Route [by model name, alias (semantic names) or automatically via preferences](#use-plano-as-a-llm-router)
- **🕵 Agentic Signals&trade;:** Zero-code capture of [behavior signals](#observability) plus OTEL traces/metrics across every agent.
- **🛡️ Moderation & Memory Hooks:** Build jailbreak protection, add moderation policies and memory consistently via [Filter Chains](https://docs.planoai.dev/concepts/filter_chain.html).

Plano pulls rote plumbing out of your framework so you can stay focused on what matters most: the core product logic of your agentic applications. Plano is backed by [industry-leading LLM research](https://planoai.dev/research) and built on [Envoy](https://envoyproxy.io) by its core contributors, who built critical infrastructure at scale for modern worklaods.

**High-Level Network Sequence Diagram**:
![high-level network plano arcitecture for Plano](docs/source/_static/img/plano_network_diagram_high_level.png)

**Jump to our [docs](https://docs.planoai.dev)** to learn how you can use Plano to improve the speed, safety and obervability of your agentic applications.

> [!IMPORTANT]
> Plano and the Arch family of LLMs (like Plano-Orchestrator-4B, Arch-Router, etc) are hosted free of charge in the US-central region to give you a great first-run developer experience of Plano. To scale and run in production, you can either run these LLMs locally or contact us on [Discord](https://discord.gg/pGZf2gcwEc) for API keys.

## Contact
To get in touch with us, please join our [discord server](https://discord.gg/pGZf2gcwEc). We will be monitoring that actively and offering support there.

## Demos
* [Sample App: Weather Forecast Agent](demos/samples_python/weather_forecast/README.md) - A sample agentic weather forecasting app that highlights core function calling capabilities of Plano.
* [Sample App: Network Operator Agent](demos/samples_python/network_switch_operator_agent/README.md) - A simple network device switch operator agent that can retrieve device statistics and reboot them.


## Quickstart

Follow this quickstart guide to use Plano as a router for local or hosted LLMs, including dynamic routing. Later in the section we will see how you can Plano to build highly capable agentic applications, and to provide e2e observability.

### Prerequisites

Before you begin, ensure you have the following:

1. [Docker System](https://docs.docker.com/get-started/get-docker/) (v24)
2. [Docker compose](https://docs.docker.com/compose/install/) (v2.29)
3. [Python](https://www.python.org/downloads/) (v3.13)

Plano's CLI allows you to manage and interact with the Plano gateway efficiently. To install the CLI, simply run the following command:

> [!TIP]
> We recommend that developers create a new Python virtual environment to isolate dependencies before installing Plano. This ensures that plano and its dependencies do not interfere with other packages on your system.

```console
$ python3.12 -m venv venv
$ source venv/bin/activate   # On Windows, use: venv\Scripts\activate
$ pip install plano==0.4.0
```

### Use Plano as a LLM Router
Plano supports multiple powerful routing strategies for LLMs. [Model-based routing](https://docs.arch.com/guides/llm_router.html#model-based-routing) gives you direct control over specific models and supports 11+ LLM providers including OpenAI, Anthropic, DeepSeek, Mistral, Groq, and more. [Alias-based routing](https://docs.arch.com/guides/llm_router.html#alias-based-routing) lets you create semantic model names that decouple your application code from specific providers, making it easy to experiment with different models or handle provider changes without refactoring. For full configuration examples and code walkthroughs, see our [routing guides](https://docs.arch.com/guides/llm_router.html).

#### Preference-aligned Routing
Preference-aligned routing provides intelligent, dynamic model selection based on natural language descriptions of tasks and preferences. Instead of hardcoded routing logic, you describe what each model is good at using plain English.

```yaml
version: v0.1.0

listeners:
  egress_traffic:
    address: 0.0.0.0
    port: 12000
    message_format: openai
    timeout: 30s

llm_providers:
  - model: openai/gpt-4o
    access_key: $OPENAI_API_KEY
    routing_preferences:
      - name: complex_reasoning
        description: deep analysis, mathematical problem solving, and logical reasoning
      - name: creative_writing
        description: storytelling, creative content, and artistic writing

  - model: deepseek/deepseek-coder
    access_key: $DEEPSEEK_API_KEY
    routing_preferences:
      - name: code_generation
        description: generating new code, writing functions, and creating scripts
      - name: code_review
        description: analyzing existing code for bugs, improvements, and optimization
```


Plano uses a lightweight 1.5B autoregressive model to intelligently map user prompts to these preferences, automatically selecting the best model for each request. This approach adapts to intent drift, supports multi-turn conversations, and avoids brittle embedding-based classifiers or manual if/else chains. No retraining required when adding models or updating policies — routing is governed entirely by human-readable rules.

**Learn More**: Check our [documentation](https://docs.plano.com/concepts/llm_providers/llm_providers.html) for comprehensive provider setup guides and routing strategies. You can learn more about the design, benchmarks, and methodology behind preference-based routing in our paper:

<div align="left">
  <a href="https://arxiv.org/abs/2506.16655" target="_blank">
    <img src="docs/source/_static/img/plano_router_paper_preview.png" alt="Plano Router Paper Preview">
  </a>
</div>

### Build Agentic Apps with Plano

In following quickstart we will show you how easy it is to build AI agent with Plano gateway. We will build a currency exchange agent using following simple steps. For this demo we will use `https://api.frankfurter.dev/` to fetch latest price for currencies and assume USD as base currency.

#### Step 1. Create plano config file

Create `plano_config.yaml` file with following content,

```yaml
version: v0.1.0

listeners:
  ingress_traffic:
    address: 0.0.0.0
    port: 10000
    message_format: openai
    timeout: 30s

llm_providers:
  - access_key: $OPENAI_API_KEY
    model: openai/gpt-4o

system_prompt: |
  You are a helpful assistant.

prompt_guards:
  input_guards:
    jailbreak:
      on_exception:
        message: Looks like you're curious about my abilities, but I can only provide assistance for currency exchange.

prompt_targets:
  - name: currency_exchange
    description: Get currency exchange rate from USD to other currencies
    parameters:
      - name: currency_symbol
        description: the currency that needs conversion
        required: true
        type: str
        in_path: true
    endpoint:
      name: frankfurter_api
      path: /v1/latest?base=USD&symbols={currency_symbol}
    system_prompt: |
      You are a helpful assistant. Show me the currency symbol you want to convert from USD.

  - name: get_supported_currencies
    description: Get list of supported currencies for conversion
    endpoint:
      name: frankfurter_api
      path: /v1/currencies

endpoints:
  frankfurter_api:
    endpoint: api.frankfurter.dev:443
    protocol: https
```

#### Step 2. Start plano gateway with currency conversion config

```sh

$ plano up plano_config.yaml
2024-12-05 16:56:27,979 - cli.main - INFO - Starting plano cli version: 0.4.0
2024-12-05 16:56:28,485 - cli.utils - INFO - Schema validation successful!
2024-12-05 16:56:28,485 - cli.main - INFO - Starting plano model server and plano gateway
2024-12-05 16:56:51,647 - cli.core - INFO - Container is healthy!
```

Once the gateway is up you can start interacting with at port 10000 using openai chat completion API.

Some of the sample queries you can ask could be `what is currency rate for gbp?` or `show me list of currencies for conversion`.

#### Step 3. Interacting with gateway using curl command

Here is a sample curl command you can use to interact,

```bash
$ curl --header 'Content-Type: application/json' \
  --data '{"messages": [{"role": "user","content": "what is exchange rate for gbp"}], "model": "none"}' \
  http://localhost:10000/v1/chat/completions | jq ".choices[0].message.content"

"As of the date provided in your context, December 5, 2024, the exchange rate for GBP (British Pound) from USD (United States Dollar) is 0.78558. This means that 1 USD is equivalent to 0.78558 GBP."

```

And to get list of supported currencies,

```bash
$ curl --header 'Content-Type: application/json' \
  --data '{"messages": [{"role": "user","content": "show me list of currencies that are supported for conversion"}], "model": "none"}' \
  http://localhost:10000/v1/chat/completions | jq ".choices[0].message.content"

"Here is a list of the currencies that are supported for conversion from USD, along with their symbols:\n\n1. AUD - Australian Dollar\n2. BGN - Bulgarian Lev\n3. BRL - Brazilian Real\n4. CAD - Canadian Dollar\n5. CHF - Swiss Franc\n6. CNY - Chinese Renminbi Yuan\n7. CZK - Czech Koruna\n8. DKK - Danish Krone\n9. EUR - Euro\n10. GBP - British Pound\n11. HKD - Hong Kong Dollar\n12. HUF - Hungarian Forint\n13. IDR - Indonesian Rupiah\n14. ILS - Israeli New Sheqel\n15. INR - Indian Rupee\n16. ISK - Icelandic Króna\n17. JPY - Japanese Yen\n18. KRW - South Korean Won\n19. MXN - Mexican Peso\n20. MYR - Malaysian Ringgit\n21. NOK - Norwegian Krone\n22. NZD - New Zealand Dollar\n23. PHP - Philippine Peso\n24. PLN - Polish Złoty\n25. RON - Romanian Leu\n26. SEK - Swedish Krona\n27. SGD - Singapore Dollar\n28. THB - Thai Baht\n29. TRY - Turkish Lira\n30. USD - United States Dollar\n31. ZAR - South African Rand\n\nIf you want to convert USD to any of these currencies, you can select the one you are interested in."

```

## [Observability](https://docs.plano.com/guides/observability/observability.html)
Plano is designed to support best-in class observability by supporting open standards. Please read our [docs](https://docs.plano.com/guides/observability/observability.html) on observability for more details on tracing, metrics, and logs. The screenshot below is from our integration with Signoz (among others)

![alt text](docs/source/_static/img/tracing.png)


## Contribution
We would love feedback on our [Roadmap](https://github.com/orgs/katanemo/projects/1) and we welcome contributions to **Plano**!
Whether you're fixing bugs, adding new features, improving documentation, or creating tutorials, your help is much appreciated.
Please visit our [Contribution Guide](CONTRIBUTING.md) for more details
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
+								<div align="center">
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								  <img src="docs/source/_static/img/PlanoTagline.svg" alt="Plano Logo" width="75%" height=auto>
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
+								</div>
 								<div align="center">
-												Update README.md
											
										
										
											2025-01-23 11:26:21 -08:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								 _Plano is a models-native proxy and data plane for agents._<br><br>
 								 Plano pulls out the rote plumbing work and decouples you from brittle framework abstractions, centralizing what shouldn’t be bespoke in every codebase - like agent routing and orchestration, rich agentic signals and traces for continuous improvement, guardrail filters for moderation, and smart LLM routing APIs for UX and DX agility. Use any language or AI framework, and deliver agents faster to production.
-												pushing docs updated (#508)

* pushing docs updated

* Fixed README.md logo

* Fixed README.md logo

* Fixed README.md spacing

* fixed tag line

* LLM router doc fixes

* minor logo and branding changes

* minor changes to the README

* minor changes to the README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
											
										
										
											2025-06-17 08:16:42 -07:00
-												Salmanap/fix readme 019a (#373)

* updated README based on feedback on reddit

* fixed typo

* updating README with minor fixes

* more fixes to README

* updated README

* updated README

* updated README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-01-20 14:44:40 -08:00
-												updated the spotify bearer authorization README and fixed main README… (#402)

* updated the spotify bearer authorization README and fixed main README links

* minor fixes to SPOTIFY README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-02-10 17:56:28 -08:00
+								[Quickstart](#Quickstart) •
 								[Demos](#Demos) •
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								[Route LLMs](#use-plano-as-a-llm-router) •
 								[Build Agentic Apps with Plano](#Build-Agentic-Apps-with-Plano) •
 								[Documentation](https://docs.planoai.dev) •
-												updated the spotify bearer authorization README and fixed main README… (#402)

* updated the spotify bearer authorization README and fixed main README links

* minor fixes to SPOTIFY README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-02-10 17:56:28 -08:00
+								[Contact](#Contact)
-												Salmanap/fix readme 019a (#373)

* updated README based on feedback on reddit

* fixed typo

* updating README with minor fixes

* more fixes to README

* updated README

* updated README

* updated README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-01-20 14:44:40 -08:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								[![pre-commit](https://github.com/katanemo/plano/actions/workflows/pre-commit.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/pre-commit.yml)
 								[![rust tests (prompt and llm gateway)](https://github.com/katanemo/plano/actions/workflows/rust_tests.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/rust_tests.yml)
 								[![e2e tests](https://github.com/katanemo/plano/actions/workflows/e2e_tests.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/e2e_tests.yml)
 								[![Build and Deploy Documentation](https://github.com/katanemo/plano/actions/workflows/static.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/static.yml)
-												updating README to better describe the problems we are solving (#437)

* updating README to better describe the problems we are solving
* fixing formatting issues

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-03-18 22:34:42 -07:00
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
+								</div>
 								# Overview
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								Building agentic demos is easy. Shipping agentic applications safely, reliably, and repeatably to production is hard. After the thrill of a quick hack, you end up building the “hidden middleware” to reach production: routing logic to reach the right agent, guardrail hooks for safety and moderation, evaluation and observability glue for continuous learning, and model/provider quirks scattered across frameworks and application code.
-												Adil/fix salman docs (#75)

* added the first set of docs for our technical docs

* more docuemtnation changes

* added support for prompt processing and updated life of a request

* updated docs to including getting help sections and updated life of a request

* committing local changes for getting started guide, sample applications, and full reference spec for prompt-config

* updated configuration reference, added sample app skeleton, updated favico

* fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now

* Updated docs with use cases and example code, updated what is arch, and made minor changes throughout

* fixed imaged and minor doc fixes

* add sphinx_book_theme

* updated README, and make some minor fixes to documetnation

* fixed README.md

* fixed image width

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
											
										
										
											2024-09-24 13:54:17 -07:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								Plano solves this by moving core delivery concerns into a unified, out-of-process dataplane.
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								- **🚦 Orchestration:** Low-latency orchestration between agents, and add new agents without changing app code
 								- **🔗 Model Agility:** Route [by model name, alias (semantic names) or automatically via preferences](#use-plano-as-a-llm-router)
 								- **🕵 Agentic Signals&trade;:** Zero-code capture of [behavior signals](#observability) plus OTEL traces/metrics across every agent.
 								- **🛡️ Moderation & Memory Hooks:** Build jailbreak protection, add moderation policies and memory consistently via [Filter Chains](https://docs.planoai.dev/concepts/filter_chain.html).
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								Plano pulls rote plumbing out of your framework so you can stay focused on what matters most: the core product logic of your agentic applications. Plano is backed by [industry-leading LLM research](https://planoai.dev/research) and built on [Envoy](https://envoyproxy.io) by its core contributors, who built critical infrastructure at scale for modern worklaods.
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								**High-Level Network Sequence Diagram**:
 								![high-level network plano arcitecture for Plano](docs/source/_static/img/plano_network_diagram_high_level.png)
-												Adil/fix salman docs (#75)

* added the first set of docs for our technical docs

* more docuemtnation changes

* added support for prompt processing and updated life of a request

* updated docs to including getting help sections and updated life of a request

* committing local changes for getting started guide, sample applications, and full reference spec for prompt-config

* updated configuration reference, added sample app skeleton, updated favico

* fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now

* Updated docs with use cases and example code, updated what is arch, and made minor changes throughout

* fixed imaged and minor doc fixes

* add sphinx_book_theme

* updated README, and make some minor fixes to documetnation

* fixed README.md

* fixed image width

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
											
										
										
											2024-09-24 13:54:17 -07:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								**Jump to our [docs](https://docs.planoai.dev)** to learn how you can use Plano to improve the speed, safety and obervability of your agentic applications.
-												Setup pre-commit so it runs locally before every git push (#12)

* Setup pre-commit so it runs locally before every git push

* Update .pre-commit-config.yaml

* added more checks

* update readme

* Apply suggestions from code review

Co-authored-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* remove cargo-check

---------

Co-authored-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
											
										
										
											2024-07-18 11:01:02 -07:00
-												add note about hosted arch-fc (#308)


											
										
										
											2024-11-26 14:19:10 -08:00
+								> [!IMPORTANT]
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								> Plano and the Arch family of LLMs (like Plano-Orchestrator-4B, Arch-Router, etc) are hosted free of charge in the US-central region to give you a great first-run developer experience of Plano. To scale and run in production, you can either run these LLMs locally or contact us on [Discord](https://discord.gg/pGZf2gcwEc) for API keys.
-												updating readme and docs with note about Arch-Function (#285)

* updating readme and docs with note about Arch-Function

* minor fixes to README

* a few more minor updates to the README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-11-19 08:43:56 -08:00
-												fixed cli to use poetry as well. this way we make it easy to have the… (#160)


											
										
										
											2024-10-09 15:53:12 -07:00
+								## Contact
-												fixing discord link and moving contributing guide to root (#215)

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-23 15:45:49 -07:00
+								To get in touch with us, please join our [discord server](https://discord.gg/pGZf2gcwEc). We will be monitoring that actively and offering support there.
-												added index.html and made minor README.md edits (#130)

* added index.html and made minor README.md edits

* minor fix to the text

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-06 17:16:48 -07:00
-												fixed cli to use poetry as well. this way we make it easy to have the… (#160)


											
										
										
											2024-10-09 15:53:12 -07:00
+								## Demos
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								* [Sample App: Weather Forecast Agent](demos/samples_python/weather_forecast/README.md) - A sample agentic weather forecasting app that highlights core function calling capabilities of Plano.
-												Fixed a few typos in README.md (#593)


											
										
										
											2025-10-21 16:51:58 +01:00
+								* [Sample App: Network Operator Agent](demos/samples_python/network_switch_operator_agent/README.md) - A simple network device switch operator agent that can retrieve device statistics and reboot them.
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												fixed cli to use poetry as well. this way we make it easy to have the… (#160)


											
										
										
											2024-10-09 15:53:12 -07:00
+								## Quickstart
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								Follow this quickstart guide to use Plano as a router for local or hosted LLMs, including dynamic routing. Later in the section we will see how you can Plano to build highly capable agentic applications, and to provide e2e observability.
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												fixed cli to use poetry as well. this way we make it easy to have the… (#160)


											
										
										
											2024-10-09 15:53:12 -07:00
+								### Prerequisites
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
 								Before you begin, ensure you have the following:
-												add requirements to readme (#249)


											
										
										
											2024-11-08 12:43:18 -06:00
+. [Docker System](https://docs.docker.com/get-started/get-docker/) (v24)
 . [Docker compose](https://docs.docker.com/compose/install/) (v2.29)
-												update base image to python3.13 (#554)


											
										
										
											2025-08-13 14:20:46 -07:00
+. [Python](https://www.python.org/downloads/) (v3.13)
-												add requirements to readme (#249)


											
										
										
											2024-11-08 12:43:18 -06:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								Plano's CLI allows you to manage and interact with the Plano gateway efficiently. To install the CLI, simply run the following command:
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
 								> [!TIP]
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								> We recommend that developers create a new Python virtual environment to isolate dependencies before installing Plano. This ensures that plano and its dependencies do not interfere with other packages on your system.
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
 								```console
-												updating the messaging to call ourselves the edge and AI gateway for … (#527)

* updating the messaging to call ourselves the edge and AI gateway for agents

* updating README to tidy up some language

* updating README to tidy up some language

* updating README to tidy up some language

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
											
										
										
											2025-07-12 03:25:09 -07:00
+								$ python3.12 -m venv venv
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
+								$ source venv/bin/activate   # On Windows, use: venv\Scripts\activate
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								$ pip install plano==0.4.0
-												updating readme and see how it flows (#556)

* updating readme and see how it flows

* fixed links
---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
											
										
										
											2025-08-21 06:29:47 -07:00
+								```
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								### Use Plano as a LLM Router
 								Plano supports multiple powerful routing strategies for LLMs. [Model-based routing](https://docs.arch.com/guides/llm_router.html#model-based-routing) gives you direct control over specific models and supports 11+ LLM providers including OpenAI, Anthropic, DeepSeek, Mistral, Groq, and more. [Alias-based routing](https://docs.arch.com/guides/llm_router.html#alias-based-routing) lets you create semantic model names that decouple your application code from specific providers, making it easy to experiment with different models or handle provider changes without refactoring. For full configuration examples and code walkthroughs, see our [routing guides](https://docs.arch.com/guides/llm_router.html).
-												Salmanap/fix docs new providers model alias (#571)

* fixed docs and added ollama as a first-class LLM provider

* matching the LLM routing section on the README.md to the docs

* updated the section on preference-based routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
											
										
										
											2025-09-19 10:19:57 -07:00
 								#### Preference-aligned Routing
 								Preference-aligned routing provides intelligent, dynamic model selection based on natural language descriptions of tasks and preferences. Instead of hardcoded routing logic, you describe what each model is good at using plain English.
 								```yaml
 								version: v0.1.0
 								listeners:
 								  egress_traffic:
 								    address: 0.0.0.0
 								    port: 12000
 								    message_format: openai
 								    timeout: 30s
 								llm_providers:
 								  - model: openai/gpt-4o
-												updating readme and see how it flows (#556)

* updating readme and see how it flows

* fixed links
---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
											
										
										
											2025-08-21 06:29:47 -07:00
+								    access_key: $OPENAI_API_KEY
 								    routing_preferences:
-												Salmanap/fix docs new providers model alias (#571)

* fixed docs and added ollama as a first-class LLM provider

* matching the LLM routing section on the README.md to the docs

* updated the section on preference-based routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
											
										
										
											2025-09-19 10:19:57 -07:00
+								      - name: complex_reasoning
 								        description: deep analysis, mathematical problem solving, and logical reasoning
 								      - name: creative_writing
 								        description: storytelling, creative content, and artistic writing
 								  - model: deepseek/deepseek-coder
 								    access_key: $DEEPSEEK_API_KEY
 								    routing_preferences:
 								      - name: code_generation
 								        description: generating new code, writing functions, and creating scripts
 								      - name: code_review
 								        description: analyzing existing code for bugs, improvements, and optimization
-												updating readme and see how it flows (#556)

* updating readme and see how it flows

* fixed links
---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
											
										
										
											2025-08-21 06:29:47 -07:00
+								```
-												Salmanap/fix docs new providers model alias (#571)

* fixed docs and added ollama as a first-class LLM provider

* matching the LLM routing section on the README.md to the docs

* updated the section on preference-based routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
											
										
										
											2025-09-19 10:19:57 -07:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
 								Plano uses a lightweight 1.5B autoregressive model to intelligently map user prompts to these preferences, automatically selecting the best model for each request. This approach adapts to intent drift, supports multi-turn conversations, and avoids brittle embedding-based classifiers or manual if/else chains. No retraining required when adding models or updating policies — routing is governed entirely by human-readable rules.
 								**Learn More**: Check our [documentation](https://docs.plano.com/concepts/llm_providers/llm_providers.html) for comprehensive provider setup guides and routing strategies. You can learn more about the design, benchmarks, and methodology behind preference-based routing in our paper:
-												updating readme and see how it flows (#556)

* updating readme and see how it flows

* fixed links
---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
											
										
										
											2025-08-21 06:29:47 -07:00
 								<div align="left">
 								  <a href="https://arxiv.org/abs/2506.16655" target="_blank">
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								    <img src="docs/source/_static/img/plano_router_paper_preview.png" alt="Plano Router Paper Preview">
-												updating readme and see how it flows (#556)

* updating readme and see how it flows

* fixed links
---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
											
										
										
											2025-08-21 06:29:47 -07:00
+								  </a>
 								</div>
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								### Build Agentic Apps with Plano
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								In following quickstart we will show you how easy it is to build AI agent with Plano gateway. We will build a currency exchange agent using following simple steps. For this demo we will use `https://api.frankfurter.dev/` to fetch latest price for currencies and assume USD as base currency.
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								#### Step 1. Create plano config file
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								Create `plano_config.yaml` file with following content,
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
 								```yaml
-												use consistent version across all arch_config files (#497)


											
										
										
											2025-05-31 01:11:14 -07:00
+								version: v0.1.0
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
-												update arch_config sample on readme to match with new format (#475)


											
										
										
											2025-04-29 12:36:46 -07:00
+								listeners:
 								  ingress_traffic:
 								    address: 0.0.0.0
 								    port: 10000
 								    message_format: openai
 								    timeout: 30s
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
 								llm_providers:
-												better model names (#517)


											
										
										
											2025-07-11 16:42:16 -07:00
+								  - access_key: $OPENAI_API_KEY
 								    model: openai/gpt-4o
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												fixed typos in arch_config.yaml file based on issue #221 (#223)

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-24 14:57:29 -07:00
+								system_prompt: |
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								  You are a helpful assistant.
 								prompt_guards:
 								  input_guards:
 								    jailbreak:
 								      on_exception:
 								        message: Looks like you're curious about my abilities, but I can only provide assistance for currency exchange.
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
 								prompt_targets:
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								  - name: currency_exchange
 								    description: Get currency exchange rate from USD to other currencies
 								    parameters:
 								      - name: currency_symbol
 								        description: the currency that needs conversion
 								        required: true
 								        type: str
 								        in_path: true
 								    endpoint:
-												Fixed a few typos in README.md (#593)


											
										
										
											2025-10-21 16:51:58 +01:00
+								      name: frankfurter_api
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								      path: /v1/latest?base=USD&symbols={currency_symbol}
 								    system_prompt: |
 								      You are a helpful assistant. Show me the currency symbol you want to convert from USD.
 								  - name: get_supported_currencies
 								    description: Get list of supported currencies for conversion
 								    endpoint:
-												Fixed a few typos in README.md (#593)


											
										
										
											2025-10-21 16:51:58 +01:00
+								      name: frankfurter_api
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								      path: /v1/currencies
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
+								endpoints:
-												Fixed a few typos in README.md (#593)


											
										
										
											2025-10-21 16:51:58 +01:00
+								  frankfurter_api:
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								    endpoint: api.frankfurter.dev:443
 								    protocol: https
 								```
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								#### Step 2. Start plano gateway with currency conversion config
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
 								```sh
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								$ plano up plano_config.yaml
 -12-05 16:56:27,979 - cli.main - INFO - Starting plano cli version: 0.4.0
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+-12-05 16:56:28,485 - cli.utils - INFO - Schema validation successful!
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+-12-05 16:56:28,485 - cli.main - INFO - Starting plano model server and plano gateway
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+-12-05 16:56:51,647 - cli.core - INFO - Container is healthy!
 								```
 								Once the gateway is up you can start interacting with at port 10000 using openai chat completion API.
 								Some of the sample queries you can ask could be `what is currency rate for gbp?` or `show me list of currencies for conversion`.
 								#### Step 3. Interacting with gateway using curl command
 								Here is a sample curl command you can use to interact,
 								```bash
 								$ curl --header 'Content-Type: application/json' \
-												make model required in readme and rst files (#503)


											
										
										
											2025-06-05 20:14:13 -07:00
+								  --data '{"messages": [{"role": "user","content": "what is exchange rate for gbp"}], "model": "none"}' \
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								  http://localhost:10000/v1/chat/completions | jq ".choices[0].message.content"
 								"As of the date provided in your context, December 5, 2024, the exchange rate for GBP (British Pound) from USD (United States Dollar) is 0.78558. This means that 1 USD is equivalent to 0.78558 GBP."
 								```
 								And to get list of supported currencies,
 								```bash
 								$ curl --header 'Content-Type: application/json' \
-												make model required in readme and rst files (#503)


											
										
										
											2025-06-05 20:14:13 -07:00
+								  --data '{"messages": [{"role": "user","content": "show me list of currencies that are supported for conversion"}], "model": "none"}' \
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								  http://localhost:10000/v1/chat/completions | jq ".choices[0].message.content"
 								"Here is a list of the currencies that are supported for conversion from USD, along with their symbols:\n\n1. AUD - Australian Dollar\n2. BGN - Bulgarian Lev\n3. BRL - Brazilian Real\n4. CAD - Canadian Dollar\n5. CHF - Swiss Franc\n6. CNY - Chinese Renminbi Yuan\n7. CZK - Czech Koruna\n8. DKK - Danish Krone\n9. EUR - Euro\n10. GBP - British Pound\n11. HKD - Hong Kong Dollar\n12. HUF - Hungarian Forint\n13. IDR - Indonesian Rupiah\n14. ILS - Israeli New Sheqel\n15. INR - Indian Rupee\n16. ISK - Icelandic Króna\n17. JPY - Japanese Yen\n18. KRW - South Korean Won\n19. MXN - Mexican Peso\n20. MYR - Malaysian Ringgit\n21. NOK - Norwegian Krone\n22. NZD - New Zealand Dollar\n23. PHP - Philippine Peso\n24. PLN - Polish Złoty\n25. RON - Romanian Leu\n26. SEK - Swedish Krona\n27. SGD - Singapore Dollar\n28. THB - Thai Baht\n29. TRY - Turkish Lira\n30. USD - United States Dollar\n31. ZAR - South African Rand\n\nIf you want to convert USD to any of these currencies, you can select the one you are interested in."
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
+								```
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								## [Observability](https://docs.plano.com/guides/observability/observability.html)
 								Plano is designed to support best-in class observability by supporting open standards. Please read our [docs](https://docs.plano.com/guides/observability/observability.html) on observability for more details on tracing, metrics, and logs. The screenshot below is from our integration with Signoz (among others)
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												updating docs to reflect changes in 0.1.2 like tracing via signoz and… (#271)


											
										
										
											2024-11-15 16:55:27 -08:00
+								![alt text](docs/source/_static/img/tracing.png)
-												Use better logs (#452)


											
										
										
											2025-03-27 10:40:20 -07:00
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								## Contribution
-												updating plano docs, README and CLI

											
										
										
											2025-12-19 17:45:51 -08:00
+								We would love feedback on our [Roadmap](https://github.com/orgs/katanemo/projects/1) and we welcome contributions to **Plano**!
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
+								Whether you're fixing bugs, adding new features, improving documentation, or creating tutorials, your help is much appreciated.
-												docs: update README.md (#220)

vist -> visit
											
										
										
											2024-10-24 12:37:26 +09:00
+								Please visit our [Contribution Guide](CONTRIBUTING.md) for more details