plano/README.md

<div align="center">
  <img src="docs/source/_static/img/PlanoTagline.svg" alt="Plano Logo" width="75%" height=auto>
</div>
<div align="center">

 _The AI-native proxy server and data plane for agentic apps._<br><br>
 Plano pulls out the rote plumbing work and decouples you from brittle framework abstractions, centralizing what shouldn’t be bespoke in every codebase - like agent routing and orchestration, rich agentic signals and traces for continuous improvement, guardrail filters for safety and moderation, and smart LLM routing APIs for model agility. Use any language or AI framework, and deliver agents faster to production.


[Quickstart Guide](https://docs.planoai.dev/get_started/quickstart.html) •
[Build Agentic Apps with Plano](#Build-Agentic-Apps-with-Plano) •
[Documentation](https://docs.planoai.dev) •
[Contact](#Contact)

[![CI](https://github.com/katanemo/plano/actions/workflows/ci.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/ci.yml)
[![Docker Image](https://github.com/katanemo/plano/actions/workflows/docker-push-main.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/docker-push-main.yml)
[![Build and Deploy Documentation](https://github.com/katanemo/plano/actions/workflows/static.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/static.yml)

Star ⭐️ the repo if you found Plano useful — new releases and updates land here first.
</div>

# Overview
Building agentic demos is easy. Shipping agentic applications safely, reliably, and repeatably to production is hard. After the thrill of a quick hack, you end up building the “hidden middleware” to reach production: routing logic to reach the right agent, guardrail hooks for safety and moderation, evaluation and observability glue for continuous learning, and model/provider quirks scattered across frameworks and application code.

Plano solves this by moving core delivery concerns into a unified, out-of-process dataplane.

- **🚦 Orchestration:** Low-latency orchestration between agents; add new agents without modifying app code.
- **🔗 Model Agility:** Route [by model name, alias (semantic names) or automatically via preferences](#use-plano-as-a-llm-router).
- **🕵 Agentic Signals&trade;:** Zero-code capture of [Signals](https://docs.planoai.dev/concepts/signals.html) plus OTEL traces/metrics across every agent.
- **🛡️ Moderation & Memory Hooks:** Build jailbreak protection, add moderation policies and memory consistently via [Filter Chains](https://docs.planoai.dev/concepts/filter_chain.html).

Plano pulls rote plumbing out of your framework so you can stay focused on what matters most: the core product logic of your agentic applications. Plano is backed by [industry-leading LLM research](https://planoai.dev/research) and built on [Envoy](https://envoyproxy.io) by its core contributors, who built critical infrastructure at scale for modern worklaods.

**High-Level Network Sequence Diagram**:
![high-level network plano arcitecture for Plano](docs/source/_static/img/plano_network_diagram_high_level.png)

**Jump to our [docs](https://docs.planoai.dev)** to learn how you can use Plano to improve the speed, safety and obervability of your agentic applications.

> [!IMPORTANT]
> Plano and the Plano family of LLMs (like Plano-Orchestrator) are hosted free of charge in the US-central region to give you a great first-run developer experience of Plano. To scale and run in production, you can either run these LLMs locally or contact us on [Discord](https://discord.gg/pGZf2gcwEc) for API keys.

---

## Build Agentic Apps with Plano

Plano handles **orchestration, model management, and observability** as modular building blocks - letting you configure only what you need (edge proxying for agentic orchestration and guardrails, or LLM routing from your services, or both together) to fit cleanly into existing architectures. Below is a simple multi-agent travel agent built with Plano that showcases all three core capabilities

> 📁 **Full working code:** See [`demos/agent_orchestration/travel_agents/`](demos/agent_orchestration/travel_agents/) for complete weather and flight agents you can run locally.


### 1. Define Your Agents in YAML

```yaml
# config.yaml
version: v0.3.0

# What you declare: Agent URLs and natural language descriptions
# What you don't write: Intent classifiers, routing logic, model fallbacks, provider adapters, or tracing instrumentation

agents:
  - id: weather_agent
    url: http://localhost:10510
  - id: flight_agent
    url: http://localhost:10520

model_providers:
  - model: openai/gpt-4o
    access_key: $OPENAI_API_KEY
    default: true
  - model: anthropic/claude-3-5-sonnet
    access_key: $ANTHROPIC_API_KEY

listeners:
  - type: agent
    name: travel_assistant
    port: 8001
    router: plano_orchestrator_v1  # Powered by our 4B-parameter routing model. You can change this to different models
    agents:
      - id: weather_agent
        description: |
          Gets real-time weather and forecasts for any city worldwide.
          Handles: "What's the weather in Paris?", "Will it rain in Tokyo?"

      - id: flight_agent
        description: |
          Searches flights between airports with live status and schedules.
          Handles: "Flights from NYC to LA", "Show me flights to Seattle"

tracing:
  random_sampling: 100  # Auto-capture traces for evaluation
```

### 2. Write Simple Agent Code

Your agents are just HTTP servers that implement the OpenAI-compatible chat completions endpoint. Use any language or framework:

```python
# weather_agent.py
from fastapi import FastAPI, Request
from fastapi.responses import StreamingResponse
from openai import AsyncOpenAI

app = FastAPI()

# Point to Plano's LLM gateway - it handles model routing for you
llm = AsyncOpenAI(base_url="http://localhost:12001/v1", api_key="EMPTY")

@app.post("/v1/chat/completions")
async def chat(request: Request):
    body = await request.json()
    messages = body.get("messages", [])
    days = 7

    # Your agent logic: fetch data, call APIs, run tools
    # See demos/agent_orchestration/travel_agents/ for the full implementation
    weather_data = await get_weather_data(request, messages, days)

    # Stream the response back through Plano
    async def generate():
        stream = await llm.chat.completions.create(
            model="openai/gpt-4o",
            messages=[{"role": "system", "content": f"Weather: {weather_data}"}, *messages],
            stream=True
        )
        async for chunk in stream:
            yield f"data: {chunk.model_dump_json()}\n\n"

    return StreamingResponse(generate(), media_type="text/event-stream")
```

### 3. Start Plano & Query Your Agents

**Prerequisites:** Follow the [prerequisites guide](https://docs.planoai.dev/get_started/quickstart.html#prerequisites) to install Plano and set up your environment.

```bash
# Start Plano
planoai up config.yaml
...

# Query - Plano intelligently routes to both agents in a single conversation
curl http://localhost:8001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [
      {"role": "user", "content": "I want to travel from NYC to Paris next week. What is the weather like there, and can you find me some flights?"}
    ]
  }'
# → Plano routes to weather_agent for Paris weather ✓
# → Then routes to flight_agent for NYC → Paris flights ✓
# → Returns a complete travel plan with both weather info and flight options
```

### 4. Get Observability and Model Agility for Free

Every request is traced end-to-end with OpenTelemetry - no instrumentation code needed.

![Atomatic Tracing](docs/source/_static/img/demo_tracing.png)

### What You Didn't Have to Build

| Infrastructure Concern | Without Plano | With Plano |
|---------|---------------|------------|
| **Agent Orchestration** | Write intent classifier + routing logic | Declare agent descriptions in YAML |
| **Model Management** | Handle each provider's API quirks | Unified LLM APIs with state management |
| **Rich Tracing** | Instrument every service with OTEL | Automatic end-to-end traces and logs |
| **Learning Signals** | Build pipeline to capture/export spans | Zero-code agentic signals |
| **Adding Agents** | Update routing code, test, redeploy | Add to config, restart |

**Why it's efficient:** Plano uses purpose-built, lightweight LLMs (like our 4B-parameter orchestrator) instead of heavyweight frameworks or GPT-4 for routing - giving you production-grade routing at a fraction of the cost and latency.

---

## Contact
To get in touch with us, please join our [discord server](https://discord.gg/pGZf2gcwEc). We actively monitor that and offer support there.

## Getting Started

Ready to try Plano? Check out our comprehensive documentation:

- **[Quickstart Guide](https://docs.planoai.dev/get_started/quickstart.html)** - Get up and running in minutes
- **[LLM Routing](https://docs.planoai.dev/guides/llm_router.html)** - Route by model name, alias, or intelligent preferences
- **[Agent Orchestration](https://docs.planoai.dev/guides/orchestration.html)** - Build multi-agent workflows
- **[Filter Chains](https://docs.planoai.dev/concepts/filter_chain.html)** - Add guardrails, moderation, and memory hooks
- **[Prompt Targets](https://docs.planoai.dev/concepts/prompt_target.html)** - Turn prompts into deterministic API calls
- **[Observability](https://docs.planoai.dev/guides/observability/observability.html)** - Traces, metrics, and logs

## Contribution
We would love feedback on our [Roadmap](https://github.com/orgs/katanemo/projects/1) and we welcome contributions to **Plano**! Whether you're fixing bugs, adding new features, improving documentation, or creating tutorials, your help is much appreciated. Please visit our [Contribution Guide](CONTRIBUTING.md) for more details

Star ⭐️ the repo if you found Plano useful — new releases and updates land here first.
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
+								<div align="center">
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								  <img src="docs/source/_static/img/PlanoTagline.svg" alt="Plano Logo" width="75%" height=auto>
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
+								</div>
 								<div align="center">
-												Update README.md
											
										
										
											2025-01-23 11:26:21 -08:00
-												updated readme with a snippet of code to go along with the descriptio… (#674)

* updated readme with a snippet of code to go along with the description of the proejct

* updated readme

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-03 23:22:04 -08:00
+								 _The AI-native proxy server and data plane for agentic apps._<br><br>
 								 Plano pulls out the rote plumbing work and decouples you from brittle framework abstractions, centralizing what shouldn’t be bespoke in every codebase - like agent routing and orchestration, rich agentic signals and traces for continuous improvement, guardrail filters for safety and moderation, and smart LLM routing APIs for model agility. Use any language or AI framework, and deliver agents faster to production.
-												pushing docs updated (#508)

* pushing docs updated

* Fixed README.md logo

* Fixed README.md logo

* Fixed README.md spacing

* fixed tag line

* LLM router doc fixes

* minor logo and branding changes

* minor changes to the README

* minor changes to the README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
											
										
										
											2025-06-17 08:16:42 -07:00
-												Salmanap/fix readme 019a (#373)

* updated README based on feedback on reddit

* fixed typo

* updating README with minor fixes

* more fixes to README

* updated README

* updated README

* updated README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-01-20 14:44:40 -08:00
-												updating quick start and pre-req sections

											
										
										
											2026-01-03 23:49:27 -08:00
+								[Quickstart Guide](https://docs.planoai.dev/get_started/quickstart.html) •
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								[Build Agentic Apps with Plano](#Build-Agentic-Apps-with-Plano) •
 								[Documentation](https://docs.planoai.dev) •
-												updated the spotify bearer authorization README and fixed main README… (#402)

* updated the spotify bearer authorization README and fixed main README links

* minor fixes to SPOTIFY README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-02-10 17:56:28 -08:00
+								[Contact](#Contact)
-												Salmanap/fix readme 019a (#373)

* updated README based on feedback on reddit

* fixed typo

* updating README with minor fixes

* more fixes to README

* updated README

* updated README

* updated README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-01-20 14:44:40 -08:00
-												Update GitHub badges after CI consolidation (#758)

* Update GitHub badges after CI workflow consolidation

Fix broken README badges pointing to deleted workflow files (pre-commit.yml,
rust_tests.yml, e2e_tests.yml) and replace with consolidated CI badge. Add
Docker image publish badge and dynamic Trivy security scan badge.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* Use existing ADIL_GITHUB_TOKEN secret for security badge

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
											
										
										
											2026-02-15 00:26:44 -08:00
+								[![CI](https://github.com/katanemo/plano/actions/workflows/ci.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/ci.yml)
 								[![Docker Image](https://github.com/katanemo/plano/actions/workflows/docker-push-main.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/docker-push-main.yml)
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								[![Build and Deploy Documentation](https://github.com/katanemo/plano/actions/workflows/static.yml/badge.svg)](https://github.com/katanemo/plano/actions/workflows/static.yml)
-												updating README to better describe the problems we are solving (#437)

* updating README to better describe the problems we are solving
* fixing formatting issues

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2025-03-18 22:34:42 -07:00
-												add social support banner

											
										
										
											2026-01-04 00:03:39 -08:00
+								Star ⭐️ the repo if you found Plano useful — new releases and updates land here first.
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
+								</div>
 								# Overview
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								Building agentic demos is easy. Shipping agentic applications safely, reliably, and repeatably to production is hard. After the thrill of a quick hack, you end up building the “hidden middleware” to reach production: routing logic to reach the right agent, guardrail hooks for safety and moderation, evaluation and observability glue for continuous learning, and model/provider quirks scattered across frameworks and application code.
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								Plano solves this by moving core delivery concerns into a unified, out-of-process dataplane.
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								- **🚦 Orchestration:** Low-latency orchestration between agents; add new agents without modifying app code.
 								- **🔗 Model Agility:** Route [by model name, alias (semantic names) or automatically via preferences](#use-plano-as-a-llm-router).
-												tweaks to web and docs to align to 0.4.2 (#680)

* tweaks to web and docs to align to 0.4.2

* made our release banner clickable

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-07 13:51:40 -08:00
+								- **🕵 Agentic Signals&trade;:** Zero-code capture of [Signals](https://docs.planoai.dev/concepts/signals.html) plus OTEL traces/metrics across every agent.
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								- **🛡️ Moderation & Memory Hooks:** Build jailbreak protection, add moderation policies and memory consistently via [Filter Chains](https://docs.planoai.dev/concepts/filter_chain.html).
-												Adil/fix salman docs (#75)

* added the first set of docs for our technical docs

* more docuemtnation changes

* added support for prompt processing and updated life of a request

* updated docs to including getting help sections and updated life of a request

* committing local changes for getting started guide, sample applications, and full reference spec for prompt-config

* updated configuration reference, added sample app skeleton, updated favico

* fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now

* Updated docs with use cases and example code, updated what is arch, and made minor changes throughout

* fixed imaged and minor doc fixes

* add sphinx_book_theme

* updated README, and make some minor fixes to documetnation

* fixed README.md

* fixed image width

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
											
										
										
											2024-09-24 13:54:17 -07:00
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								Plano pulls rote plumbing out of your framework so you can stay focused on what matters most: the core product logic of your agentic applications. Plano is backed by [industry-leading LLM research](https://planoai.dev/research) and built on [Envoy](https://envoyproxy.io) by its core contributors, who built critical infrastructure at scale for modern worklaods.
-												Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400)


											
										
										
											2025-02-10 11:23:04 -08:00
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								**High-Level Network Sequence Diagram**:
 								![high-level network plano arcitecture for Plano](docs/source/_static/img/plano_network_diagram_high_level.png)
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								**Jump to our [docs](https://docs.planoai.dev)** to learn how you can use Plano to improve the speed, safety and obervability of your agentic applications.
-												Setup pre-commit so it runs locally before every git push (#12)

* Setup pre-commit so it runs locally before every git push

* Update .pre-commit-config.yaml

* added more checks

* update readme

* Apply suggestions from code review

Co-authored-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* remove cargo-check

---------

Co-authored-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
											
										
										
											2024-07-18 11:01:02 -07:00
-												add note about hosted arch-fc (#308)


											
										
										
											2024-11-26 14:19:10 -08:00
+								> [!IMPORTANT]
-												use plano-orchestrator for LLM routing, remove arch-router (#886)
											
										
										
											2026-04-15 16:41:42 -07:00
+								> Plano and the Plano family of LLMs (like Plano-Orchestrator) are hosted free of charge in the US-central region to give you a great first-run developer experience of Plano. To scale and run in production, you can either run these LLMs locally or contact us on [Discord](https://discord.gg/pGZf2gcwEc) for API keys.
-												updating readme and docs with note about Arch-Function (#285)

* updating readme and docs with note about Arch-Function

* minor fixes to README

* a few more minor updates to the README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-11-19 08:43:56 -08:00
-												updated readme with a snippet of code to go along with the descriptio… (#674)

* updated readme with a snippet of code to go along with the description of the proejct

* updated readme

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-03 23:22:04 -08:00
+								---
 								## Build Agentic Apps with Plano
 								Plano handles **orchestration, model management, and observability** as modular building blocks - letting you configure only what you need (edge proxying for agentic orchestration and guardrails, or LLM routing from your services, or both together) to fit cleanly into existing architectures. Below is a simple multi-agent travel agent built with Plano that showcases all three core capabilities
-												Overhaul demos directory: cleanup, restructure, and standardize configs (#760)


											
										
										
											2026-02-17 03:09:28 -08:00
+								> 📁 **Full working code:** See [`demos/agent_orchestration/travel_agents/`](demos/agent_orchestration/travel_agents/) for complete weather and flight agents you can run locally.
-												updated readme with a snippet of code to go along with the descriptio… (#674)

* updated readme with a snippet of code to go along with the description of the proejct

* updated readme

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-03 23:22:04 -08:00
-												updating quick start and pre-req sections

											
										
										
											2026-01-03 23:49:27 -08:00
-												updated readme with a snippet of code to go along with the descriptio… (#674)

* updated readme with a snippet of code to go along with the description of the proejct

* updated readme

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-03 23:22:04 -08:00
+								### 1. Define Your Agents in YAML
 								```yaml
 								# config.yaml
 								version: v0.3.0
 								# What you declare: Agent URLs and natural language descriptions
 								# What you don't write: Intent classifiers, routing logic, model fallbacks, provider adapters, or tracing instrumentation
 								agents:
 								  - id: weather_agent
 								    url: http://localhost:10510
 								  - id: flight_agent
 								    url: http://localhost:10520
 								model_providers:
 								  - model: openai/gpt-4o
 								    access_key: $OPENAI_API_KEY
 								    default: true
 								  - model: anthropic/claude-3-5-sonnet
 								    access_key: $ANTHROPIC_API_KEY
 								listeners:
 								  - type: agent
 								    name: travel_assistant
 								    port: 8001
 								    router: plano_orchestrator_v1  # Powered by our 4B-parameter routing model. You can change this to different models
 								    agents:
 								      - id: weather_agent
 								        description: |
 								          Gets real-time weather and forecasts for any city worldwide.
 								          Handles: "What's the weather in Paris?", "Will it rain in Tokyo?"
 								      - id: flight_agent
 								        description: |
 								          Searches flights between airports with live status and schedules.
 								          Handles: "Flights from NYC to LA", "Show me flights to Seattle"
 								tracing:
 								  random_sampling: 100  # Auto-capture traces for evaluation
 								```
 								### 2. Write Simple Agent Code
 								Your agents are just HTTP servers that implement the OpenAI-compatible chat completions endpoint. Use any language or framework:
 								```python
 								# weather_agent.py
 								from fastapi import FastAPI, Request
 								from fastapi.responses import StreamingResponse
 								from openai import AsyncOpenAI
 								app = FastAPI()
 								# Point to Plano's LLM gateway - it handles model routing for you
 								llm = AsyncOpenAI(base_url="http://localhost:12001/v1", api_key="EMPTY")
 								@app.post("/v1/chat/completions")
 								async def chat(request: Request):
 								    body = await request.json()
 								    messages = body.get("messages", [])
 								    days = 7
 								    # Your agent logic: fetch data, call APIs, run tools
-												Overhaul demos directory: cleanup, restructure, and standardize configs (#760)


											
										
										
											2026-02-17 03:09:28 -08:00
+								    # See demos/agent_orchestration/travel_agents/ for the full implementation
-												updated readme with a snippet of code to go along with the descriptio… (#674)

* updated readme with a snippet of code to go along with the description of the proejct

* updated readme

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-03 23:22:04 -08:00
+								    weather_data = await get_weather_data(request, messages, days)
 								    # Stream the response back through Plano
 								    async def generate():
 								        stream = await llm.chat.completions.create(
 								            model="openai/gpt-4o",
 								            messages=[{"role": "system", "content": f"Weather: {weather_data}"}, *messages],
 								            stream=True
 								        )
 								        async for chunk in stream:
 								            yield f"data: {chunk.model_dump_json()}\n\n"
 								    return StreamingResponse(generate(), media_type="text/event-stream")
 								```
 								### 3. Start Plano & Query Your Agents
-												updating quick start and pre-req sections

											
										
										
											2026-01-03 23:51:17 -08:00
+								**Prerequisites:** Follow the [prerequisites guide](https://docs.planoai.dev/get_started/quickstart.html#prerequisites) to install Plano and set up your environment.
-												updating quick start and pre-req sections

											
										
										
											2026-01-03 23:49:27 -08:00
-												updated readme with a snippet of code to go along with the descriptio… (#674)

* updated readme with a snippet of code to go along with the description of the proejct

* updated readme

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-03 23:22:04 -08:00
+								```bash
 								# Start Plano
 								planoai up config.yaml
 								...
 								# Query - Plano intelligently routes to both agents in a single conversation
 								curl http://localhost:8001/v1/chat/completions \
 								  -H "Content-Type: application/json" \
 								  -d '{
 								    "model": "gpt-4o",
 								    "messages": [
 								      {"role": "user", "content": "I want to travel from NYC to Paris next week. What is the weather like there, and can you find me some flights?"}
 								    ]
 								  }'
 								# → Plano routes to weather_agent for Paris weather ✓
 								# → Then routes to flight_agent for NYC → Paris flights ✓
 								# → Returns a complete travel plan with both weather info and flight options
 								```
 								### 4. Get Observability and Model Agility for Free
 								Every request is traced end-to-end with OpenTelemetry - no instrumentation code needed.
 								![Atomatic Tracing](docs/source/_static/img/demo_tracing.png)
 								### What You Didn't Have to Build
 								| Infrastructure Concern | Without Plano | With Plano |
 								|---------|---------------|------------|
 								| **Agent Orchestration** | Write intent classifier + routing logic | Declare agent descriptions in YAML |
 								| **Model Management** | Handle each provider's API quirks | Unified LLM APIs with state management |
 								| **Rich Tracing** | Instrument every service with OTEL | Automatic end-to-end traces and logs |
 								| **Learning Signals** | Build pipeline to capture/export spans | Zero-code agentic signals |
 								| **Adding Agents** | Update routing code, test, redeploy | Add to config, restart |
 								**Why it's efficient:** Plano uses purpose-built, lightweight LLMs (like our 4B-parameter orchestrator) instead of heavyweight frameworks or GPT-4 for routing - giving you production-grade routing at a fraction of the cost and latency.
 								---
-												fixed cli to use poetry as well. this way we make it easy to have the… (#160)


											
										
										
											2024-10-09 15:53:12 -07:00
+								## Contact
-												Update docs to Plano (#639)


											
										
										
											2025-12-23 17:14:50 -08:00
+								To get in touch with us, please join our [discord server](https://discord.gg/pGZf2gcwEc). We actively monitor that and offer support there.
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												simplify readme and point links to docs.planoai.dev (#672)


											
										
										
											2026-01-02 13:45:29 -08:00
+								## Getting Started
-												updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
											
										
										
											2024-10-08 23:58:55 -07:00
-												simplify readme and point links to docs.planoai.dev (#672)


											
										
										
											2026-01-02 13:45:29 -08:00
+								Ready to try Plano? Check out our comprehensive documentation:
-												updating docs to reflect changes in 0.1.2 like tracing via signoz and… (#271)


											
										
										
											2024-11-15 16:55:27 -08:00
-												simplify readme and point links to docs.planoai.dev (#672)


											
										
										
											2026-01-02 13:45:29 -08:00
+								- **[Quickstart Guide](https://docs.planoai.dev/get_started/quickstart.html)** - Get up and running in minutes
 								- **[LLM Routing](https://docs.planoai.dev/guides/llm_router.html)** - Route by model name, alias, or intelligent preferences
 								- **[Agent Orchestration](https://docs.planoai.dev/guides/orchestration.html)** - Build multi-agent workflows
-												updated readme with a snippet of code to go along with the descriptio… (#674)

* updated readme with a snippet of code to go along with the description of the proejct

* updated readme

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-03 23:22:04 -08:00
+								- **[Filter Chains](https://docs.planoai.dev/concepts/filter_chain.html)** - Add guardrails, moderation, and memory hooks
-												simplify readme and point links to docs.planoai.dev (#672)


											
										
										
											2026-01-02 13:45:29 -08:00
+								- **[Prompt Targets](https://docs.planoai.dev/concepts/prompt_target.html)** - Turn prompts into deterministic API calls
 								- **[Observability](https://docs.planoai.dev/guides/observability/observability.html)** - Traces, metrics, and logs
-												Use better logs (#452)


											
										
										
											2025-03-27 10:40:20 -07:00
-												update getting started guide and add llm gateway and prompt gateway samples (#330)


											
										
										
											2024-12-06 14:37:33 -08:00
+								## Contribution
-												updated readme with a snippet of code to go along with the descriptio… (#674)

* updated readme with a snippet of code to go along with the description of the proejct

* updated readme

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
											
										
										
											2026-01-03 23:22:04 -08:00
+								We would love feedback on our [Roadmap](https://github.com/orgs/katanemo/projects/1) and we welcome contributions to **Plano**! Whether you're fixing bugs, adding new features, improving documentation, or creating tutorials, your help is much appreciated. Please visit our [Contribution Guide](CONTRIBUTING.md) for more details
-												add social support banner

											
										
										
											2026-01-04 00:03:39 -08:00
 								Star ⭐️ the repo if you found Plano useful — new releases and updates land here first.