mirror of
https://github.com/katanemo/plano.git
synced 2026-05-06 22:32:42 +02:00
- extend brightstaff llm_chat_inner to extract X-Session-Id, check the session cache before routing, and cache the result afterward — same pattern as routing_service.rs - replace old urllib-based demo with a real FastAPI research agent that runs 3 independent tool-calling tasks with alternating intents so Plano routes to different models; demo.py is a pure httpx client that shows the routing trace side-by-side with and without session pinning |
||
|---|---|---|
| .. | ||
| advanced | ||
| agent_orchestration | ||
| filter_chains | ||
| getting_started | ||
| integrations | ||
| llm_routing | ||
| shared | ||
| README.md | ||
Plano Demos
This directory contains demos showcasing Plano's capabilities as an AI-native proxy for agentic applications.
Getting Started
| Demo | Description |
|---|---|
| Weather Forecast | Core function calling with a weather query agent, interactive chat UI, and Jaeger tracing |
| LLM Gateway | Key management and dynamic routing to multiple LLM providers with header-based model override |
LLM Routing
| Demo | Description |
|---|---|
| Preference-Based Routing | Routes prompts to LLMs based on user-defined preferences and task type (e.g. code generation vs. understanding) |
| Model Alias Routing | Maps semantic aliases (arch.summarize.v1) to provider-specific models for centralized governance |
| Claude Code Router | Extends Claude Code with multi-provider access and preference-aligned routing for coding tasks |
| Codex Router | Extends Codex CLI with multi-provider access and preference-aligned routing for coding tasks |
Agent Orchestration
| Demo | Description |
|---|---|
| Travel Agents | Multi-agent travel booking with weather and flight agents, intelligent routing, and OpenTelemetry tracing |
| Multi-Agent CrewAI & LangChain | Framework-agnostic orchestration combining CrewAI and LangChain agents in unified conversations |
Filter Chains
| Demo | Description |
|---|---|
| HTTP Filter | RAG agent with filter chains for input validation, query rewriting, and context building |
| MCP Filter | RAG agent using MCP-based filters for domain validation, query optimization, and knowledge base retrieval |
Integrations
| Demo | Description |
|---|---|
| Ollama | Use Ollama as a local LLM provider through Plano |
| Spotify Bearer Auth | Bearer token authentication for third-party APIs (Spotify new releases and top tracks) |
Advanced
| Demo | Description |
|---|---|
| Currency Exchange | Function calling with public REST APIs (Frankfurter currency exchange) |
| Stock Quote | Protected REST API integration with access key management |
| Multi-Turn RAG | Multi-turn conversational RAG agent for answering questions about energy sources |
| Model Choice Test Harness | Evaluation framework for safely testing and switching between models with benchmark fixtures |