plano/demos
Adil Hafeez 42d3de8906 bidirectional streaming for output filter chains
Replace per-chunk HTTP requests to output filters with a single
bidirectional streaming connection per filter. This eliminates
the 50-200+ round-trips per streaming LLM response.

Filters opt in via streaming: true in config. When all output filters
support streaming, brightstaff opens one POST per filter with a streaming
request body (Body::wrap_stream) and reads the streaming response. Filters
that don't opt in fall back to the existing per-chunk behavior.

Updates the PII deanonymizer demo as the reference implementation with
request.stream() + StreamingResponse support.

Made-with: Cursor
2026-03-19 02:27:26 -07:00
..
advanced add output filter chain (#822) 2026-03-18 17:58:20 -07:00
agent_orchestration add output filter chain (#822) 2026-03-18 17:58:20 -07:00
filter_chains bidirectional streaming for output filter chains 2026-03-19 02:27:26 -07:00
getting_started add output filter chain (#822) 2026-03-18 17:58:20 -07:00
integrations Run demos without Docker (#809) 2026-03-11 12:49:36 -07:00
llm_routing add k8s deployment manifests and docs for self-hosted Arch-Router (#831) 2026-03-16 12:05:30 -07:00
shared Run plano natively by default (#744) 2026-03-05 07:35:25 -08:00
README.md Support for Codex via Plano (#808) 2026-03-10 20:54:14 -07:00

Plano Demos

This directory contains demos showcasing Plano's capabilities as an AI-native proxy for agentic applications.

Getting Started

Demo Description
Weather Forecast Core function calling with a weather query agent, interactive chat UI, and Jaeger tracing
LLM Gateway Key management and dynamic routing to multiple LLM providers with header-based model override

LLM Routing

Demo Description
Preference-Based Routing Routes prompts to LLMs based on user-defined preferences and task type (e.g. code generation vs. understanding)
Model Alias Routing Maps semantic aliases (arch.summarize.v1) to provider-specific models for centralized governance
Claude Code Router Extends Claude Code with multi-provider access and preference-aligned routing for coding tasks
Codex Router Extends Codex CLI with multi-provider access and preference-aligned routing for coding tasks

Agent Orchestration

Demo Description
Travel Agents Multi-agent travel booking with weather and flight agents, intelligent routing, and OpenTelemetry tracing
Multi-Agent CrewAI & LangChain Framework-agnostic orchestration combining CrewAI and LangChain agents in unified conversations

Filter Chains

Demo Description
HTTP Filter RAG agent with filter chains for input validation, query rewriting, and context building
MCP Filter RAG agent using MCP-based filters for domain validation, query optimization, and knowledge base retrieval

Integrations

Demo Description
Ollama Use Ollama as a local LLM provider through Plano
Spotify Bearer Auth Bearer token authentication for third-party APIs (Spotify new releases and top tracks)

Advanced

Demo Description
Currency Exchange Function calling with public REST APIs (Frankfurter currency exchange)
Stock Quote Protected REST API integration with access key management
Multi-Turn RAG Multi-turn conversational RAG agent for answering questions about energy sources
Model Choice Test Harness Evaluation framework for safely testing and switching between models with benchmark fixtures