plano/demos
Adil Hafeez bd7afd911e add DigitalOcean pricing, startup validation, and demo update
- MetricsSource::DigitalOceanPricing variant: fetch public DO Gen-AI pricing, normalize as lowercase(creator)/model_id, cost = input + output per million
- cost_metrics endpoint format updated to { "model": { "input_per_million": X, "output_per_million": Y } }
- Startup errors: prefer:cheapest requires cost source, prefer:fastest requires prometheus
- Startup warning: models with no pricing/latency data ranked last
- One-per-type enforcement: digitalocean_pricing; error if cost_metrics + digitalocean_pricing both configured
- cost_snapshot() / latency_snapshot() on ModelMetricsService for startup checks
- Demo config updated to v0.4.0 top-level routing_preferences with cheapest + fastest policies
- docker-compose.yaml + prometheus.yaml + metrics_server.py for demo latency metrics
- Schema and docs updated
2026-03-27 16:54:37 -07:00
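The ranking and startup rules listed in this commit can be sketched roughly as follows. This is a minimal illustration in Python, not the ModelMetricsService implementation: the function names, the source name `prometheus`, the `acme/...` model keys, and the prices are all hypothetical. Only the payload shape, the `lowercase(creator)/model_id` key, the `input + output` cost, and the error conditions come from the commit message.

```python
from __future__ import annotations

# Source names taken from the commit message; treating them as plain strings is an assumption.
COST_SOURCES = {"cost_metrics", "digitalocean_pricing"}


def normalize_key(creator: str, model_id: str) -> str:
    """Build the lookup key as lowercase(creator)/model_id (model_id left untouched)."""
    return f"{creator.lower()}/{model_id}"


def total_cost(entry: dict) -> float:
    """Cost used for ranking: input price plus output price, both per million tokens."""
    return entry["input_per_million"] + entry["output_per_million"]


def rank_cheapest(candidates: list[str], cost_metrics: dict) -> list[str]:
    """Sort candidates by total cost; models without pricing data are ranked last."""
    def sort_key(model: str):
        entry = cost_metrics.get(model)
        # (True, ...) sorts after (False, ...), so unpriced models go to the end.
        return (entry is None, total_cost(entry) if entry is not None else 0.0)
    return sorted(candidates, key=sort_key)


def validate_startup(preferences: list[str], sources: list[str]) -> list[str]:
    """Return startup errors for a set of routing preferences and configured sources."""
    errors = []
    configured_cost = COST_SOURCES & set(sources)
    if len(configured_cost) > 1:
        errors.append("cost_metrics and digitalocean_pricing cannot both be configured")
    if "cheapest" in preferences and not configured_cost:
        errors.append("prefer:cheapest requires a cost source")
    if "fastest" in preferences and "prometheus" not in sources:
        errors.append("prefer:fastest requires prometheus")
    return errors


# Hypothetical payload in the { "model": { "input_per_million": X, "output_per_million": Y } } shape.
example_metrics = {
    normalize_key("Acme", "model-a"): {"input_per_million": 2.0, "output_per_million": 8.0},
    normalize_key("Acme", "model-b"): {"input_per_million": 0.5, "output_per_million": 1.5},
}
ranked = rank_cheapest(["acme/model-a", "unknown/model-x", "acme/model-b"], example_metrics)
```

In this sketch a model with no pricing entry is kept in the candidate list and pushed to the end, which mirrors the startup warning that such models are ranked last rather than rejected.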
advanced add output filter chain (#822) 2026-03-18 17:58:20 -07:00
agent_orchestration add output filter chain (#822) 2026-03-18 17:58:20 -07:00
filter_chains add output filter chain (#822) 2026-03-18 17:58:20 -07:00
getting_started add output filter chain (#822) 2026-03-18 17:58:20 -07:00
integrations Run demos without Docker (#809) 2026-03-11 12:49:36 -07:00
llm_routing add DigitalOcean pricing, startup validation, and demo update 2026-03-27 16:54:37 -07:00
shared Run plano natively by default (#744) 2026-03-05 07:35:25 -08:00
README.md Support for Codex via Plano (#808) 2026-03-10 20:54:14 -07:00

Plano Demos

This directory contains demos showcasing Plano's capabilities as an AI-native proxy for agentic applications.

Getting Started

| Demo | Description |
| --- | --- |
| Weather Forecast | Core function calling with a weather query agent, interactive chat UI, and Jaeger tracing |
| LLM Gateway | Key management and dynamic routing to multiple LLM providers with header-based model override |

LLM Routing

| Demo | Description |
| --- | --- |
| Preference-Based Routing | Routes prompts to LLMs based on user-defined preferences and task type (e.g. code generation vs. understanding) |
| Model Alias Routing | Maps semantic aliases (arch.summarize.v1) to provider-specific models for centralized governance |
| Claude Code Router | Extends Claude Code with multi-provider access and preference-aligned routing for coding tasks |
| Codex Router | Extends Codex CLI with multi-provider access and preference-aligned routing for coding tasks |
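As a rough illustration of what model alias routing means, the sketch below resolves a semantic alias to a provider-specific model name before a request is forwarded. It is written in Python for brevity and does not reflect Plano's configuration schema: the target `provider-a/small-model` and the `resolve_model` helper are hypothetical, and only the alias `arch.summarize.v1` comes from the table above.

```python
from __future__ import annotations

# Hypothetical alias table: callers use the stable alias, operators change the target centrally.
MODEL_ALIASES = {
    "arch.summarize.v1": "provider-a/small-model",
}


def resolve_model(requested: str, aliases: dict[str, str]) -> str:
    """Return the provider-specific model for an alias, or the name unchanged if it is not an alias."""
    return aliases.get(requested, requested)


resolved = resolve_model("arch.summarize.v1", MODEL_ALIASES)
passthrough = resolve_model("provider-b/large-model", MODEL_ALIASES)
```

Because clients only ever send the alias, swapping the underlying model is a one-line change in the table rather than a change in every caller.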

Agent Orchestration

| Demo | Description |
| --- | --- |
| Travel Agents | Multi-agent travel booking with weather and flight agents, intelligent routing, and OpenTelemetry tracing |
| Multi-Agent CrewAI & LangChain | Framework-agnostic orchestration combining CrewAI and LangChain agents in unified conversations |

Filter Chains

| Demo | Description |
| --- | --- |
| HTTP Filter | RAG agent with filter chains for input validation, query rewriting, and context building |
| MCP Filter | RAG agent using MCP-based filters for domain validation, query optimization, and knowledge base retrieval |
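The idea of a filter chain (input validation, then query rewriting, then context building) can be sketched as a sequence of functions applied in order. This is a generic Python illustration under assumed names, not the code used by the demos: the request shape, the `FilterRejected` exception, and the keyword-match retrieval are all hypothetical stand-ins.

```python
from __future__ import annotations

from typing import Callable

Filter = Callable[[dict], dict]


class FilterRejected(Exception):
    """Raised by a filter to stop the chain and reject the request."""


def validate_input(request: dict) -> dict:
    """Reject requests whose query is missing or blank."""
    if not request.get("query", "").strip():
        raise FilterRejected("empty query")
    return request


def rewrite_query(request: dict) -> dict:
    """Normalize whitespace in the query (a stand-in for real query rewriting)."""
    return {**request, "query": " ".join(request["query"].split())}


def build_context(request: dict) -> dict:
    """Attach documents that share at least one word with the query (naive retrieval)."""
    words = request["query"].lower().split()
    docs = [d for d in request.get("knowledge_base", []) if any(w in d.lower() for w in words)]
    return {**request, "context": docs}


def run_chain(request: dict, filters: list[Filter]) -> dict:
    """Apply each filter in order; any filter may raise FilterRejected to abort."""
    for apply_filter in filters:
        request = apply_filter(request)
    return request


result = run_chain(
    {
        "query": "  solar   power ",
        "knowledge_base": ["Solar panels convert light.", "Wind turbines spin."],
    },
    [validate_input, rewrite_query, build_context],
)
```

Each filter takes a request and returns a possibly modified request, so filters can be reordered or replaced without touching the others.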

Integrations

| Demo | Description |
| --- | --- |
| Ollama | Use Ollama as a local LLM provider through Plano |
| Spotify Bearer Auth | Bearer token authentication for third-party APIs (Spotify new releases and top tracks) |

Advanced

| Demo | Description |
| --- | --- |
| Currency Exchange | Function calling with public REST APIs (Frankfurter currency exchange) |
| Stock Quote | Protected REST API integration with access key management |
| Multi-Turn RAG | Multi-turn conversational RAG agent for answering questions about energy sources |
| Model Choice Test Harness | Evaluation framework for safely testing and switching between models with benchmark fixtures |