mirror of https://github.com/katanemo/plano.git synced 2026-06-20 15:28:07 +02:00

Plano Demos

This directory contains demos showcasing Plano's capabilities as an AI-native proxy for agentic applications.

Getting Started

Demo	Description
Weather Forecast	Core function calling with a weather query agent, interactive chat UI, and Jaeger tracing
LLM Gateway	Key management and dynamic routing to multiple LLM providers with header-based model override

LLM Routing

Demo	Description
Preference-Based Routing	Routes prompts to LLMs based on user-defined preferences and task type (e.g. code generation vs. code understanding)
Model Alias Routing	Maps semantic aliases (`arch.summarize.v1`) to provider-specific models for centralized governance
Claude Code Router	Extends Claude Code with multi-provider access and preference-aligned routing for coding tasks
Codex Router	Extends Codex CLI with multi-provider access and preference-aligned routing for coding tasks
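
To make the idea behind Model Alias Routing concrete, here is a minimal sketch of alias resolution in Python. It is not Plano's implementation or configuration format: the `ALIASES` table, the `resolve_model` helper, and the target name `provider-a/small-model` are all hypothetical and only illustrate how a semantic alias such as `arch.summarize.v1` could map to a provider-specific model in one central place.

```python
# Hypothetical alias table: the target model name is illustrative only
# and does not come from the Plano demos.
ALIASES = {
    "arch.summarize.v1": "provider-a/small-model",
}


def resolve_model(requested: str, aliases: dict = ALIASES) -> str:
    """Return the concrete model for an alias, or the name unchanged.

    Names that are not registered as aliases pass through untouched, so
    callers can still request a provider-specific model directly.
    """
    return aliases.get(requested, requested)
```

Because clients only ever reference the alias, changing the model behind `arch.summarize.v1` means editing one table entry rather than every caller. See the Model Alias Routing demo for how Plano itself configures this.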

Agent Orchestration

Demo	Description
Travel Agents	Multi-agent travel booking with weather and flight agents, intelligent routing, and OpenTelemetry tracing
Multi-Agent CrewAI & LangChain	Framework-agnostic orchestration combining CrewAI and LangChain agents in unified conversations

Filter Chains

Demo	Description
HTTP Filter	RAG agent with filter chains for input validation, query rewriting, and context building
MCP Filter	RAG agent using MCP-based filters for domain validation, query optimization, and knowledge base retrieval

Integrations

Demo	Description
Ollama	Use Ollama as a local LLM provider through Plano
Spotify Bearer Auth	Bearer token authentication for third-party APIs (Spotify new releases and top tracks)

Advanced

Demo	Description
Currency Exchange	Function calling with public REST APIs (Frankfurter currency exchange)
Stock Quote	Protected REST API integration with access key management
Multi-Turn RAG	Multi-turn conversational RAG agent for answering questions about energy sources
Model Choice Test Harness	Evaluation framework for safely testing and switching between models with benchmark fixtures