plano

mirror of https://github.com/katanemo/plano.git synced 2026-07-14 16:22:12 +02:00

Adil Hafeez 42d3de8906 bidirectional streaming for output filter chains Replace per-chunk HTTP requests to output filters with a single bidirectional streaming connection per filter. This eliminates the 50-200+ round-trips per streaming LLM response. Filters opt in via streaming: true in config. When all output filters support streaming, brightstaff opens one POST per filter with a streaming request body (Body::wrap_stream) and reads the streaming response. Filters that don't opt in fall back to the existing per-chunk behavior. Updates the PII deanonymizer demo as the reference implementation with request.stream() + StreamingResponse support. Made-with: Cursor		2026-03-19 02:27:26 -07:00
..
.vscode	use standard tracing and logging in brightstaff (#721 )	2026-02-09 13:33:27 -08:00
brightstaff	bidirectional streaming for output filter chains	2026-03-19 02:27:26 -07:00
common	bidirectional streaming for output filter chains	2026-03-19 02:27:26 -07:00
hermesllm	adding new supported models to plano (#829 )	2026-03-15 12:37:20 -07:00
llm_gateway	Unified overrides for custom router and orchestrator models (#820 )	2026-03-15 09:36:11 -07:00
prompt_gateway	Rename all arch references to plano (#745 )	2026-02-13 15:16:56 -08:00
build.sh	Use mcp tools for filter chain (#621 )	2025-12-17 17:30:14 -08:00
Cargo.lock	[ISSUE 706]: Standardize returned errors from Plano (#772 )	2026-02-24 14:34:33 -08:00
Cargo.toml	use standard tracing and logging in brightstaff (#721 )	2026-02-09 13:33:27 -08:00