Commit graph

3 commits

Author SHA1 Message Date
Adil Hafeez
da9792c2dd rename session pinning to model affinity with x-model-affinity header 2026-04-08 15:23:53 -07:00
Adil Hafeez
5789694d2f rename x-session-id to x-routing-session-id and fix routing config field name 2026-04-08 12:39:43 -07:00
Adil Hafeez
0105897692 add session pinning to llm_chat handler and rewrite session pinning demo
- extend brightstaff llm_chat_inner to extract X-Session-Id, check the
  session cache before routing, and cache the result afterward — same
  pattern as routing_service.rs
- replace old urllib-based demo with a real FastAPI research agent that
  runs 3 independent tool-calling tasks with alternating intents so
  Plano routes to different models; demo.py is a pure httpx client that
  shows the routing trace side-by-side with and without session pinning
2026-03-26 16:44:05 -07:00
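
The pattern described in the first commit (extract the session header, check the session cache before routing, cache the result afterward) can be sketched roughly as follows. This is an illustrative Python sketch only, not the brightstaff Rust code: `route_with_session_pinning`, `pick_model`, and the dict-based cache are hypothetical names assumed for the example, and the header name is the one from that commit message (later commits rename it).

```python
from typing import Callable, Dict, Mapping, Optional

# Header name as given in the first commit message; later commits rename it.
SESSION_HEADER = "x-session-id"


def route_with_session_pinning(
    headers: Mapping[str, str],
    prompt: str,
    session_cache: Dict[str, str],
    pick_model: Callable[[str], str],
) -> str:
    """Return a model name, reusing the cached choice for a known session.

    `pick_model` stands in for whatever router chooses a model per request
    (hypothetical); `session_cache` is a plain dict assumed for illustration.
    """
    # HTTP header names are case-insensitive, so compare in lowercase.
    session_id: Optional[str] = next(
        (v for k, v in headers.items() if k.lower() == SESSION_HEADER), None
    )

    # 1. Check the session cache before routing.
    if session_id is not None:
        cached = session_cache.get(session_id)
        if cached is not None:
            return cached

    # 2. No pinned model yet: route normally.
    model = pick_model(prompt)

    # 3. Cache the result so later requests in this session stay on it.
    if session_id is not None:
        session_cache[session_id] = model
    return model
```

With a session header present, the second request keeps the model chosen for the first one even if its intent differs; without the header, every request is routed independently.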