mirror of
https://github.com/katanemo/plano.git
synced 2026-06-20 15:28:07 +02:00
773 B
773 B
RAG Agent Query Parser
A FastAPI service that rewrites user queries using archgw and gpt-4o-mini for better retrieval accuracy.
How it Works
- Receives a chat completion request with conversation history
- Calls archgw's LLM gateway with gpt-4o-mini to rewrite the last user query
- Returns the rewritten query as the assistant response
Setup and Running
- Start archgw with open-web and jaeger ui:
docker compose up -d - Interact with rag agent using open-web-ui: Navigate to http://localhost:8080 and try some sample queries like "What is the guaranteed uptime percentage for TechCorp's cloud services?"
Note: open-web-ui takes some time to load.
Sample screenshot of UI,