mirror of
https://github.com/katanemo/plano.git
synced 2026-06-17 15:25:17 +02:00
| .. | ||
| src/rag_agent | ||
| arch_config.yaml | ||
| docker-compose.yaml | ||
| pyproject.toml | ||
| README.md | ||
| test.hurl | ||
| test.rest | ||
| uv.lock | ||
RAG Agent Query Parser
A FastAPI service that rewrites user queries using archgw and gpt-4o-mini for better retrieval accuracy.
How it Works
- Receives a chat completion request with conversation history
- Calls archgw's LLM gateway with gpt-4o-mini to rewrite the last user query
- Returns the rewritten query as the assistant response
Setup and Running
-
Start archgw:
archgw up --foreground -
Start the query parser service:
uv run python -m rag_agent.query_parser
Configuration
# archgw LLM Gateway base URL (default: http://localhost:12000/v1)
export LLM_GATEWAY_ENDPOINT="http://localhost:12000/v1"