mirror of https://github.com/katanemo/plano.git synced 2026-06-17 15:25:17 +02:00

Adil Hafeez 6ce53c25d3 add rag agent demo		2025-10-14 14:12:13 -07:00
..
src/rag_agent	add rag agent demo	2025-10-14 14:12:13 -07:00
arch_config.yaml	add rag agent demo	2025-10-14 14:12:13 -07:00
docker-compose.yaml	add rag agent demo	2025-10-14 14:12:13 -07:00
pyproject.toml	add rag agent demo	2025-10-14 14:12:13 -07:00
README.md	add rag agent demo	2025-10-14 14:12:13 -07:00
sample_queries.md	add rag agent demo	2025-10-14 14:12:13 -07:00
start_agents.sh	add rag agent demo	2025-10-14 14:12:13 -07:00
test.rest	add rag agent demo	2025-10-14 14:12:13 -07:00
uv.lock	add rag agent demo	2025-10-14 14:12:13 -07:00

RAG Agent Query Parser

A FastAPI service that rewrites user queries using archgw and gpt-4o-mini for better retrieval accuracy.

How it Works

Start the query parser service:

uv run python -m rag_agent.query_parser

# archgw LLM Gateway base URL (default: http://localhost:12000/v1)
export LLM_GATEWAY_ENDPOINT="http://localhost:12000/v1"