plano/demos/use_cases/rag_agent
2025-10-22 16:53:25 -07:00
..
src/rag_agent add rag agent demo 2025-10-14 14:12:13 -07:00
.dockerignore add rag agent demo 2025-10-22 16:47:20 -07:00
arch_config.yaml add rag agent demo 2025-10-22 16:47:20 -07:00
docker-compose.yaml add rag agent demo 2025-10-22 16:47:20 -07:00
Dockerfile add rag agent demo 2025-10-22 16:47:20 -07:00
image.png add screenshot 2025-10-22 16:53:25 -07:00
pyproject.toml add rag agent demo 2025-10-14 14:12:13 -07:00
README.md add screenshot 2025-10-22 16:53:25 -07:00
sample_queries.md add rag agent demo 2025-10-14 14:12:13 -07:00
start_agents.sh add rag agent demo 2025-10-14 14:12:13 -07:00
test.rest add rag agent demo 2025-10-22 16:47:20 -07:00
uv.lock add rag agent demo 2025-10-14 14:12:13 -07:00

RAG Agent Query Parser

A FastAPI service that rewrites user queries using archgw and gpt-4o-mini for better retrieval accuracy.

How it Works

  1. Receives a chat completion request with conversation history
  2. Calls archgw's LLM gateway with gpt-4o-mini to rewrite the last user query
  3. Returns the rewritten query as the assistant response

Setup and Running

  1. Start archgw with open-web and jaeger ui:
    docker compose up -d
    
  2. Interact with rag agent using open-web-ui: Navigate to http://localhost:8080 and try some sample queries like "What is the guaranteed uptime percentage for TechCorp's cloud services?"

Note: open-web-ui takes some time to load.

Sample screenshot of UI,