apunkt/plano

mirror of https://github.com/katanemo/plano.git synced 2026-06-17 15:25:17 +02:00

Adil Hafeez 0c76f2e6e7 more changs		2025-09-18 11:08:51 -07:00
..
src/rag_agent	more changs	2025-09-18 11:08:51 -07:00
arch_config.yaml	more changs	2025-09-18 11:08:51 -07:00
docker-compose.yaml	more changes	2025-09-17 18:11:37 -07:00
pyproject.toml	changes to the agents	2025-09-11 13:25:09 -07:00
README.md	changes to the agents	2025-09-11 13:25:09 -07:00
test.rest	more changs	2025-09-18 11:08:51 -07:00
uv.lock	fix more	2025-09-11 15:55:25 -07:00

README.md

RAG Agent Query Parser

A FastAPI service that rewrites user queries using archgw and gpt-4o-mini for better retrieval accuracy.

How it Works

Receives a chat completion request with conversation history
Calls archgw's LLM gateway with gpt-4o-mini to rewrite the last user query
Returns the rewritten query as the assistant response

Setup and Running

Start archgw:
```
archgw up --foreground
```

Start the query parser service:

uv run python -m rag_agent.query_parser

Configuration

# archgw LLM Gateway base URL (default: http://localhost:12000/v1)
export LLM_GATEWAY_ENDPOINT="http://localhost:12000/v1"