mirror of
https://github.com/katanemo/plano.git
synced 2026-05-13 17:52:38 +02:00
| .. | ||
| grafana | ||
| prometheus | ||
| docker-compose.yaml | ||
| download_mistral_7b.sh | ||
| katanemo-config.yaml | ||
| README.md | ||
Weather forecasting
This demo shows how you can use intelligent prompt gateway to provide realtime weather forecast using Mistral LLM locally hosted using llama.cpp as LLM Hosting Service.
Startig the demo
- Ensure that submodule is up to date
git submodule sync --recursive - Download mistral 7b model using following shell command
sh download_mistral_7b.sh - Start services
docker compose up - Navigate to http://localhost:18080/
- You can type in queries like "how is the weather in Seattle"
- You can also ask follow up questions like "show me sunny days"
- To see metrics navigate to "http://localhost:3000/" (use admin/grafana for login)
- Open up dahsboard named "Intelligent Gateway Overview"
- On this dashboard you can see reuqest latency and number of requests