mirror of
https://github.com/katanemo/plano.git
synced 2026-04-25 08:46:24 +02:00
Function calling
This demo shows how to use the intelligent prompt gateway for function calling. It assumes ollama is running natively. If you want to run ollama inside docker, update the ollama endpoint in the docker-compose file.
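To make the term concrete: in function calling, a model turns a prompt into a structured call that the application then executes. The sketch below is only an illustration. The `get_weather` function, the `dispatch` helper, and the JSON shape are hypothetical and are not taken from this demo's code or from the gateway's actual wire format.

```python
import json

# Hypothetical tool; a real application would query a weather service here.
def get_weather(city: str, days: int = 1) -> dict:
    return {"city": city, "days": days, "forecast": ["sunny"] * days}

# Registry of functions the application is willing to expose to the model.
TOOLS = {"get_weather": get_weather}

def dispatch(model_output: str) -> dict:
    """Parse a JSON tool call and invoke the matching local function."""
    call = json.loads(model_output)
    name = call["name"]
    if name not in TOOLS:
        raise ValueError(f"unknown function: {name}")
    return TOOLS[name](**call.get("arguments", {}))

# Assumed shape of what a function-calling model might emit for
# a prompt like "how is the weather in Seattle".
example_output = '{"name": "get_weather", "arguments": {"city": "Seattle", "days": 2}}'
result = dispatch(example_output)
print(result)
```

The registry keeps the set of callable functions explicit, so a malformed or unexpected call fails with an error instead of running arbitrary code.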
Starting the demo
- Ensure that submodule is up to date
  git submodule sync --recursive
- Create .env file and set OpenAI key using env var OPENAI_API_KEY
- Start services
  docker compose up
- Download Bolt-FC model. This demo assumes we have downloaded Arch-Function-Calling-1.5B:Q4_K_M to local folder.
- If running ollama natively run
  ollama serve
- Create model file in ollama repository
  ollama create Arch-Function-Calling-1.5B:Q4_K_M -f Arch-Function-Calling-1.5B-Q4_K_M.model_file
- Navigate to http://localhost:18080/
- You can type in queries like "how is the weather in Seattle"
- You can also ask follow up questions like "show me sunny days"
- To see metrics navigate to "http://localhost:3000/" (use admin/grafana for login)
- Open up the dashboard named "Intelligent Gateway Overview"
- On this dashboard you can see request latency and number of requests
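A missing or empty OPENAI_API_KEY is an easy mistake in the steps above. The following is a minimal sketch of a pre-flight check, assuming the .env file uses plain KEY=VALUE lines. The helper names `parse_env` and `has_openai_key` are hypothetical, and the parser is deliberately simpler than the one docker compose uses.

```python
from pathlib import Path

def parse_env(text: str) -> dict:
    """Parse simple KEY=VALUE lines, ignoring blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values

def has_openai_key(text: str) -> bool:
    """Return True if the text sets a non-empty OPENAI_API_KEY."""
    return bool(parse_env(text).get("OPENAI_API_KEY"))

if __name__ == "__main__":
    env_path = Path(".env")
    text = env_path.read_text() if env_path.is_file() else ""
    print("OPENAI_API_KEY set:", has_openai_key(text))
```

Run it from the directory that holds the .env file before starting the services.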
Observability
Arch gateway publishes a stats endpoint at http://localhost:19901/stats. This demo uses prometheus to pull stats from arch and grafana to visualize them in a dashboard. To see the grafana dashboard, follow the instructions below:
- Start grafana and prometheus using following command
  docker compose --profile monitoring up
- Navigate to http://localhost:3000/ to open grafana UI (use admin/grafana as credentials)
- From grafana left nav click on dashboards and select "Intelligent Gateway Overview" to view arch gateway stats
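The stats endpoint can also be read directly, without grafana. The sketch below assumes the endpoint returns plain-text `name: value` lines. That format is an assumption and is not stated in this README. The `parse_stats` and `fetch_stats` helpers and the stat names in the sample are made up for illustration.

```python
from urllib.request import urlopen

def parse_stats(text: str) -> dict:
    """Turn 'name: value' lines into a dict, keeping integer values only."""
    stats = {}
    for line in text.splitlines():
        name, sep, value = line.partition(": ")
        if not sep:
            continue
        try:
            stats[name.strip()] = int(value.strip())
        except ValueError:
            # Non-integer entries (for example histogram summaries) are skipped.
            continue
    return stats

def fetch_stats(url: str = "http://localhost:19901/stats") -> dict:
    """Fetch and parse the stats endpoint; requires the demo to be running."""
    with urlopen(url, timeout=5) as resp:
        return parse_stats(resp.read().decode("utf-8"))

# Made-up sample in the assumed format; these are not real arch stat names.
sample = "example.requests_total: 12\nexample.active: 3\nexample.latency: P50(1.0,2.0)\n"
print(parse_stats(sample))
```

With the services running, `fetch_stats()` returns the same kind of dictionary built from the live endpoint.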
Here is a sample interaction: