mirror of
https://github.com/katanemo/plano.git
synced 2026-04-28 02:23:56 +02:00
* adding function_calling functionality via rust * fixed rendered YAML file * removed model_server from envoy.template and forwarding traffic to bright_staff * fixed bugs in function_calling.rs that were breaking tests. All good now * updating e2e test to clean up disk usage * removing Arch* models to be used as a default model if one is not specified * if the user sets arch-function base_url we should honor it * fixing demos as we needed to pin to a particular version of huggingface_hub else the chatbot ui wouldn't build * adding a constant for Arch-Function model name * fixing some edge cases with calls made to Arch-Function * fixed JSON parsing issues in function_calling.rs * fixed bug where the raw response from Arch-Function was re-encoded * removed debug from supervisord.conf * commenting out disk cleanup * adding back disk space --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local> |
||
|---|---|---|
| .. | ||
| .vscode | ||
| common.py | ||
| common_scripts.sh | ||
| docker-compose.yaml | ||
| poetry.lock | ||
| pyproject.toml | ||
| README.md | ||
| response.hex | ||
| response_with_tools.hex | ||
| run_e2e_tests.sh | ||
| test_model_alias_routing.py | ||
| test_prompt_gateway.py | ||
e2e tests
e2e tests for arch llm gateway and prompt gateway
To be able to run e2e tests successfully run_e2e_script prepares environment in following way,
- build and start weather_forecast demo (using docker compose)
- build, install and start model server async (using poetry)
- build and start arch gateway (using docker compose)
- wait for model server to be ready
- wait for arch gateway to be ready
- start e2e tests (using poetry)
- runs llm gateway tests for llm routing
- runs prompt gateway tests to test function calling, parameter gathering and summarization
- cleanup
- stops arch gateway
- stops model server
- stops weather_forecast demo
How to run
To run locally make sure that following requirements are met.
Requirements
- Python 3.10
- Poetry
- Docker
Running tests locally
sh run_e2e_test.sh