A transparent (O)llama proxy with model deployment aware routing which auto-manages multiple (O)llama instances in a given network.
| config.yaml | ||
| LICENSE | ||
| README.md | ||
| requirements.txt | ||
| router.py | ||
Hello World!
| config.yaml | ||
| LICENSE | ||
| README.md | ||
| requirements.txt | ||
| router.py | ||
Hello World!