nomyo-router/README.md

29 lines
934 B
Markdown
Raw Normal View History

2025-08-26 18:40:56 +02:00
# NOMYO Router
2025-08-26 18:40:23 +02:00
is a transparent proxy for ![Ollama](https://github.com/ollama/ollama) with model deployment aware routing.
It runs between your frontend application and Ollama backend and is transparent for both, the front- and backend.
2025-08-26 19:41:45 +02:00
![arch](https://github.com/user-attachments/assets/1e0064ab-de54-4226-8a15-c0fcca64704c)
2025-08-26 18:40:23 +02:00
2025-08-26 18:42:32 +02:00
# Installation
2025-08-26 18:40:23 +02:00
Copy/Clone the repository, edit the config.yaml by adding your Ollama backend servers and the max_concurrent_connections setting per endpoint. This equals to your OLLAMA_NUM_PARALLEL config settings.
Run the NOMYO Router in a dedicated virtual environment, install the requirements and run with uvicorn:
```
python3 -m venv .venv/router
source .venv/router/bin/activate
pip3 install requirements.txt -r
```
finally you can
```
uvicorn router:app --host 127.0.0.1 --port 12434
```
# Routing
2025-08-26 18:42:32 +02:00
![routing](https://github.com/user-attachments/assets/ed05dfbb-fcc8-4ff2-b8ca-3cdce2660c9f)