mirror of
https://github.com/katanemo/plano.git
synced 2026-05-18 13:45:15 +02:00
rename arch provider to plano, use llm_routing_model and agent_orchestration_model
This commit is contained in:
parent
680dee60a0
commit
6f8bf96d38
16 changed files with 37 additions and 50 deletions
|
|
@ -254,7 +254,7 @@ Using Ollama (recommended for local development)
|
|||
.. code-block:: yaml
|
||||
|
||||
overrides:
|
||||
router_model: plano/hf.co/katanemo/Arch-Router-1.5B.gguf:Q4_K_M
|
||||
llm_routing_model: plano/hf.co/katanemo/Arch-Router-1.5B.gguf:Q4_K_M
|
||||
|
||||
model_providers:
|
||||
- model: plano/hf.co/katanemo/Arch-Router-1.5B.gguf:Q4_K_M
|
||||
|
|
@ -323,7 +323,7 @@ vLLM provides higher throughput and GPU optimizations suitable for production de
|
|||
.. code-block:: yaml
|
||||
|
||||
overrides:
|
||||
router_model: plano/Arch-Router
|
||||
llm_routing_model: plano/Arch-Router
|
||||
|
||||
model_providers:
|
||||
- model: plano/Arch-Router
|
||||
|
|
|
|||
|
|
@ -404,10 +404,11 @@ Using vLLM
|
|||
.. code-block:: yaml
|
||||
|
||||
overrides:
|
||||
orchestrator_model: plano/katanemo/Plano-Orchestrator-4B
|
||||
agent_orchestration_model: plano/katanemo/Plano-Orchestrator-4B
|
||||
|
||||
model_providers:
|
||||
- model: plano/katanemo/Plano-Orchestrator-4B
|
||||
- model: katanemo/Plano-Orchestrator-4B
|
||||
provider_interface: plano
|
||||
base_url: http://<your-server-ip>:8000
|
||||
|
||||
5. **Verify the server is running**
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue