mirror of
https://github.com/katanemo/plano.git
synced 2026-05-21 13:55:15 +02:00
update orchestrator model name
This commit is contained in:
parent
48bf83fa0d
commit
680dee60a0
3 changed files with 9 additions and 7 deletions
|
|
@ -141,7 +141,7 @@ vllm serve katanemo/Plano-Orchestrator-4B \
|
|||
--gpu-memory-utilization 0.3 \
|
||||
--tokenizer katanemo/Plano-Orchestrator-4B \
|
||||
--chat-template chat_template.jinja \
|
||||
--served-model-name Plano-Orchestrator \
|
||||
--served-model-name katanemo/Plano-Orchestrator-4B \
|
||||
--enable-prefix-caching
|
||||
```
|
||||
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue