openai compatible endpoints not used #128
Labels
No labels
bug
dependencies
duplicate
enhancement
help wanted
invalid
question
renovate: stop-updating
wontfix
bug
duplicate
enhancement
help wanted
invalid
question
renovate: stop-updating
security
wontfix
No milestone
No project
No assignees
2 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: nomyo-ai/nomyo-router#128
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Hi alpha nerd, I am not entirely sure if I get this right, but I think that openai compatible endpoints won't be used in combination with ollama endpoints that promote the same model.
I am currently adding a vllm server to the nomyo config. If the endpoint is used solely, everything works just fine.
However, if nomyo is trying to check whether a model is already loaded in
_fetch_loaded_models_internalopenai compatible endpoints will be filtered out. I guess an exception for those should fix this, so for openai endpoints models are always loaded.Thank you for considering this ;)
In addition to this nomyo also sometimes tries to use models that are not promoted by the endpoint:
[generate_proxy] upstream error from (http://192.168.0.1:8000/v1, qwen2.5-coder:1.5b-base) status=404 type=NotFoundError: Error code: 404 - {'error': {'message': 'The modelqwen2.5-coder:1.5b-basedoes not exist.', 'type': 'NotFoundError', 'param': 'model', 'code': 404}}@JTHesse wrote in #128 (comment):
This is a clear bug in the routing engine and might have been introduced during refactoring.
Thank you for pointing this out. A fix is on the way.
docker pull bitfreedom.net/nomyo-ai/nomyo-router:latestcontains this fix