plano/model_server/app
2024-12-06 14:37:33 -08:00
..
commons hallucination with log probs (#281) 2024-11-27 15:17:02 -08:00
function_calling hallucination with log probs (#281) 2024-11-27 15:17:02 -08:00
prompt_guard Refactor model server hardware config + add unit tests to load/request to the server (#189) 2024-10-16 16:58:10 -07:00
tests hallucination with log probs (#281) 2024-11-27 15:17:02 -08:00
__init__.py Improve cli (#179) 2024-10-10 17:44:41 -07:00
cli.py Use large github action machine to run e2e tests (#230) 2024-10-30 17:54:51 -07:00
loader.py Refactor model server hardware config + add unit tests to load/request to the server (#189) 2024-10-16 16:58:10 -07:00
main.py update getting started guide and add llm gateway and prompt gateway samples (#330) 2024-12-06 14:37:33 -08:00