plano/model_server/app
2024-11-26 13:13:02 -08:00
Name              Last updated                Last commit
commons           2024-11-26 13:13:02 -08:00  remove dependency on docker-compose when starting up archgw (#305)
function_calling  2024-11-07 11:59:29 -08:00  add prefill and test (#236)
prompt_guard      2024-10-16 16:58:10 -07:00  Refactor model server hardware config + add unit tests to load/request to the server (#189)
tests             2024-11-12 23:56:33 -08:00  release 0.1.2 (#266)
__init__.py       2024-10-10 17:44:41 -07:00  Improve cli (#179)
cli.py            2024-10-30 17:54:51 -07:00  Use large github action machine to run e2e tests (#230)
loader.py         2024-10-16 16:58:10 -07:00  Refactor model server hardware config + add unit tests to load/request to the server (#189)
main.py           2024-11-07 22:11:00 -06:00  add support for jaeger tracing (#229)