plano/model_server/app
CTran fb67788be0
add prefill and test (#236)
* add prefill and test

* fix stream

* fix

* feedback

* address comments

* update

* add e2e test

* fix e2e test

* update fix

* fix

* address cmt

* address cmt
2024-11-07 11:59:29 -08:00
..
commons add prefill and test (#236) 2024-11-07 11:59:29 -08:00
function_calling add prefill and test (#236) 2024-11-07 11:59:29 -08:00
prompt_guard Refactor model server hardware config + add unit tests to load/request to the server (#189) 2024-10-16 16:58:10 -07:00
tests add prefill and test (#236) 2024-11-07 11:59:29 -08:00
__init__.py Improve cli (#179) 2024-10-10 17:44:41 -07:00
cli.py Use large github action machine to run e2e tests (#230) 2024-10-30 17:54:51 -07:00
loader.py Refactor model server hardware config + add unit tests to load/request to the server (#189) 2024-10-16 16:58:10 -07:00
main.py add prefill and test (#236) 2024-11-07 11:59:29 -08:00