plano/model_server/app
Co Tran 8b5db45507
Fix gpu dependency and only leverage onnx when GPU is available (#157)
* replacing appending instead of write

* fix eetq dependency

* gpu guard required eetq

* fix bug when gpu is available

* fix for gpu device

* reverse

* fix

* replace gpu -> cuda
2024-10-09 11:42:05 -07:00
arch_fc ensure that we can call the new api.fc.archgw.com url, logging fixes … (#142) 2024-10-08 12:40:24 -07:00
__init__.py lint + formating with black (#158) 2024-10-09 11:25:07 -07:00
guard_model_config.yaml Fix gpu dependency and only leverage onnx when GPU is available (#157) 2024-10-09 11:42:05 -07:00
load_models.py Fix gpu dependency and only leverage onnx when GPU is available (#157) 2024-10-09 11:42:05 -07:00
main.py lint + formating with black (#158) 2024-10-09 11:25:07 -07:00
network_data_generator.py [Kan-103] add support toxic/jailbreak model (#49) 2024-09-23 12:07:31 -07:00
openai_params.yaml model server build (#127) 2024-10-06 18:21:43 -07:00
test.ipynb Rename bolt_config to arch_config (#100) 2024-09-30 18:47:35 -07:00
utils.py lint + formating with black (#158) 2024-10-09 11:25:07 -07:00