Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway ( #328 )
2024-12-20 13:25:01 -08:00
Co Tran
80d2229053
Cotran/onnx conversion ( #145 )
* onnx replacement
* onnx conversion for nli and embedding model
* fix naming
* fix naming
* fix naming
* pin version
2024-10-08 14:37:48 -07:00
Salman Paracha
b60ceb9168
model server build ( #127 )
* first commit to have model_server not be dependent on Docker
* making changes to fix the docker-compose file for archgw to set DNS_V4 and minor fixes with the build
* additional fixes for model server to be separated out in the build
* additional fixes for model server to be separated out in the build
* fix to get model_server to be built as a separate python process. TODO: fix the embeddings logs after cli completes
* fixing init to pull tempfile using the tempfile python package
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 18:21:43 -07:00
Co Tran
79b1c5415f
[Kan-103] add support toxic/jailbreak model ( #49 )
* add toxic/jailbreak model
* fix path loading model
* fix syntax
* fix bug, lint, format
* fix bug
* formatting
* add parallel + chunking
* fix bug
* working version
* fix onnx name error
* device
* fix jailbreak config
* fix syntax error
* format
* add requirement + cli download for dockerfile
* add task
* add skeleton change for envoy filter for prompt guard
* fix hardware config
* fix bug
* add config changes
* add gitignore
* merge main
* integrate arch-guard with filter
* add hardware config
* nothing
* add hardware config feature
* fix requirement
* fix chat ui
* fix onnx
* fix lint
* remove non intel cpu
* remove onnx
* working version
* modify docker
* fix guard time
* add nvidia support
* remove nvidia
* add gpu
* add gpu
* add gpu support
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* fix docker file
* fix int test
* correct gpu docker
* upgrade python 10
* fix logits to be gpu compatible
* default to cpu dockerfile
* resolve comments
* fix lint + unused parameters
* fix
* remove eetq install for cpu
* remove deploy gpu
---------
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-23 12:07:31 -07:00