Adil Hafeez
1b57a49c9d
add support for default target ( #111 )
* add support for default target
* add more fixes
2024-10-02 20:43:16 -07:00
Salman Paracha
8654d3d5c5
simplify developer getting started experience ( #102 )
* Fixed build. We now have a bare-bones version of the docker-compose file with only two services, archgw and archgw-model-server. Tested using the CLI
* some pre-commit fixes
* fixed cargo formatting issues
* fixed model server conflict changes
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-01 10:02:23 -07:00
Adil Hafeez
f4395d39f9
Fold function_resolver into model_server ( #103 )
2024-10-01 09:13:50 -07:00
Co Tran
79b1c5415f
[Kan-103] add support for toxic/jailbreak model ( #49 )
* add toxic/jailbreak model
* fix path loading model
* fix syntax
* fix bug, lint, format
* fix bug
* formatting
* add parallel + chunking
* fix bug
* working version
* fix onnx name error
* device
* fix jailbreak config
* fix syntax error
* format
* add requirement + cli download for dockerfile
* add task
* add skeleton change for envoy filter for prompt guard
* fix hardware config
* fix bug
* add config changes
* add gitignore
* merge main
* integrate arch-guard with filter
* add hardware config
* nothing
* add hardware config feature
* fix requirement
* fix chat ui
* fix onnx
* fix lint
* remove non intel cpu
* remove onnx
* working version
* modify docker
* fix guard time
* add nvidia support
* remove nvidia
* add gpu
* add gpu
* add gpu support
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* fix docker file
* fix int test
* correct gpu docker
* upgrade python 10
* fix logits to be gpu compatible
* default to cpu dockerfile
* resolve comments
* fix lint + unused parameters
* fix
* remove eetq install for cpu
* remove deploy gpu
---------
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-23 12:07:31 -07:00
Adil Hafeez
060a0d665e
improve service names ( #54 )
- embedding-server => model_server
- public-types => public_types
- chatbot-ui => chatbot_ui
- function-calling => function_calling
2024-09-17 08:47:35 -07:00