Adil Hafeez
e81ca8d5cf
llm listener split ( #155 )
2024-10-09 15:47:32 -07:00
Adil Hafeez
285aa1419b
Split listener ( #141 )
2024-10-08 16:24:08 -07:00
Shuguang Chen
b30ad791f7
Fix errors and improve Doc ( #143 )
* Fix link issues and add icons
* Improve Doc
* fix test
* making minor modifications to Shuguang's doc changes
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-08 13:18:34 -07:00
Co Tran
b1fa127704
Hallucination integration with rust ( #122 )
2024-10-07 18:38:55 -07:00
Shuguang Chen
5c7567584d
Doc Update ( #129 )
* init update
* Update terminology.rst
* fix the branch to create an index.html, and fix pre-commit issues
* Doc update
* made several changes to the docs after Shuguang's revision
* fixing pre-commit issues
* fixed the reference file to the final prompt config file
* added google analytics
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 16:54:34 -07:00
Adil Hafeez
2a747df7c0
don't compute embeddings for names and other fixes see description ( #126 )
* serialize tools - 2
* fix int tests
* fix int test
* fix unit tests
2024-10-05 19:25:16 -07:00
José Ulises Niño Rivera
2a9b9486f3
Clean up Embeddings Store ( #121 )
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-04 19:33:52 -07:00
José Ulises Niño Rivera
8ea917aae5
Add the ability to use LLM Providers from the Arch config ( #112 )
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-03 10:57:01 -07:00
Adil Hafeez
15869825e3
add messages in params when making api calls ( #104 )
2024-10-01 09:14:01 -07:00
Adil Hafeez
2207021b9c
remove method type ( #101 )
2024-09-30 17:59:29 -07:00
Adil Hafeez
cc35eb0cd7
update config ( #93 )
2024-09-30 17:49:05 -07:00
Adil Hafeez
75cf5e5304
Add supported parameter type, validation and tests ( #88 )
* Add supported parameter type and validation
* make the tools format more compliant with openai
* more updates
* fix more
* fix unit test
2024-09-27 13:33:05 -07:00
Adil Hafeez
3511798fa8
Integrate Arch-Function-Calling-1.5B model ( #85 )
* add arch support
* add missing file
* e2e tests
* delete old files and fix response
* fmt
2024-09-25 23:30:50 -07:00
José Ulises Niño Rivera
370f3bb2c5
Fix bug in PromptGuard configuration ( #80 )
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-25 23:36:55 -05:00
Sampreeth Sarma
7f0fcb372b
Added Float type to the function parameter values ( #77 )
2024-09-25 13:29:20 -07:00
Adil Hafeez
87900beddd
Remove OMF ( #78 )
* Remove OMF
* remove submodule from github workflow
2024-09-24 15:18:20 -07:00
Co Tran
79b1c5415f
[Kan-103] add support for toxic/jailbreak model ( #49 )
* add toxic/jailbreak model
* fix path loading model
* fix syntax
* fix bug, lint, format
* fix bug
* formatting
* add parallel + chunking
* fix bug
* working version
* fix onnx name error
* device
* fix jailbreak config
* fix syntax error
* format
* add requirement + cli download for dockerfile
* add task
* add skeleton change for envoy filter for prompt guard
* fix hardware config
* fix bug
* add config changes
* add gitignore
* merge main
* integrate arch-guard with filter
* add hardware config
* nothing
* add hardware config feature
* fix requirement
* fix chat ui
* fix onnx
* fix lint
* remove non intel cpu
* remove onnx
* working version
* modify docker
* fix guard time
* add nvidia support
* remove nvidia
* add gpu
* add gpu
* add gpu support
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* fix docker file
* fix int test
* correct gpu docker
* upgrade python 10
* fix logits to be gpu compatible
* default to cpu dockerfile
* resolve comments
* fix lint + unused parameters
* fix
* remove eetq install for cpu
* remove deploy gpu
---------
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-23 12:07:31 -07:00
Adil Hafeez
97b47c2ab4
Include param default in parameters ( #68 )
* Include param default in parameters
* improve default param pr
* fix integration tests
2024-09-20 09:02:24 -07:00
Adil Hafeez
4438dc7979
remove embeddings config from config ( #64 )
* remove embeddings config from config
* remove embedding provider
2024-09-19 17:49:42 -07:00
Adil Hafeez
2cd5ec5adf
use openai standard response in arch-fc and in gradio client ( #62 )
* use openai standard response in arch-fc and in gradio client
also fix code bug in usage
* fix int test
2024-09-19 12:19:14 -07:00
Adil Hafeez
3135ba8eae
add ability to override default values from config ( #58 )
2024-09-17 22:37:58 -07:00
José Ulises Niño Rivera
9f3c845610
Add ability to stream a response ( #50 )
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-17 16:12:41 -07:00
Adil Hafeez
8a265de7c6
Propagate enums from prompt-target to bolt-fc ( #56 )
2024-09-17 12:03:21 -07:00
Adil Hafeez
63143fbd01
add bolt guard config ( #55 )
2024-09-17 10:59:50 -07:00
Adil Hafeez
060a0d665e
improve service names ( #54 )
- embedding-server => model_server
- public-types => public_types
- chatbot-ui => chatbot_ui
- function-calling => function_calling
2024-09-17 08:47:35 -07:00