Salman Paracha
13dff3089d
Adil/fix salman docs ( #75 )
...
* added the first set of docs for our technical docs
* more docuemtnation changes
* added support for prompt processing and updated life of a request
* updated docs to including getting help sections and updated life of a request
* committing local changes for getting started guide, sample applications, and full reference spec for prompt-config
* updated configuration reference, added sample app skeleton, updated favico
* fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now
* Updated docs with use cases and example code, updated what is arch, and made minor changes throughout
* fixed imaged and minor doc fixes
* add sphinx_book_theme
* updated README, and make some minor fixes to documetnation
* fixed README.md
* fixed image width
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-24 13:54:17 -07:00
Adil Hafeez
2d31aeaa36
fix debug log
2024-09-24 13:45:30 -07:00
Co Tran
d5d79256b0
remove guard config json ( #70 )
...
* remove guard config json
* formating
2024-09-24 13:33:31 -07:00
Adil Hafeez
dd8c43a392
improve cluster not configured error ( #73 )
...
* improve cluster not configured error
* dont panic
* update format
* Merge branch 'main' into adil/fix_salman_docs
2024-09-24 13:24:26 -07:00
Adil Hafeez
16a8927889
add details on how to use grafana dashboards ( #72 )
2024-09-24 11:51:31 -07:00
Adil Hafeez
eff4cd9826
improve response handling ( #71 )
2024-09-23 22:56:35 -07:00
Co Tran
79b1c5415f
[Kan-103] add support toxic/jailbreak model ( #49 )
...
* add toxic/jailbreak model
* fix path loading model
* fix syntax
* fix bug,lint, format
* fix bug
* formatting
* add parallel + chunking
* fix bug
* working version
* fix onnnx name erorr
* device
* fix jailbreak config
* fix syntax error
* format
* add requirement + cli download for dockerfile
* add task
* add skeleton change for envoy filter for prompt guard
* fix hardware config
* fix bug
* add config changes
* add gitignore
* merge main
* integrate arch-guard with filter
* add hardware config
* nothing
* add hardware config feature
* fix requirement
* fix chat ui
* fix onnx
* fix lint
* remove non intel cpu
* remove onnx
* working version
* modify docker
* fix guard time
* add nvidia support
* remove nvidia
* add gpu
* add gpu
* add gpu support
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* add gpu support for compose
* fix docker file
* fix int test
* correct gpu docker
* upgrad python 10
* fix logits to be gpu compatible
* default to cpu dockerfile
* resolve comments
* fix lint + unused parameters
* fix
* remove eetq install for cpu
* remove deploy gpu
---------
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-23 12:07:31 -07:00
Salman Paracha
80c554ce1a
Docs branch - v1 of our tech docs ( #69 )
...
* added the first set of docs for our technical docs
* more docuemtnation changes
* added support for prompt processing and updated life of a request
* updated docs to including getting help sections and updated life of a request
* committing local changes for getting started guide, sample applications, and full reference spec for prompt-config
* updated configuration reference, added sample app skeleton, updated favico
* fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-20 17:08:42 -07:00
Adil Hafeez
233976a568
comment required param check
2024-09-20 15:49:49 -07:00
Adil Hafeez
31f26ef7ac
move demo functions out of model_server ( #67 )
...
* pending
* remove
* fix docker build
2024-09-20 14:38:10 -07:00
Adil Hafeez
ca5c9e4824
improve logging
2024-09-20 10:16:58 -07:00
Sampreeth Sarma
941869ad24
fix similarity bug ( #63 )
2024-09-20 09:49:28 -07:00
Adil Hafeez
97b47c2ab4
Include param default in parameters ( #68 )
...
* Include param default in parameters
* improve default param pr
* fix integration tests
2024-09-20 09:02:24 -07:00
Adil Hafeez
aa73a5aaae
Update Dockerfile
2024-09-19 20:06:38 -07:00
Adil Hafeez
4438dc7979
remove embeddings config from config ( #64 )
...
* remove embeddings config from config
* remove embedding provider
2024-09-19 17:49:42 -07:00
José Ulises Niño Rivera
7568507764
Delete unused envoy.yaml file ( #65 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-19 17:49:26 -07:00
Adil Hafeez
3215d81e58
fix webui url and dependencies ( #66 )
2024-09-19 17:48:50 -07:00
Adil Hafeez
2cd5ec5adf
use openai standard response in arch-fc and in gradio client ( #62 )
...
* use openai standard response in arch-fc and in gradio client
also fix code bug in usage
* fix int test
2024-09-19 12:19:14 -07:00
Sampreeth Sarma
ed6a9139e6
demos for network copilot and sql analyzer ( #57 )
...
* pulled from main branch after adding enums and made changes
* added sql_analyzer folder and built a demo for Employee stats function calling. "top_employees" and "aggregate_stats".
* sql_anayzer
* After addressing PR comments
* PR comments
* PR comments
* Addeed Network Analyzer FC Code
* Added network Analyzer code for diff timeframes
* Network Copilot and Employee Details demos are updated with their descriptions and resolved the PR comments
* Added 2nd function in network copilot
* Added 2nd function in network copilot
* Added 2nd function in network copilot
* Added 2nd function in network copilot
* Added 2nd function in network copilot
2024-09-19 11:40:31 -07:00
Adil Hafeez
a91fbdbf1c
Add ability to define clusters in config ( #61 )
2024-09-18 20:03:26 -07:00
Adil Hafeez
215d276acf
send all tools when sending request to arch-fc ( #59 )
2024-09-18 15:54:40 -07:00
Adil Hafeez
3135ba8eae
add ability to override default values from config ( #58 )
2024-09-17 22:37:58 -07:00
José Ulises Niño Rivera
9f3c845610
Add ability to stream a response ( #50 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-17 16:12:41 -07:00
Adil Hafeez
118bff7c7e
add open-webui
2024-09-17 16:08:13 -07:00
Adil Hafeez
8a265de7c6
Propagate enums from prompt-target to bolt-fc ( #56 )
2024-09-17 12:03:21 -07:00
Adil Hafeez
63143fbd01
add bolt guard config ( #55 )
2024-09-17 10:59:50 -07:00
Adil Hafeez
dd35e77a73
add streaming chatbot-ui
2024-09-17 09:01:45 -07:00
Adil Hafeez
b8841fb8f4
rename bolt-config => bolt_config
2024-09-17 08:56:47 -07:00
Adil Hafeez
060a0d665e
improve service names ( #54 )
...
- embedding-server => model_server
- public-types => public_types
- chatbot-ui => chatbot_ui
- function-calling => function_calling
2024-09-17 08:47:35 -07:00
Adil Hafeez
215f96e273
rename envoy-1 => bolt-1 ( #53 )
2024-09-16 23:20:26 -07:00
Adil Hafeez
a0d132f5b6
remove unused demos
2024-09-16 19:21:59 -07:00
Adil Hafeez
ed329bd419
improve docker build speed
2024-09-16 19:20:27 -07:00
Adil Hafeez
9e50957f22
Improve prompt target intent matching ( #51 )
2024-09-16 19:20:07 -07:00
José Ulises Niño Rivera
8565462ec4
Resetting instead of resuming ( #39 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-14 10:58:25 -07:00
José Ulises Niño Rivera
33a1a68218
Add small clarification to function calling demo ( #48 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-11 11:37:53 -07:00
José Ulises Niño Rivera
3c5ac65e2a
More idiomatic conversion ( #47 )
...
* More idiomatic conversion
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
* errors
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
---------
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-11 10:00:25 -07:00
Adil Hafeez
de8c706a69
update readme
2024-09-10 14:42:58 -07:00
Adil Hafeez
d95bac7650
Update README.md
2024-09-10 14:27:14 -07:00
Adil Hafeez
b53e1944dd
Update README.md
2024-09-10 14:25:52 -07:00
Adil Hafeez
7b5203a2ce
Add function calling support using bolt-fc-1b ( #35 )
2024-09-10 14:24:46 -07:00
José Ulises Niño Rivera
fdfad87347
Fix wasm load error ( #45 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-05 16:29:41 -07:00
José Ulises Niño Rivera
dd48689aee
Add Ratelimit on request tokens ( #44 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-04 17:28:12 -07:00
José Ulises Niño Rivera
d98517f240
Move shared types into their own crate ( #41 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-04 15:31:05 -07:00
José Ulises Niño Rivera
4dd1f3693e
Add the Ratelimit module ( #34 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-08-28 11:11:05 -07:00
José Ulises Niño Rivera
2a4cde950d
Add Ratelimit config ( #32 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-08-07 14:15:26 -07:00
Adil Hafeez
445b1ea210
Add support for local llm (mistral 7b) ( #31 )
2024-08-06 23:40:06 -07:00
José Ulises Niño Rivera
b49fc2f264
Simplify Entity struct ( #33 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-08-06 17:04:32 -07:00
Adil Hafeez
1fa5215753
Update README.md
2024-08-01 13:26:42 -07:00
Adil Hafeez
b01c9949bc
Update README.md
2024-08-01 13:25:42 -07:00
Adil Hafeez
0d02147912
Update README.md
2024-08-01 13:23:12 -07:00