Commit graph

246 commits

Author SHA1 Message Date
Adil Hafeez
15869825e3
add messages in params when making api calls (#104) 2024-10-01 09:14:01 -07:00
Adil Hafeez
f4395d39f9
Fold function_resolver into model_server (#103) 2024-10-01 09:13:50 -07:00
José Ulises Niño Rivera
b0ce5eca93
Rename bolt_config to arch_config (#100)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-30 18:47:35 -07:00
Adil Hafeez
cc35eb0cd7
update config (#93) 2024-09-30 17:49:05 -07:00
Aayush
1e61452310
changes prometheus config target to arch so that data collection works (#98) 2024-09-30 14:35:21 -07:00
Adil Hafeez
bb746e237a
add support for 3b model (#96) 2024-09-30 09:54:58 -07:00
Adil Hafeez
4d7c07a63c update ctx size to 4k 2024-09-29 17:13:05 -07:00
Adil Hafeez
ea86f73605
rename envoyfilter => arch (#91)
* rename envoyfilter => arch

* fix more files

* more fixes

* more renames
2024-09-27 16:41:39 -07:00
Adil Hafeez
8a4e11077c update arch-fc parameters 2024-09-27 13:34:10 -07:00
Adil Hafeez
75cf5e5304
Add supported parameter type, validation and tests (#88)
* Add supported parameter type and validation

* make the tools format more compliant with openai

* more updates

* fix more

* fix unit test
2024-09-27 13:33:05 -07:00
Adil Hafeez
774c389951
add bolt support (#90)
* add support for bolt

* improve logging

* add support for bolt-fc

* fix int tests
2024-09-26 17:47:01 -07:00
Adil Hafeez
e3a835e5d3
expose access logs from envoy (#89) 2024-09-26 16:03:48 -07:00
Adil Hafeez
3511798fa8
Integrate Arch-Function-Calling-1.5B model (#85)
* add arch support

* add missing file

* e2e tests

* delete old files and fix response

* fmt
2024-09-25 23:30:50 -07:00
José Ulises Niño Rivera
9ea6bb0d73
Revert "Revert "Add support for multiple LLM Providers (#60)"" (#83)
* Revert "Revert "Add support for multiple LLM Providers (#60)""

This reverts commit 43d6bc80e9.

* wip

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* Revert "wip"

This reverts commit 7c4dde5d1f.

* fix parameter name

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* force use openai

---------

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-25 23:15:17 -07:00
Sampreeth Sarma
7f0fcb372b
Added Float type to the function parameter values (#77) 2024-09-25 13:29:20 -07:00
Adil Hafeez
43d6bc80e9 Revert "Add support for multiple LLM Providers (#60)"
This reverts commit bd8206742a.
2024-09-25 08:15:22 -07:00
José Ulises Niño Rivera
bd8206742a
Add support for multiple LLM Providers (#60)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-24 17:13:55 -07:00
Adil Hafeez
685144bbd7
fix demos code (#76) 2024-09-24 14:34:22 -07:00
Adil Hafeez
16a8927889
add details on how to use grafana dashboards (#72) 2024-09-24 11:51:31 -07:00
Adil Hafeez
eff4cd9826
improve response handling (#71) 2024-09-23 22:56:35 -07:00
Co Tran
79b1c5415f
[Kan-103] add support toxic/jailbreak model (#49)
* add toxic/jailbreak model

* fix path loading model

* fix syntax

* fix bug,lint, format

* fix bug

* formatting

* add parallel + chunking

* fix bug

* working version

* fix onnnx name erorr

* device

* fix jailbreak config

* fix syntax error

* format

* add requirement + cli download for dockerfile

* add task

* add skeleton change for envoy filter for prompt guard

* fix hardware config

* fix bug

* add config changes

* add gitignore

* merge main

* integrate arch-guard with filter

* add hardware config

* nothing

* add hardware config feature

* fix requirement

* fix chat ui

* fix onnx

* fix lint

* remove non intel cpu

* remove onnx

* working version

* modify docker

* fix guard time

* add nvidia support

* remove nvidia

* add gpu

* add gpu

* add gpu support

* add gpu support for compose

* add gpu support for compose

* add gpu support for compose

* add gpu support for compose

* add gpu support for compose

* fix docker file

* fix int test

* correct gpu docker

* upgrad python 10

* fix logits to be gpu compatible

* default to cpu dockerfile

* resolve comments

* fix lint + unused parameters

* fix

* remove eetq install for cpu

* remove deploy gpu

---------

Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-23 12:07:31 -07:00
Adil Hafeez
31f26ef7ac
move demo functions out of model_server (#67)
* pending

* remove

* fix docker build
2024-09-20 14:38:10 -07:00
Adil Hafeez
97b47c2ab4
Include param default in parameters (#68)
* Include param default in parameters

* improve default param pr

* fix integration tests
2024-09-20 09:02:24 -07:00
Adil Hafeez
4438dc7979
remove embeddings config from config (#64)
* remove embeddings config from config

* remove embedding provider
2024-09-19 17:49:42 -07:00
Adil Hafeez
3215d81e58
fix webui url and dependencies (#66) 2024-09-19 17:48:50 -07:00
Sampreeth Sarma
ed6a9139e6
demos for network copilot and sql analyzer (#57)
* pulled from main branch after adding enums and made changes

* added sql_analyzer folder and built a demo for Employee stats function calling. "top_employees" and "aggregate_stats".

* sql_anayzer

* After addressing PR comments

* PR comments

* PR comments

* Addeed Network Analyzer FC Code

* Added network Analyzer code for diff timeframes

* Network Copilot and Employee Details demos are updated with their descriptions and resolved the PR comments

* Added 2nd function in network copilot

* Added 2nd function in network copilot

* Added 2nd function in network copilot

* Added 2nd function in network copilot

* Added 2nd function in network copilot
2024-09-19 11:40:31 -07:00
Adil Hafeez
a91fbdbf1c
Add ability to define clusters in config (#61) 2024-09-18 20:03:26 -07:00
Adil Hafeez
215d276acf
send all tools when sending request to arch-fc (#59) 2024-09-18 15:54:40 -07:00
Adil Hafeez
3135ba8eae
add ability to override default values from config (#58) 2024-09-17 22:37:58 -07:00
Adil Hafeez
118bff7c7e add open-webui 2024-09-17 16:08:13 -07:00
Adil Hafeez
63143fbd01
add bolt guard config (#55) 2024-09-17 10:59:50 -07:00
Adil Hafeez
b8841fb8f4 rename bolt-config => bolt_config 2024-09-17 08:56:47 -07:00
Adil Hafeez
060a0d665e
improve service names (#54)
- embedding-server => model_server
- public-types => public_types
- chatbot-ui => chatbot_ui
- function-calling => function_calling
2024-09-17 08:47:35 -07:00
Adil Hafeez
215f96e273
rename envoy-1 => bolt-1 (#53) 2024-09-16 23:20:26 -07:00
Adil Hafeez
a0d132f5b6 remove unused demos 2024-09-16 19:21:59 -07:00
Adil Hafeez
9e50957f22
Improve prompt target intent matching (#51) 2024-09-16 19:20:07 -07:00
José Ulises Niño Rivera
33a1a68218
Add small clarification to function calling demo (#48)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-11 11:37:53 -07:00
Adil Hafeez
de8c706a69 update readme 2024-09-10 14:42:58 -07:00
Adil Hafeez
b53e1944dd
Update README.md 2024-09-10 14:25:52 -07:00
Adil Hafeez
7b5203a2ce
Add function calling support using bolt-fc-1b (#35) 2024-09-10 14:24:46 -07:00
Adil Hafeez
445b1ea210
Add support for local llm (mistral 7b) (#31) 2024-08-06 23:40:06 -07:00
Adil Hafeez
6f6454069f
Improve demo experience (#28)
* Imrpove demo experience
- generate envoy.yaml using jinja2 template

* add todo

* add config file
2024-07-31 17:13:39 -07:00
Adil Hafeez
9774148c75
Remove top level container and start snake-case for config files (#27)
t
2024-07-31 14:05:52 -07:00
Adil Hafeez
09824c7236
Update README.md 2024-07-30 16:30:19 -07:00
Adil Hafeez
1f544d836c
Update README.md 2024-07-30 16:25:01 -07:00
Adil Hafeez
33f9dd22e6
Add workflow logic for weather forecast demo (#24) 2024-07-30 16:23:23 -07:00