Commit graph

102 commits

Author SHA1 Message Date
cotran
3cbf88fc1e fix validating bug 2024-12-11 14:36:53 -08:00
cotran
29a5234bdd Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main 2024-12-11 14:22:41 -08:00
cotran
c5a79bdbf7 fixbug 2024-12-11 14:22:34 -08:00
Adil Hafeez
f4475496e8 retry model_server tests 2024-12-11 14:16:52 -08:00
cotran
991e336179 add type check and length checl 2024-12-11 13:35:20 -08:00
cotran
8cfef7bcd4 Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main 2024-12-11 13:33:45 -08:00
cotran
770ebbdd4e add type check and length checl 2024-12-11 13:33:38 -08:00
Adil Hafeez
0ef9d62565 add support for format 2024-12-11 12:06:59 -08:00
Adil Hafeez
5bf9a80283 remove nli model 2024-12-10 21:03:02 -08:00
cotran
3e4b04099c fix test 2024-12-10 19:31:35 -08:00
cotran
b2ef3f7266 fix test 2024-12-10 19:31:02 -08:00
Adil Hafeez
e5b4fd00ed fix model_server test bootup 2024-12-10 19:02:20 -08:00
cotran
2405fb36e3 Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main 2024-12-10 18:04:56 -08:00
cotran
188b0a8a39 modify changes 2024-12-10 18:04:50 -08:00
Adil Hafeez
60dfb18018 send latency numbers from model_server as metadata 2024-12-10 16:01:17 -08:00
Shuguang Chen
791ce0a7ed Merge branch 'shuguang/main' of https://github.com/katanemo/archgw into shuguang/main 2024-12-09 15:41:06 -08:00
Shuguang Chen
3859e8eb43 Fix bugs 2024-12-09 15:40:57 -08:00
cotran
0ec6762cda fix 2024-12-09 15:40:50 -08:00
cotran
21fc0b5624 add log 2024-12-09 15:37:17 -08:00
cotran
9998c8b0f3 add log 2024-12-09 15:33:59 -08:00
cotran
5439e07e0e new thd 2024-12-09 15:22:02 -08:00
cotran
18d76d0427 latest threshold 2024-12-09 15:02:53 -08:00
cotran
95740ca051 Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main 2024-12-09 14:44:17 -08:00
cotran
8d3cf21f79 change thd 2024-12-09 14:44:14 -08:00
Shuguang Chen
8871d3f751 Collect debugging log 2024-12-09 14:15:10 -08:00
cotran
f13947732c add more test 2024-12-09 13:48:30 -08:00
cotran
63cc2ef3f3 fix bug 2024-12-09 13:33:10 -08:00
CTran
9dd7f15eab
Cotran/integration (#341)
* add hallucination

* add test and fix bug
2024-12-09 13:30:52 -08:00
Adil Hafeez
8f1b21124b fix model_server cli and update version to 0.1.7 2024-12-09 12:11:17 -08:00
Adil Hafeez
1299afd05c add overrides module to model server 2024-12-09 11:52:57 -08:00
Adil Hafeez
0e22c3e7de Merge branch 'main' into shuguang/main 2024-12-09 11:43:42 -08:00
Shuguang Chen
1a3d33409b Fix bugs 2024-12-09 11:19:09 -08:00
Shuguang Chen
e0d4ee7357 Update ArchFunctionHandler 2024-12-08 16:41:45 -08:00
Shuguang Chen
95e167c2f6 Refacotr model configs 2024-12-08 16:00:25 -08:00
Shuguang Chen
320f4612b8 Update cli and related utils 2024-12-08 11:16:34 -08:00
Shuguang Chen
b4f4695f16 Reorganize model_server 2024-12-08 09:21:53 -08:00
CTran
a40cdc7b75
Cotran/intent (#339)
* add else

* integrate hallucination

* remove test
2024-12-08 08:56:35 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 (#338) 2024-12-06 15:48:07 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples (#330) 2024-12-06 14:37:33 -08:00
Shuguang Chen
afec644789 Update the logic of intent detection 2024-12-06 14:14:44 -08:00
Shuguang Chen
79eafc0241 Update ArchBaseHandler 2024-12-06 14:07:01 -08:00
Shuguang Chen
2fd8a5a06d Update Arch-Guard and corresponding e2e test 2024-12-06 13:41:18 -08:00
Shuguang Chen
818ec0a9db Fix the issue of Arch-Intent 2024-12-06 11:53:46 -08:00
Shuguang Chen
4fcfd83639 Refine model_server 2024-12-05 15:19:41 -08:00
Shuguang Chen
a5bd005411 Fix a bug in function_calling.py 2024-12-05 12:06:56 -08:00
Shuguang Chen
09f7e1e604 Update guardrail_handler and its associated tests 2024-12-05 11:30:58 -08:00
Shuguang Chen
b686cf8b87 refactor model_handler 2024-12-05 11:00:22 -08:00
Shuguang Chen
afe1410b37 Init update on model_server 2024-12-04 16:41:30 -08:00
CTran
cadd3cdaf9
hallucination with log probs (#281)
* first init

* fix

* fix test

* new implemenetation

* fix bug

* fix bug

* fix bug

* address issue

* address issues

* address comments

* fix test

* fix

* move constatns

* remove consts
2024-11-27 15:17:02 -08:00
Adil Hafeez
704b928d61
release 0.1.5 (#307) 2024-11-26 13:28:52 -08:00