Commit graph

86 commits

Author SHA1 Message Date
cotran
188b0a8a39 modify changes 2024-12-10 18:04:50 -08:00
cotran
0ec6762cda fix 2024-12-09 15:40:50 -08:00
cotran
21fc0b5624 add log 2024-12-09 15:37:17 -08:00
cotran
9998c8b0f3 add log 2024-12-09 15:33:59 -08:00
cotran
5439e07e0e new thd 2024-12-09 15:22:02 -08:00
cotran
18d76d0427 latest threshold 2024-12-09 15:02:53 -08:00
cotran
95740ca051 Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main 2024-12-09 14:44:17 -08:00
cotran
8d3cf21f79 change thd 2024-12-09 14:44:14 -08:00
Shuguang Chen
8871d3f751 Collect debugging log 2024-12-09 14:15:10 -08:00
cotran
f13947732c add more test 2024-12-09 13:48:30 -08:00
cotran
63cc2ef3f3 fix bug 2024-12-09 13:33:10 -08:00
CTran
9dd7f15eab
Cotran/integration (#341)
* add hallucination

* add test and fix bug
2024-12-09 13:30:52 -08:00
Adil Hafeez
8f1b21124b fix model_server cli and update version to 0.1.7 2024-12-09 12:11:17 -08:00
Adil Hafeez
1299afd05c add overrides module to model server 2024-12-09 11:52:57 -08:00
Adil Hafeez
0e22c3e7de Merge branch 'main' into shuguang/main 2024-12-09 11:43:42 -08:00
Shuguang Chen
1a3d33409b Fix bugs 2024-12-09 11:19:09 -08:00
Shuguang Chen
e0d4ee7357 Update ArchFunctionHandler 2024-12-08 16:41:45 -08:00
Shuguang Chen
95e167c2f6 Refactor model configs 2024-12-08 16:00:25 -08:00
Shuguang Chen
320f4612b8 Update cli and related utils 2024-12-08 11:16:34 -08:00
Shuguang Chen
b4f4695f16 Reorganize model_server 2024-12-08 09:21:53 -08:00
CTran
a40cdc7b75
Cotran/intent (#339)
* add else

* integrate hallucination

* remove test
2024-12-08 08:56:35 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 (#338) 2024-12-06 15:48:07 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples (#330) 2024-12-06 14:37:33 -08:00
Shuguang Chen
afec644789 Update the logic of intent detection 2024-12-06 14:14:44 -08:00
Shuguang Chen
79eafc0241 Update ArchBaseHandler 2024-12-06 14:07:01 -08:00
Shuguang Chen
2fd8a5a06d Update Arch-Guard and corresponding e2e test 2024-12-06 13:41:18 -08:00
Shuguang Chen
818ec0a9db Fix the issue of Arch-Intent 2024-12-06 11:53:46 -08:00
Shuguang Chen
4fcfd83639 Refine model_server 2024-12-05 15:19:41 -08:00
Shuguang Chen
a5bd005411 Fix a bug in function_calling.py 2024-12-05 12:06:56 -08:00
Shuguang Chen
09f7e1e604 Update guardrail_handler and its associated tests 2024-12-05 11:30:58 -08:00
Shuguang Chen
b686cf8b87 refactor model_handler 2024-12-05 11:00:22 -08:00
Shuguang Chen
afe1410b37 Init update on model_server 2024-12-04 16:41:30 -08:00
CTran
cadd3cdaf9
hallucination with log probs (#281)
* first init

* fix

* fix test

* new implementation

* fix bug

* fix bug

* fix bug

* address issue

* address issues

* address comments

* fix test

* fix

* move constants

* remove consts
2024-11-27 15:17:02 -08:00
Adil Hafeez
704b928d61
release 0.1.5 (#307) 2024-11-26 13:28:52 -08:00
Adil Hafeez
0ff3d43008
remove dependency on docker-compose when starting up archgw (#305) 2024-11-26 13:13:02 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards (#303) 2024-11-25 17:16:35 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 (#280)
* release 0.1.3

* update ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 (#266) 2024-11-12 23:56:33 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13

* use python:3.12
2024-11-12 10:35:14 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing (#229) 2024-11-07 22:11:00 -06:00
CTran
fb67788be0
add prefill and test (#236)
* add prefill and test

* fix stream

* fix

* feedback

* address comments

* update

* add e2e test

* fix e2e test

* update fix

* fix

* address cmt

* address cmt
2024-11-07 11:59:29 -08:00
Adil Hafeez
8c6ad87c1c
release 0.1.0 (#239)
* set version to 0.1.0

* update readme
2024-10-30 18:56:49 -07:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests (#230) 2024-10-30 17:54:51 -07:00
Adil Hafeez
60299244b9
Improve Gradio UI and fix arch_state bug (#227) 2024-10-29 11:27:13 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fixes few issues (see description) (#202) 2024-10-28 17:05:06 -07:00
CTran
25dddcbfd9
fix model server stop process (#217)
* fix model server stop process

* replace

* replace

* add test

* add multiple pids test

* add check install for linux

* reformat
2024-10-24 19:21:47 -07:00
Salman Paracha
708fa15a9b
HR agent demo (#206)
* committing my hr_agent branch

* updating the HR agent config

* pushing to remote

* fix hr agent

* committing to merge with main

* updating to merge from main

* updating the demo and model-server-tests to pull from poetry

* updating the poetry.lock files

* updating based on feedback

* updated system prompt for hr_agent

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
Adil Hafeez
dd1c7be706
Pass tool call and app function response back in metadata (#193) 2024-10-18 13:25:39 -07:00
Adil Hafeez
28421353fd
Update vscode workspace (#199)
- add recommended extensions
- set python interpreter path for all python projects to be venv/bin/python
- update project structure in workspace
- rename project file from gatewa -> archgw
2024-10-18 12:57:58 -07:00
Adil Hafeez
6cd05572c4
update lock file (#192)
```
Installing dependencies from lock file

pyproject.toml changed significantly since poetry.lock was last generated. Run `poetry lock [--no-update]` to fix the lock file.
Error installing model server dependencies: Command '['poetry', 'install', '--no-cache']' returned non-zero exit status 1.
```
2024-10-17 10:42:15 -07:00