Shuguang Chen
791ce0a7ed
Merge branch 'shuguang/main' of https://github.com/katanemo/archgw into shuguang/main
2024-12-09 15:41:06 -08:00
Shuguang Chen
3859e8eb43
Fix bugs
2024-12-09 15:40:57 -08:00
cotran
0ec6762cda
fix
2024-12-09 15:40:50 -08:00
cotran
21fc0b5624
add log
2024-12-09 15:37:17 -08:00
cotran
9998c8b0f3
add log
2024-12-09 15:33:59 -08:00
cotran
5439e07e0e
new thd
2024-12-09 15:22:02 -08:00
cotran
18d76d0427
latest threshold
2024-12-09 15:02:53 -08:00
cotran
95740ca051
Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main
2024-12-09 14:44:17 -08:00
cotran
8d3cf21f79
change thd
2024-12-09 14:44:14 -08:00
Shuguang Chen
8871d3f751
Collect debugging log
2024-12-09 14:15:10 -08:00
cotran
f13947732c
add more test
2024-12-09 13:48:30 -08:00
cotran
63cc2ef3f3
fix bug
2024-12-09 13:33:10 -08:00
CTran
9dd7f15eab
Cotran/integration ( #341 )
* add hallucination
* add test and fix bug
2024-12-09 13:30:52 -08:00
Adil Hafeez
8f1b21124b
fix model_server cli and update version to 0.1.7
2024-12-09 12:11:17 -08:00
Adil Hafeez
1299afd05c
add overrides module to model server
2024-12-09 11:52:57 -08:00
Adil Hafeez
0e22c3e7de
Merge branch 'main' into shuguang/main
2024-12-09 11:43:42 -08:00
Shuguang Chen
1a3d33409b
Fix bugs
2024-12-09 11:19:09 -08:00
Shuguang Chen
e0d4ee7357
Update ArchFunctionHandler
2024-12-08 16:41:45 -08:00
Shuguang Chen
95e167c2f6
Refactor model configs
2024-12-08 16:00:25 -08:00
Shuguang Chen
320f4612b8
Update cli and related utils
2024-12-08 11:16:34 -08:00
Shuguang Chen
b4f4695f16
Reorganize model_server
2024-12-08 09:21:53 -08:00
CTran
a40cdc7b75
Cotran/intent ( #339 )
* add else
* integrate hallucination
* remove test
2024-12-08 08:56:35 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 ( #338 )
2024-12-06 15:48:07 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples ( #330 )
2024-12-06 14:37:33 -08:00
Shuguang Chen
afec644789
Update the logic of intent detection
2024-12-06 14:14:44 -08:00
Shuguang Chen
79eafc0241
Update ArchBaseHandler
2024-12-06 14:07:01 -08:00
Shuguang Chen
2fd8a5a06d
Update Arch-Guard and corresponding e2e test
2024-12-06 13:41:18 -08:00
Shuguang Chen
818ec0a9db
Fix the issue of Arch-Intent
2024-12-06 11:53:46 -08:00
Shuguang Chen
4fcfd83639
Refine model_server
2024-12-05 15:19:41 -08:00
Shuguang Chen
a5bd005411
Fix a bug in function_calling.py
2024-12-05 12:06:56 -08:00
Shuguang Chen
09f7e1e604
Update guardrail_handler and its associated tests
2024-12-05 11:30:58 -08:00
Shuguang Chen
b686cf8b87
refactor model_handler
2024-12-05 11:00:22 -08:00
Shuguang Chen
afe1410b37
Init update on model_server
2024-12-04 16:41:30 -08:00
CTran
cadd3cdaf9
hallucination with log probs ( #281 )
* first init
* fix
* fix test
* new implementation
* fix bug
* fix bug
* fix bug
* address issue
* address issues
* address comments
* fix test
* fix
* move constants
* remove consts
2024-11-27 15:17:02 -08:00
Adil Hafeez
704b928d61
release 0.1.5 ( #307 )
2024-11-26 13:28:52 -08:00
Adil Hafeez
0ff3d43008
remove dependency on docker-compose when starting up archgw ( #305 )
2024-11-26 13:13:02 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards ( #303 )
2024-11-25 17:16:35 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 ( #280 )
* release 0.1.3
* update ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 ( #266 )
2024-11-12 23:56:33 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 ( #255 )
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
2024-11-12 10:35:14 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing ( #229 )
2024-11-07 22:11:00 -06:00
CTran
fb67788be0
add prefill and test ( #236 )
* add prefill and test
* fix stream
* fix
* feedback
* address comments
* update
* add e2e test
* fix e2e test
* update fix
* fix
* address cmt
* address cmt
2024-11-07 11:59:29 -08:00
Adil Hafeez
8c6ad87c1c
release 0.1.0 ( #239 )
* set version to 0.1.0
* update readme
2024-10-30 18:56:49 -07:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests ( #230 )
2024-10-30 17:54:51 -07:00
Adil Hafeez
60299244b9
Improve Gradio UI and fix arch_state bug ( #227 )
2024-10-29 11:27:13 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fix a few issues (see description) ( #202 )
2024-10-28 17:05:06 -07:00
CTran
25dddcbfd9
fix model server stop process ( #217 )
* fix model server stop process
* replace
* replace
* add test
* add multiple pids test
* add check install for linux
* reformat
2024-10-24 19:21:47 -07:00
Salman Paracha
708fa15a9b
HR agent demo ( #206 )
* committing my hr_agent branch
* updating the HR agent config
* pushing to remote
* fix hr agent
* committing to merge with main
* updating to merge from main
* updating the demo and model-server-tests to pull from poetry
* updating the poetry.lock files
* updating based on feedback
* updated system prompt for hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
Adil Hafeez
dd1c7be706
Pass tool call and app function response back in metadata ( #193 )
2024-10-18 13:25:39 -07:00
Adil Hafeez
28421353fd
Update vscode workspace ( #199 )
- add recommended extensions
- set python interpreter path for all python projects to be venv/bin/python
- update project structure in workspace
- rename project file from gatewa -> archgw
2024-10-18 12:57:58 -07:00