cotran
3e4b04099c
fix test
2024-12-10 19:31:35 -08:00
cotran
b2ef3f7266
fix test
2024-12-10 19:31:02 -08:00
Adil Hafeez
e5b4fd00ed
fix model_server test bootup
2024-12-10 19:02:20 -08:00
Adil Hafeez
94c18925de
fix tests
2024-12-10 18:51:40 -08:00
cotran
2405fb36e3
Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main
2024-12-10 18:04:56 -08:00
cotran
188b0a8a39
modify changes
2024-12-10 18:04:50 -08:00
Adil Hafeez
60dfb18018
send latency numbers from model_server as metadata
2024-12-10 16:01:17 -08:00
Adil Hafeez
14625e2a1d
fix rest file
2024-12-10 15:12:50 -08:00
Adil Hafeez
44872107a8
integrate arch with model_server
2024-12-10 15:12:31 -08:00
Shuguang Chen
791ce0a7ed
Merge branch 'shuguang/main' of https://github.com/katanemo/archgw into shuguang/main
2024-12-09 15:41:06 -08:00
Shuguang Chen
3859e8eb43
Fix bugs
2024-12-09 15:40:57 -08:00
cotran
0ec6762cda
fix
2024-12-09 15:40:50 -08:00
cotran
21fc0b5624
add log
2024-12-09 15:37:17 -08:00
cotran
9998c8b0f3
add log
2024-12-09 15:33:59 -08:00
cotran
5439e07e0e
new thd
2024-12-09 15:22:02 -08:00
cotran
18d76d0427
latest threshold
2024-12-09 15:02:53 -08:00
cotran
95740ca051
Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main
2024-12-09 14:44:17 -08:00
cotran
8d3cf21f79
change thd
2024-12-09 14:44:14 -08:00
Shuguang Chen
1635d44e4a
Update api_model_server.rest
2024-12-09 14:35:03 -08:00
Shuguang Chen
8871d3f751
Collect debugging log
2024-12-09 14:15:10 -08:00
cotran
f13947732c
add more test
2024-12-09 13:48:30 -08:00
cotran
63cc2ef3f3
fix bug
2024-12-09 13:33:10 -08:00
cotran
3b0ba352a9
Merge branch 'shuguang/main' of https://github.com/katanemo/arch into shuguang/main
2024-12-09 13:32:24 -08:00
CTran
9dd7f15eab
Cotran/integration ( #341 )
...
* add hallucination
* add test and fix bug
2024-12-09 13:30:52 -08:00
cotran
7bb6c8c750
add test and fix bug
2024-12-09 13:21:28 -08:00
Adil Hafeez
8f1b21124b
fix model_server cli and update version to 0.1.7
2024-12-09 12:11:17 -08:00
cotran
c12d63a109
working hallucination integration
2024-12-09 12:00:44 -08:00
Adil Hafeez
1299afd05c
add overrides module to model server
2024-12-09 11:52:57 -08:00
Adil Hafeez
0e22c3e7de
Merge branch 'main' into shuguang/main
2024-12-09 11:43:42 -08:00
cotran
423cfc0872
add hallucination
2024-12-09 11:33:41 -08:00
Shuguang Chen
1a3d33409b
Fix bugs
2024-12-09 11:19:09 -08:00
José Ulises Niño Rivera
cd1b561192
Break apart metrics into their own module ( #335 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-09 10:46:46 -08:00
Shuguang Chen
e0d4ee7357
Update ArchFunctionHandler
2024-12-08 16:41:45 -08:00
Shuguang Chen
95e167c2f6
Refacotr model configs
2024-12-08 16:00:25 -08:00
Shuguang Chen
320f4612b8
Update cli and related utils
2024-12-08 11:16:34 -08:00
Shuguang Chen
b4f4695f16
Reorganize model_server
2024-12-08 09:21:53 -08:00
CTran
a40cdc7b75
Cotran/intent ( #339 )
...
* add else
* integrate hallucination
* remove test
2024-12-08 08:56:35 -08:00
José Ulises Niño Rivera
d002b2042a
Break apart common_types mod ( #334 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-06 17:25:42 -08:00
Adil Hafeez
93d3d349a2
fix code bug ( #340 )
2024-12-06 17:20:59 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 ( #338 )
2024-12-06 15:48:07 -08:00
Adil Hafeez
c186c3dfc0
update quick start rst to be in sync with readme.md ( #337 )
2024-12-06 15:15:26 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples ( #330 )
2024-12-06 14:37:33 -08:00
Shuguang Chen
afec644789
Update the logic of intent detection
2024-12-06 14:14:44 -08:00
Shuguang Chen
79eafc0241
Update ArchBaseHandler
2024-12-06 14:07:01 -08:00
Aayush
9d8fe02729
fix the README for the weather_forecasting demo ( #336 )
...
* README fix
* add missing colon
2024-12-06 14:02:41 -08:00
Ikko Eltociear Ashimine
4e919613f1
docs: update README.md ( #332 )
...
minor fix
2024-12-06 13:44:33 -08:00
Aayush
885acc899f
322 add support for pydantic logfire for llm agent tracing ( #329 )
...
* set up otel-collector and implement sending to logfire
* moved rest of the files for the demo into the folder
* update docker-compose.yaml and run_demo.sh to properly check for LOGFIRE_API_KEY
* refactor weather_forecast demo to only be one demo
* add a default docker-compose for e2e tests
* update based on requested changes
* fix replace comma with colon in readme
* remove weather_forecast_service folder, and make logfire demo fail instantly if no key is set
* remove the unused weather forecast service folder
* Changed stop_demo to only stop one file at a time
* update readme with new demo stopping setup
* Revert changes to end behavior
* fix silly formatting mistake
2024-12-06 13:44:22 -08:00
Shuguang Chen
2fd8a5a06d
Update Arch-Guard and corresponding e2e test
2024-12-06 13:41:18 -08:00
Shuguang Chen
6dc92fbbc1
Add e2e test of function_calling endpoint
2024-12-06 11:54:01 -08:00
Shuguang Chen
818ec0a9db
Fix the issue of Arch-Intent
2024-12-06 11:53:46 -08:00