cotran
5e164e8e3c
integrate hallucination
2024-12-06 15:50:03 -08:00
cotran
f7d69d52a7
Merge branch 'shuguang/main' of https://github.com/katanemo/arch into cotran/intent
2024-12-06 14:34:55 -08:00
cotran
d5c034ea79
add else
2024-12-06 14:34:51 -08:00
Shuguang Chen
afec644789
Update the logic of intent detection
2024-12-06 14:14:44 -08:00
Shuguang Chen
79eafc0241
Update ArchBaseHandler
2024-12-06 14:07:01 -08:00
Shuguang Chen
2fd8a5a06d
Update Arch-Guard and corresponding e2e test
2024-12-06 13:41:18 -08:00
Shuguang Chen
6dc92fbbc1
Add e2e test of function_calling endpoint
2024-12-06 11:54:01 -08:00
Shuguang Chen
818ec0a9db
Fix the issue of Arch-Intent
2024-12-06 11:53:46 -08:00
Shuguang Chen
4fcfd83639
Refine model_server
2024-12-05 15:19:41 -08:00
Shuguang Chen
a5bd005411
Fix a bug in function_calling.py
2024-12-05 12:06:56 -08:00
Shuguang Chen
09f7e1e604
Update guardrail_handler and its associated tests
2024-12-05 11:30:58 -08:00
Shuguang Chen
b686cf8b87
refactor model_handler
2024-12-05 11:00:22 -08:00
Shuguang Chen
afe1410b37
Init update on model_server
2024-12-04 16:41:30 -08:00
Shuguang Chen
1d9de28086
Update .gitignore
2024-12-02 15:12:33 -08:00
Salman Paracha
a0c159c9ba
updating doc versions, images and cleaning up section for prompt-guard ( #320 )
...
* updating doc versions, images and cleaning up section for prompt-guard
* updating based on feedback
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-12-01 23:02:08 -08:00
CTran
cadd3cdaf9
hallucination with log probs ( #281 )
...
* first init
* fix
* fix test
* new implemenetation
* fix bug
* fix bug
* fix bug
* address issue
* address issues
* address comments
* fix test
* fix
* move constatns
* remove consts
2024-11-27 15:17:02 -08:00
Peter Jausovec
f5cdafb7c8
update alertmanager version to v2, remove the merge artifacts ( #309 )
...
Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>
2024-11-27 11:41:31 -08:00
Adil Hafeez
ec5326250e
correctly map stats port to host ( #311 )
2024-11-27 11:28:41 -08:00
Adil Hafeez
adec02e742
add note about hosted arch-fc ( #308 )
2024-11-26 14:19:10 -08:00
Adil Hafeez
704b928d61
release 0.1.5 ( #307 )
2024-11-26 13:28:52 -08:00
Adil Hafeez
0ff3d43008
remove dependency on docker-compose when starting up archgw ( #305 )
2024-11-26 13:13:02 -08:00
Adil Hafeez
726f1a3185
add schema change to use enum in arch_config ( #304 )
2024-11-25 17:51:25 -08:00
José Ulises Niño Rivera
be8c3c9ea3
Remove blanket unused imports from the common crate ( #292 )
...
* Remove blanket unused imports from the common crate
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
* updatE
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
---------
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-11-25 17:19:06 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards ( #303 )
2024-11-25 17:16:35 -08:00
Adil Hafeez
6f4a57b56d
update readme with python version ( #302 )
2024-11-25 16:01:40 -08:00
Salman Paracha
970db68575
updating readme and docs with note about Arch-Function ( #285 )
...
* updating readme and docs with note about Arch-Function
* minor fixes to README
* a few more minor updates to the README
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-19 08:43:56 -08:00
Adil Hafeez
33ab24292c
publish docker image on release only ( #284 )
2024-11-18 18:18:46 -08:00
Adil Hafeez
3d3d015aea
publish docker image ( #283 )
2024-11-18 17:55:47 -08:00
Adil Hafeez
36489b4adc
use envoy to publish traces ( #270 )
2024-11-18 17:55:39 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 ( #280 )
...
* release 0.1.3
* udpate ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
097513ee60
fix start time of llm filter ( #278 )
...
* fix start time of llm filter
* fix int tests
2024-11-17 17:01:19 -08:00
Salman Paracha
df0cd50cbd
updating website to track analytics ( #273 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-16 21:37:04 -08:00
Salman Paracha
8e9608995c
updated website with PH badge ( #272 )
2024-11-16 13:02:23 -08:00
Salman Paracha
a0d87d86c9
updating docs to reflect changes in 0.1.2 like tracing via signoz and… ( #271 )
2024-11-15 16:55:27 -08:00
Adil Hafeez
d3c17c7abd
move custom tracer to llm filter ( #267 )
2024-11-15 10:44:01 -08:00
Aayush
1d229cba8f
Add in tpot ( #269 )
...
* add in tpot and tokens per second
* add in debug logs for new stats and update integration tests
* update shared dashboard to include new stats
2024-11-14 15:03:08 -08:00
Salman Paracha
9eeb790c7f
updating README with PH launch results ( #268 )
2024-11-14 01:11:26 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 ( #266 )
2024-11-12 23:56:33 -08:00
Adil Hafeez
31749bfc74
move grafana and prometheus to shared ( #265 )
2024-11-12 15:23:30 -08:00
Aayush
5993e36f22
Update arch stats ( #250 )
2024-11-12 15:03:26 -08:00
Adil Hafeez
30647fd508
Add service to stream custom otel traces to otel-collector ( #262 )
2024-11-12 11:09:40 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 ( #255 )
...
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
2024-11-12 10:35:14 -08:00
Salman Paracha
4b2b371876
removing depdency on mistral keys ( #256 )
2024-11-08 16:09:04 -08:00
Adil Hafeez
9081eb0f7f
obfuscate auth header ( #254 )
2024-11-08 15:17:39 -06:00
Adil Hafeez
88d0f99866
add requirements to readme ( #249 )
2024-11-08 10:43:18 -08:00
Adil Hafeez
6b62662e01
update docs with weather_forecast path ( #253 )
2024-11-08 10:00:15 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing ( #229 )
2024-11-07 22:11:00 -06:00
CTran
fb67788be0
add prefill and test ( #236 )
...
* add prefill and test
* fix stream
* fix
* feedback
* address comments
* update
* add e2e test
* fix e2e test
* update fix
* fix
* address cmt
* address cmt
2024-11-07 11:59:29 -08:00
Ikko Eltociear Ashimine
f48489f7c0
chore: update stream_context.rs ( #248 )
...
initalize -> initialize
2024-11-05 10:18:33 -08:00
Adil Hafeez
9a6ae2efee
retry embeddings fetch ( #245 )
2024-11-05 10:04:36 -08:00