Commit graph

301 commits

Author SHA1 Message Date
cotran
c12d63a109 working hallucination integration 2024-12-09 12:00:44 -08:00
cotran
423cfc0872 add hallucination 2024-12-09 11:33:41 -08:00
Shuguang Chen
1a3d33409b Fix bugs 2024-12-09 11:19:09 -08:00
Shuguang Chen
e0d4ee7357 Update ArchFunctionHandler 2024-12-08 16:41:45 -08:00
Shuguang Chen
95e167c2f6 Refacotr model configs 2024-12-08 16:00:25 -08:00
Shuguang Chen
320f4612b8 Update cli and related utils 2024-12-08 11:16:34 -08:00
Shuguang Chen
b4f4695f16 Reorganize model_server 2024-12-08 09:21:53 -08:00
CTran
a40cdc7b75
Cotran/intent (#339)
* add else

* integrate hallucination

* remove test
2024-12-08 08:56:35 -08:00
Shuguang Chen
afec644789 Update the logic of intent detection 2024-12-06 14:14:44 -08:00
Shuguang Chen
79eafc0241 Update ArchBaseHandler 2024-12-06 14:07:01 -08:00
Shuguang Chen
2fd8a5a06d Update Arch-Guard and corresponding e2e test 2024-12-06 13:41:18 -08:00
Shuguang Chen
6dc92fbbc1 Add e2e test of function_calling endpoint 2024-12-06 11:54:01 -08:00
Shuguang Chen
818ec0a9db Fix the issue of Arch-Intent 2024-12-06 11:53:46 -08:00
Shuguang Chen
4fcfd83639 Refine model_server 2024-12-05 15:19:41 -08:00
Shuguang Chen
a5bd005411 Fix a bug in function_calling.py 2024-12-05 12:06:56 -08:00
Shuguang Chen
09f7e1e604 Update guardrail_handler and its associated tests 2024-12-05 11:30:58 -08:00
Shuguang Chen
b686cf8b87 refactor model_handler 2024-12-05 11:00:22 -08:00
Shuguang Chen
afe1410b37 Init update on model_server 2024-12-04 16:41:30 -08:00
Shuguang Chen
1d9de28086 Update .gitignore 2024-12-02 15:12:33 -08:00
Salman Paracha
a0c159c9ba
updating doc versions, images and cleaning up section for prompt-guard (#320)
* updating doc versions, images and cleaning up section for prompt-guard

* updating based on feedback

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-12-01 23:02:08 -08:00
CTran
cadd3cdaf9
hallucination with log probs (#281)
* first init

* fix

* fix test

* new implemenetation

* fix bug

* fix bug

* fix bug

* address issue

* address issues

* address comments

* fix test

* fix

* move constatns

* remove consts
2024-11-27 15:17:02 -08:00
Peter Jausovec
f5cdafb7c8
update alertmanager version to v2, remove the merge artifacts (#309)
Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>
2024-11-27 11:41:31 -08:00
Adil Hafeez
ec5326250e
correctly map stats port to host (#311) 2024-11-27 11:28:41 -08:00
Adil Hafeez
adec02e742
add note about hosted arch-fc (#308) 2024-11-26 14:19:10 -08:00
Adil Hafeez
704b928d61
release 0.1.5 (#307) 2024-11-26 13:28:52 -08:00
Adil Hafeez
0ff3d43008
remove dependency on docker-compose when starting up archgw (#305) 2024-11-26 13:13:02 -08:00
Adil Hafeez
726f1a3185
add schema change to use enum in arch_config (#304) 2024-11-25 17:51:25 -08:00
José Ulises Niño Rivera
be8c3c9ea3
Remove blanket unused imports from the common crate (#292)
* Remove blanket unused imports from the common crate

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

* updatE

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>

---------

Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-11-25 17:19:06 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards (#303) 2024-11-25 17:16:35 -08:00
Adil Hafeez
6f4a57b56d
update readme with python version (#302) 2024-11-25 16:01:40 -08:00
Salman Paracha
970db68575
updating readme and docs with note about Arch-Function (#285)
* updating readme and docs with note about Arch-Function

* minor fixes to README

* a few more minor updates to the README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-19 08:43:56 -08:00
Adil Hafeez
33ab24292c
publish docker image on release only (#284) 2024-11-18 18:18:46 -08:00
Adil Hafeez
3d3d015aea
publish docker image (#283) 2024-11-18 17:55:47 -08:00
Adil Hafeez
36489b4adc
use envoy to publish traces (#270) 2024-11-18 17:55:39 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 (#280)
* release 0.1.3

* udpate ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
097513ee60
fix start time of llm filter (#278)
* fix start time of llm filter

* fix int tests
2024-11-17 17:01:19 -08:00
Salman Paracha
df0cd50cbd
updating website to track analytics (#273)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-16 21:37:04 -08:00
Salman Paracha
8e9608995c
updated website with PH badge (#272) 2024-11-16 13:02:23 -08:00
Salman Paracha
a0d87d86c9
updating docs to reflect changes in 0.1.2 like tracing via signoz and… (#271) 2024-11-15 16:55:27 -08:00
Adil Hafeez
d3c17c7abd
move custom tracer to llm filter (#267) 2024-11-15 10:44:01 -08:00
Aayush
1d229cba8f
Add in tpot (#269)
* add in tpot and tokens per second

* add in debug logs for new stats and update integration tests

* update shared dashboard to include new stats
2024-11-14 15:03:08 -08:00
Salman Paracha
9eeb790c7f
updating README with PH launch results (#268) 2024-11-14 01:11:26 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 (#266) 2024-11-12 23:56:33 -08:00
Adil Hafeez
31749bfc74
move grafana and prometheus to shared (#265) 2024-11-12 15:23:30 -08:00
Aayush
5993e36f22
Update arch stats (#250) 2024-11-12 15:03:26 -08:00
Adil Hafeez
30647fd508
Add service to stream custom otel traces to otel-collector (#262) 2024-11-12 11:09:40 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13

* use python:3.12
2024-11-12 10:35:14 -08:00
Salman Paracha
4b2b371876
removing depdency on mistral keys (#256) 2024-11-08 16:09:04 -08:00
Adil Hafeez
9081eb0f7f
obfuscate auth header (#254) 2024-11-08 15:17:39 -06:00
Adil Hafeez
88d0f99866
add requirements to readme (#249) 2024-11-08 10:43:18 -08:00