Commit graph

95 commits

Author SHA1 Message Date
Adil Hafeez
c9b62fc52e
Merge branch 'main' into integrate-Arch-Function-v1.1 2025-04-07 00:34:25 -07:00
Adil Hafeez
4d2d8bd7a1
release 0.2.5 (#457) 2025-04-06 01:24:01 -07:00
cotran
09a1bebff5 fix json formatting for tool call msg 2025-04-04 15:01:38 -07:00
Shuguang Chen
d8dabfeec0 Fix message formatting 2025-04-04 10:51:37 -07:00
Shuguang Chen
cbd181a092 Fix a bug in message formatting 2025-04-04 09:53:54 -07:00
co tran
0c3d52bfe4 format fix 2025-04-01 20:25:15 +00:00
co tran
1b39ee3dd8 fix bugs when no logprob for prefill and bug in function calling loop when it always no tool call 2025-04-01 20:18:56 +00:00
co tran
a3ab6be51c modify hallucination threshold and temperature 2025-04-01 17:36:54 +00:00
Adil Hafeez
f2323f771c
update response from upstream llm to now include it in dict with "response" 2025-03-31 18:42:46 -07:00
co tran
5bd991e97b fix precommit 2025-04-01 01:28:18 +00:00
co tran
b7916ce192 clean code + remove cmts 2025-04-01 01:19:29 +00:00
co tran
6e5cb5d485 fix test 2025-04-01 01:18:15 +00:00
co tran
a61a2a1d70 remove test until more evaluation on example 2025-04-01 00:12:03 +00:00
co tran
610843d98d remove test until more evaluation on example 2025-04-01 00:03:14 +00:00
co tran
7c6ddc9396 remove test until more evaluation on example 2025-03-31 23:41:11 +00:00
Shuguang Chen
6ec4c14407 Fix prompt prefilling 2025-03-31 15:08:38 -07:00
co tran
afe7cc9e9e fix bug and test 2025-03-31 21:50:05 +00:00
co tran
cc0845bce4 fix hallucination loop 2025-03-31 21:08:47 +00:00
Shuguang Chen
f035d166c8 Fix hallucination check 2025-03-28 16:30:03 -07:00
Shuguang Chen
425f9b0dd5 Update model usage 2025-03-28 15:10:51 -07:00
Adil Hafeez
8290d1969f
use public endpoint for arch v1.1 2025-03-28 12:38:44 -07:00
CTran
a3f2b3cef9
add hallucination modification (#455)
* add hallucination modification

* disable test
2025-03-28 09:49:20 -07:00
Adil Hafeez
b31a7a569a
update rest and other parts of the code to work with arch fc 1.1 2025-03-28 03:04:21 -07:00
Shuguang Chen
8335f0c3de minor update 2025-03-27 10:26:47 -07:00
Shuguang Chen
820c0443ee disable hallucination check 2025-03-24 17:07:07 -07:00
Shuguang Chen
cf30e94415 Init update 2025-03-24 16:53:10 -07:00
Adil Hafeez
9f59943041
update code to use 0.2.4 release (#446)
* update code to use 0.2.4 release

* update lock file
2025-03-21 16:08:59 -07:00
Adil Hafeez
84cd1df7bf
add preliminary support for llm agents (#432) 2025-03-19 15:21:34 -07:00
Adil Hafeez
d8b833fe69
release 0.2.3 (#423) 2025-03-04 14:30:44 -08:00
Shuguang Chen
e77fc47225
Handle intent matching better in arch gateway (#391) 2025-03-04 12:49:13 -08:00
Adil Hafeez
1bbc5d2233
release 0.2.2 (#413) 2025-02-14 20:02:59 -08:00
CTran
e7b370cd2f
fix error in function name + new thresholds (#406)
* fix error in function name + new thresholds

* fix

* fix

* remove example

* remove example
2025-02-14 09:57:39 -08:00
Adil Hafeez
4ec03af16e
use archfc hosted on aws (#409) 2025-02-13 11:03:34 -08:00
Adil Hafeez
0ea237fbac
release 0.2.1 (#399) 2025-02-07 19:21:20 -08:00
Adil Hafeez
8de6eacfbd
spotify demo with optimized context window code change (#397) 2025-02-07 19:14:15 -08:00
Adil Hafeez
7830f4b431
release 0.2.0 (#384)
* release 0.2.0

* update versions
2025-01-24 17:31:48 -08:00
Adil Hafeez
452084423c
add PR to release 0.1.9 (#371) 2025-01-17 18:47:26 -08:00
Shuguang Chen
88a02dc478
Some fixes on model server (#362)
* Some fixes on model server

* Remove prompt_prefilling message

* Fix logging

* Fix poetry issues

* Improve logging and update the support for text truncation

* Fix tests

* Fix tests

* Fix tests

* Fix modelserver tests

* Update modelserver tests
2025-01-10 16:45:36 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway (#328) 2024-12-20 13:25:01 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 (#338) 2024-12-06 15:48:07 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples (#330) 2024-12-06 14:37:33 -08:00
CTran
cadd3cdaf9
hallucination with log probs (#281)
* first init

* fix

* fix test

* new implemenetation

* fix bug

* fix bug

* fix bug

* address issue

* address issues

* address comments

* fix test

* fix

* move constatns

* remove consts
2024-11-27 15:17:02 -08:00
Adil Hafeez
704b928d61
release 0.1.5 (#307) 2024-11-26 13:28:52 -08:00
Adil Hafeez
0ff3d43008
remove dependency on docker-compose when starting up archgw (#305) 2024-11-26 13:13:02 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards (#303) 2024-11-25 17:16:35 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 (#280)
* release 0.1.3

* udpate ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 (#266) 2024-11-12 23:56:33 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13

* use python:3.12
2024-11-12 10:35:14 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing (#229) 2024-11-07 22:11:00 -06:00
CTran
fb67788be0
add prefill and test (#236)
* add prefill and test

* fix stream

* fix

* feedback

* address comments

* update

* add e2e test

* fix e2e test

* update fix

* fix

* address cmt

* address cmt
2024-11-07 11:59:29 -08:00