Commit graph

432 commits

Author SHA1 Message Date
Adil Hafeez
04f6c20e38
remove extra log 2025-06-05 23:41:27 -07:00
Adil Hafeez
8bf3f32973
fix tests 2025-06-05 22:24:45 -07:00
Adil Hafeez
b1233da268
add arch 2025-06-05 20:17:07 -07:00
Adil Hafeez
a5bdd9c04f
Merge branch 'main' into adil/hermesllm 2025-06-05 20:14:38 -07:00
Adil Hafeez
96b583c819
make model required in readme and rst files (#503) 2025-06-05 20:14:13 -07:00
Adil Hafeez
950625a65a
fix tests 2025-06-05 20:06:54 -07:00
Adil Hafeez
8e4960df4a
fix more 2025-06-05 20:03:27 -07:00
Adil Hafeez
a3872a57ed
fix docker cli 2025-06-05 17:45:00 -07:00
Adil Hafeez
259683c3c4
don't use host bright staff for github 2025-06-05 17:03:31 -07:00
Adil Hafeez
df340f58c9
Merge branch 'main' into adil/hermesllm 2025-06-05 17:00:40 -07:00
Adil Hafeez
e43d41ba32
add support for brotli compression (#502) 2025-06-05 17:00:14 -07:00
Adil Hafeez
db19395eb7
more changes 2025-06-05 16:58:45 -07:00
Adil Hafeez
25f1b72e7c
more changes 2025-06-05 16:14:40 -07:00
Adil Hafeez
22fde1f333
fmt/clippy 2025-06-05 12:06:50 -07:00
Adil Hafeez
29503ba95e
revert supervisord 2025-06-04 16:49:46 -07:00
Adil Hafeez
6a6afb0888
move models to hermesllm 2025-06-04 16:36:47 -07:00
Adil Hafeez
0c7aa132ee
use hermesllm in arch gateway for both stream and non stream messages 2025-06-04 16:19:45 -07:00
Adil Hafeez
670907145a
more changes 2025-06-04 10:58:09 -07:00
Adil Hafeez
b0c1e97dc5
use req/resp from hermesllm in llm gateway 2025-06-03 15:57:30 -07:00
Adil Hafeez
f10e0fcece
add more changes 2025-06-03 15:00:57 -07:00
Adil Hafeez
2d4d0b01ee
add more changes 2025-06-03 09:58:06 -07:00
Adil Hafeez
9befd6364c
more changes 2025-06-03 00:17:22 -07:00
Adil Hafeez
59dbbd6743
add openai protocol 2025-06-02 23:57:03 -07:00
Adil Hafeez
21ca21dc3c
Merge branch 'main' into adil/hermesllm 2025-06-02 09:29:38 -07:00
Dougal Ballantyne
93224ed551
Update Dockerfile to fix warnings (#500) 2025-05-31 21:27:29 -07:00
Adil Hafeez
2e47d41a8c
Add ARCH_API_KEY in preference based routing demo (#498) 2025-05-31 01:52:25 -07:00
Adil Hafeez
aff389d342
don't run docker compose up for preference based router e2e demo tests (#499) 2025-05-31 01:16:17 -07:00
Adil Hafeez
0f139baf13
use consistent version across all arch_config files (#497) 2025-05-31 01:11:14 -07:00
Adil Hafeez
c7a3a668a9
update readme for preference based routing (#496) 2025-05-30 18:09:10 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 (#495) 2025-05-30 17:47:59 -07:00
Adil Hafeez
0d190a6e5c
update code to use new json based system prompt for routing (#493) 2025-05-30 17:40:46 -07:00
Adil Hafeez
8d12a9a6e0
add arch provider (#494) 2025-05-30 17:12:52 -07:00
CTran
6a01eea813
LLM Router api doc (#492)
* Create router.rst

* add doc

* update api

* update api

* Update docs/source/guides/llm_router.rst

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update docs/source/guides/llm_router.rst

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* fix whitespace

* Update llm_router.rst

* remove feature and align examples

* remove feature and align examples

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2025-05-30 16:15:26 -07:00
Adil Hafeez
176f039bbc
fix model warning and use openwebui for preference based router demo 2025-05-30 12:29:56 -07:00
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing (#491) 2025-05-30 02:14:37 -07:00
Adil Hafeez
cd7b8ef649
add hermes llm 2025-05-29 13:33:41 -07:00
Adil Hafeez
470cdf9843
use provider_name as model_id /v1/models api (#490) 2025-05-29 11:23:18 -07:00
Adil Hafeez
9c4733590f
add support for openwebui (#487) 2025-05-28 19:08:00 -07:00
Adil Hafeez
4899117876
add compress/decompress filter to llm listener (#489) 2025-05-28 15:06:52 -07:00
Adil Hafeez
d29eba4102
trim conversation if it exceeds max limit of what router model can handle (#488) 2025-05-27 20:28:22 -07:00
Adil Hafeez
79cbcb5fe1
add claude-4 in llm_routing demo (#486) 2025-05-23 10:21:21 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 (#483) 2025-05-23 09:52:23 -07:00
Adil Hafeez
99dd900a34
fix panic in brightstaff (#485)
make router section optional in arch_config
2025-05-23 09:37:25 -07:00
Adil Hafeez
21faae605f
correctly map envoy stats to host (#484)
host port 19901 -> envoy container port 9901
2025-05-23 09:37:15 -07:00
Adil Hafeez
a0d10153f9
update archgw logs file to stream access logs from container (#482) 2025-05-23 09:15:44 -07:00
Adil Hafeez
d050dfb85a
When router usage is defined ensure that router model is defined too (#481) 2025-05-23 08:46:12 -07:00
Adil Hafeez
218e9c540d
Add support for json based content types in Message (#480) 2025-05-23 00:51:53 -07:00
Adil Hafeez
f5e77bbe65
add support for claude and add first class support for groq and deepseek (#479) 2025-05-22 22:55:46 -07:00
Adil Hafeez
27c0f2fdce
Introduce brightstaff a new terminal service for llm routing (#477) 2025-05-19 09:59:22 -07:00
Adil Hafeez
1f95fac4af
update arch_config sample on readme to match with new format (#475) 2025-04-29 12:36:46 -07:00