Commit graph

436 commits

Author SHA1 Message Date
Adil Hafeez
92a425facd
release 0.3.6 (#536) 2025-07-22 12:48:20 -07:00
Adil Hafeez
d341f4365b
In request path use same format for usage preferences as arch_config (#533) 2025-07-21 18:31:19 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 (#534) 2025-07-21 10:09:22 -07:00
Adil Hafeez
83f4d33434
refactor logging in brightstaff (#532)
refactor logs, move unnecessary info log statements to debug and start logging latest chat completion message to log
2025-07-17 16:00:04 -07:00
Adil Hafeez
f819ee3507
pass model name in header when a route is selected when using usage preferences (#531) 2025-07-17 13:41:58 -07:00
Salman Paracha
2340a45353
chatgpt.com updated its backend api path. fixing (#530) 2025-07-14 21:20:23 -07:00
Salman Paracha
5e65572573
updating the messaging to call ourselves the edge and AI gateway for … (#527)
* updating the messaging to call ourselves the edge and AI gateway for agents

* updating README to tidy up some language

* updating README to tidy up some language

* updating README to tidy up some language

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-12 03:25:09 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 (#525)
* release 0.3.4

* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
a7fddf30f9
better model names (#517) 2025-07-11 16:42:16 -07:00
Salman Paracha
4e2355965b
removing the tabs permission and reverting to window.postMessage (#524)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-11 12:07:32 -07:00
Salman Paracha
5ea51c8cbd
added the tabs permission again (#521) 2025-07-08 17:50:41 -07:00
Salman Paracha
c0748718f1
Salmanap/chrome extension routing (#511)
* first commit of the chatGPT selector

* stashing changes as checkpoint

* pending changes for chrome extension

* committing a working version

* converting conversation into messages object

* working version of the extension

* working version with fixed styling and better tested

* fixed the issue that the drop down was too small, and fixed the issue where the route was not displayed on the screen

* updating folder with README.md

* fixes for default model, and to update the manifest.json file

* made changes to the dark mode. improved styles

* fix installation bug

* added dark mode

* fixed default model selection

* fixed the scrolling issue

* Update README.md

* updated content.js to update the labels even when default model

* fixed readme

* updated the title of the package

* removing the unnecessary permissions

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
Co-authored-by: cotran <cotran2@utexas.edu>
Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2025-07-08 16:16:32 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 (#519) 2025-07-08 00:59:33 -07:00
Adil Hafeez
147908ba7e
make arch-router cluster optional (#518) 2025-07-08 00:33:40 -07:00
Salman Paracha
a212dd79da
Update README.md 2025-07-03 06:59:53 -07:00
Salman Paracha
dfc7e04821
Update README.md 2025-07-03 06:59:20 -07:00
Adil Hafeez
00dc95e034
Add support for updating model preferences (#510) 2025-07-02 14:08:19 -07:00
Salman Paracha
1963020c21
updating the README to reflect preference based routing and clean up … (#512) 2025-07-01 12:30:30 -07:00
Salman Paracha
a03aef8b4c
local support for Arch-Router via Ollama (#509)
* local support for Arch-Router via Ollama

* fixed issue with the non-local yaml config

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-06-20 17:32:14 -07:00
Salman Paracha
1bbd35f1ab
pushing docs updated (#508)
* pushing docs updated

* Fixed README.md logo

* Fixed README.md logo

* Fixed README.md spacing

* fixed tag line

* LLM router doc fixes

* minor logo and branding changes

* minor changes to the README

* minor changes to the README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-06-17 08:16:42 -07:00
Adil Hafeez
7baec20772
release 0.3.2 (#507) 2025-06-13 17:02:20 -07:00
Adil Hafeez
0d6e06dfec
update .rest file to show how to call arch-router model 2025-06-13 10:29:22 -07:00
Adil Hafeez
341b14c1be
upgrade envoy to 1.34 (#506) 2025-06-11 15:17:43 -07:00
Adil Hafeez
aa9d747fa9
add support for gemini (#505) 2025-06-11 15:15:00 -07:00
Adil Hafeez
e734b76086
fix build break 2025-06-10 13:07:35 -07:00
Adil Hafeez
fb34dbdf6c
remove unnecessary rust files 2025-06-10 12:54:31 -07:00
Adil Hafeez
6c53510f49
Introduce hermesllm library to handle llm message translation (#501) 2025-06-10 12:53:27 -07:00
Adil Hafeez
96b583c819
make model required in readme and rst files (#503) 2025-06-05 20:14:13 -07:00
Adil Hafeez
e43d41ba32
add support for brotli compression (#502) 2025-06-05 17:00:14 -07:00
Dougal Ballantyne
93224ed551
Update Dockerfile to fix warnings (#500) 2025-05-31 21:27:29 -07:00
Adil Hafeez
2e47d41a8c
Add ARCH_API_KEY in preference based routing demo (#498) 2025-05-31 01:52:25 -07:00
Adil Hafeez
aff389d342
don't run docker compose up for preference based router e2e demo tests (#499) 2025-05-31 01:16:17 -07:00
Adil Hafeez
0f139baf13
use consistent version across all arch_config files (#497) 2025-05-31 01:11:14 -07:00
Adil Hafeez
c7a3a668a9
update readme for preference based routing (#496) 2025-05-30 18:09:10 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 (#495) 2025-05-30 17:47:59 -07:00
Adil Hafeez
0d190a6e5c
update code to use new json based system prompt for routing (#493) 2025-05-30 17:40:46 -07:00
Adil Hafeez
8d12a9a6e0
add arch provider (#494) 2025-05-30 17:12:52 -07:00
CTran
6a01eea813
LLM Router api doc (#492)
* Create router.rst

* add doc

* update api

* update api

* Update docs/source/guides/llm_router.rst

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update docs/source/guides/llm_router.rst

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* fix whitespace

* Update llm_router.rst

* remove feature and align examples

* remove feature and align examples

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2025-05-30 16:15:26 -07:00
Adil Hafeez
176f039bbc
fix model warning and use openwebui for preference based router demo 2025-05-30 12:29:56 -07:00
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing (#491) 2025-05-30 02:14:37 -07:00
Adil Hafeez
470cdf9843
use provider_name as model_id /v1/models api (#490) 2025-05-29 11:23:18 -07:00
Adil Hafeez
9c4733590f
add support for openwebui (#487) 2025-05-28 19:08:00 -07:00
Adil Hafeez
4899117876
add compress/decompress filter to llm listener (#489) 2025-05-28 15:06:52 -07:00
Adil Hafeez
d29eba4102
trim conversation if it exceeds max limit of what router model can handle (#488) 2025-05-27 20:28:22 -07:00
Adil Hafeez
79cbcb5fe1
add claude-4 in llm_routing demo (#486) 2025-05-23 10:21:21 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 (#483) 2025-05-23 09:52:23 -07:00
Adil Hafeez
99dd900a34
fix panic in brightstaff (#485)
make router section optional in arch_config
2025-05-23 09:37:25 -07:00
Adil Hafeez
21faae605f
correctly map envoy stats to host (#484)
host port 19901 -> envoy container port 9901
2025-05-23 09:37:15 -07:00
Adil Hafeez
a0d10153f9
update archgw logs file to stream access logs from container (#482) 2025-05-23 09:15:44 -07:00
Adil Hafeez
d050dfb85a
When router usage is defined ensure that router model is defined too (#481) 2025-05-23 08:46:12 -07:00