Adil Hafeez
92a425facd
release 0.3.6 ( #536 )
2025-07-22 12:48:20 -07:00
Adil Hafeez
d341f4365b
In request path use same format for usage preferences as arch_config ( #533 )
2025-07-21 18:31:19 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 ( #534 )
2025-07-21 10:09:22 -07:00
Adil Hafeez
83f4d33434
refactor logging in brightstaff ( #532 )
...
refactor logs, move unnecessary info log statements to debug and start logging latest chat completion message to log
2025-07-17 16:00:04 -07:00
Adil Hafeez
f819ee3507
pass model name in header when a route is selected when using usage preferences ( #531 )
2025-07-17 13:41:58 -07:00
Salman Paracha
2340a45353
chatgpt.com updated its backend api path. fixing ( #530 )
2025-07-14 21:20:23 -07:00
Salman Paracha
5e65572573
updating the messaging to call ourselves the edge and AI gateway for … ( #527 )
...
* updating the messaging to call ourselves the edge and AI gateway for agents
* updating README to tidy up some language
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-12 03:25:09 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 ( #525 )
...
* release 0.3.4
* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
a7fddf30f9
better model names ( #517 )
2025-07-11 16:42:16 -07:00
Salman Paracha
4e2355965b
removing the tabs permission and reverting to window.postMessage ( #524 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-11 12:07:32 -07:00
Salman Paracha
5ea51c8cbd
added the tabs permission again ( #521 )
2025-07-08 17:50:41 -07:00
Salman Paracha
c0748718f1
Salmanap/chrome extension routing ( #511 )
...
* first commit of the chatGPT selector
* stashing changes as checkpoint
* pending changes for chrome extension
* committing a working version
* converting conversation into messages object
* working version of the extension
* working version with fixed styling and better tested
* fixed the issue that the drop down was too small, and fixed the issue where the route was not displayed on the screen
* updating folder with README.md
* fixes for default model, and to update the manifest.json file
* made changes to the dark mode. improved styles
* fix installation bug
* added dark mode
* fixed default model selection
* fixed the scrolling issue
* Update README.md
* updated content.js to update the labels even when default model
* fixed readme
* updated the title of the package
* removing the unnecessary permissions
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
Co-authored-by: cotran <cotran2@utexas.edu>
Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2025-07-08 16:16:32 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 ( #519 )
2025-07-08 00:59:33 -07:00
Adil Hafeez
147908ba7e
make arch-router cluster optional ( #518 )
2025-07-08 00:33:40 -07:00
Salman Paracha
a212dd79da
Update README.md
2025-07-03 06:59:53 -07:00
Salman Paracha
dfc7e04821
Update README.md
2025-07-03 06:59:20 -07:00
Adil Hafeez
00dc95e034
Add support for updating model preferences ( #510 )
2025-07-02 14:08:19 -07:00
Salman Paracha
1963020c21
updating the README to reflect preference based routing and clean up … ( #512 )
2025-07-01 12:30:30 -07:00
Salman Paracha
a03aef8b4c
local support for Arch-Router via Ollama ( #509 )
...
* local support for Arch-Router via Ollama
* fixed issue with the non-local yaml config
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-06-20 17:32:14 -07:00
Salman Paracha
1bbd35f1ab
pushing docs updated ( #508 )
...
* pushing docs updated
* Fixed README.md logo
* Fixed README.md spacing
* fixed tag line
* LLM router doc fixes
* minor logo and branding changes
* minor changes to the README
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-06-17 08:16:42 -07:00
Adil Hafeez
7baec20772
release 0.3.2 ( #507 )
2025-06-13 17:02:20 -07:00
Adil Hafeez
0d6e06dfec
update .rest file to show how to call arch-router model
2025-06-13 10:29:22 -07:00
Adil Hafeez
341b14c1be
upgrade envoy to 1.34 ( #506 )
2025-06-11 15:17:43 -07:00
Adil Hafeez
aa9d747fa9
add support for gemini ( #505 )
2025-06-11 15:15:00 -07:00
Adil Hafeez
e734b76086
fix build break
2025-06-10 13:07:35 -07:00
Adil Hafeez
fb34dbdf6c
remove unnecessary rust files
2025-06-10 12:54:31 -07:00
Adil Hafeez
6c53510f49
Introduce hermesllm library to handle llm message translation ( #501 )
2025-06-10 12:53:27 -07:00
Adil Hafeez
96b583c819
make model required in readme and rst files ( #503 )
2025-06-05 20:14:13 -07:00
Adil Hafeez
e43d41ba32
add support for brotli compression ( #502 )
2025-06-05 17:00:14 -07:00
Dougal Ballantyne
93224ed551
Update Dockerfile to fix warnings ( #500 )
2025-05-31 21:27:29 -07:00
Adil Hafeez
2e47d41a8c
Add ARCH_API_KEY in preference based routing demo ( #498 )
2025-05-31 01:52:25 -07:00
Adil Hafeez
aff389d342
don't run docker compose up for preference based router e2e demo tests ( #499 )
2025-05-31 01:16:17 -07:00
Adil Hafeez
0f139baf13
use consistent version across all arch_config files ( #497 )
2025-05-31 01:11:14 -07:00
Adil Hafeez
c7a3a668a9
update readme for preference based routing ( #496 )
2025-05-30 18:09:10 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 ( #495 )
2025-05-30 17:47:59 -07:00
Adil Hafeez
0d190a6e5c
update code to use new json based system prompt for routing ( #493 )
2025-05-30 17:40:46 -07:00
Adil Hafeez
8d12a9a6e0
add arch provider ( #494 )
2025-05-30 17:12:52 -07:00
CTran
6a01eea813
LLM Router api doc ( #492 )
...
* Create router.rst
* add doc
* update api
* Update docs/source/guides/llm_router.rst
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Update docs/source/guides/llm_router.rst
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* fix whitespace
* Update llm_router.rst
* remove feature and align examples
* remove feature and align examples
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2025-05-30 16:15:26 -07:00
Adil Hafeez
176f039bbc
fix model warning and use openwebui for preference based router demo
2025-05-30 12:29:56 -07:00
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing ( #491 )
2025-05-30 02:14:37 -07:00
Adil Hafeez
470cdf9843
use provider_name as model_id /v1/models api ( #490 )
2025-05-29 11:23:18 -07:00
Adil Hafeez
9c4733590f
add support for openwebui ( #487 )
2025-05-28 19:08:00 -07:00
Adil Hafeez
4899117876
add compress/decompress filter to llm listener ( #489 )
2025-05-28 15:06:52 -07:00
Adil Hafeez
d29eba4102
trim conversation if it exceeds max limit of what router model can handle ( #488 )
2025-05-27 20:28:22 -07:00
Adil Hafeez
79cbcb5fe1
add claude-4 in llm_routing demo ( #486 )
2025-05-23 10:21:21 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 ( #483 )
2025-05-23 09:52:23 -07:00
Adil Hafeez
99dd900a34
fix panic in brightstaff ( #485 )
...
make router section optional in arch_config
2025-05-23 09:37:25 -07:00
Adil Hafeez
21faae605f
correctly map envoy stats to host ( #484 )
...
host port 19901 -> envoy container port 9901
2025-05-23 09:37:15 -07:00
Adil Hafeez
a0d10153f9
update archgw logs file to stream access logs from container ( #482 )
2025-05-23 09:15:44 -07:00
Adil Hafeez
d050dfb85a
When router usage is defined ensure that router model is defined too ( #481 )
2025-05-23 08:46:12 -07:00