Commit graph

103 commits

Author SHA1 Message Date
Salman Paracha
d5a273f740
enable state management for v1/responses (#631)
* first commit with tests to enable state mamangement via memory

* fixed logs to follow the conversational flow a bit better

* added support for supabase

* added the state_storage_v1_responses flag, and use that to store state appropriately

* cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo

* fixed mixed inputs from openai v1/responses api (#632)

* fixed mixed inputs from openai v1/responses api

* removing tracing from model-alias-rouing

* handling additional input types from openairs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>

* resolving PR comments

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2025-12-17 12:18:38 -08:00
Adil Hafeez
8adb9795d8
release 0.3.22 (#629) 2025-12-11 11:20:19 -08:00
Adil Hafeez
09c0b999b2
release 0.3.21 (#626) 2025-12-03 17:12:34 -08:00
Adil Hafeez
b01a81927d
release 0.3.20 (#620) 2025-11-22 19:29:04 -08:00
Salman Paracha
d37af7605c
removing model_server. buh bye (#619) 2025-11-22 15:04:41 -08:00
Salman Paracha
88c2bd1851
removing model_server python module to brightstaff (function calling) (#615)
* adding function_calling functionality via rust

* fixed rendered YAML file

* removed model_server from envoy.template and forwarding traffic to bright_staff

* fixed bugs in function_calling.rs that were breaking tests. All good now

* updating e2e test to clean up disk usage

* removing Arch* models to be used as a default model if one is not specified

* if the user sets arch-function base_url we should honor it

* fixing demos as we needed to pin to a particular version of huggingface_hub else the chatbot ui wouldn't build

* adding a constant for Arch-Function model name

* fixing some edge cases with calls made to Arch-Function

* fixed JSON parsing issues in function_calling.rs

* fixed bug where the raw response from Arch-Function was re-encoded

* removed debug from supervisord.conf

* commenting out disk cleanup

* adding back disk space

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2025-11-22 12:55:00 -08:00
Adil Hafeez
126b029345
release 0.3.18 (#611) 2025-10-31 12:24:49 -07:00
Salman Paracha
cdfcfb9169
support base_url path for model providers (#608)
* adding support for base_url

* updated docs

* fixed tests for config generator

* making fixes based on PR comments

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-29 17:08:07 -07:00
Adil Hafeez
f26bb05d35
release 0.3.17 (#604) 2025-10-24 17:52:15 -07:00
Adil Hafeez
6d70545459
release 0.3.16 (#596) 2025-10-22 14:43:33 -07:00
Salman Paracha
7a6f87de3e
fixed test and docs for deployment (#595)
* fixed test and docs for deployment

* updating the main logo image

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-22 14:13:16 -07:00
Salman Paracha
9407ae6af7
Add support for Amazon Bedrock Converse and ConverseStream (#588)
* first commit to get Bedrock Converse API working. Next commit support for streaming and binary frames

* adding translation from BedrockBinaryFrameDecoder to AnthropicMessagesEvent

* Claude Code works with Amazon Bedrock

* added tests for openai streaming from bedrock

* PR comments fixed

* adding support for bedrock in docs as supported provider

* cargo fmt

* revertted to chatgpt models for claude code routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com>
2025-10-22 11:31:21 -07:00
Adil Hafeez
96e0732089
add support for agents (#564) 2025-10-14 14:01:11 -07:00
Salman Paracha
6a06d9ac97
add claude code router to the README (#586)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-05 13:38:39 -07:00
Salman Paracha
03d8cc1894
fixing docs (#584)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-01 22:26:54 -07:00
Salman Paracha
226139e907
adding support for Qwen models and fixed issue with passing PATH vari… (#583)
* adding support for Qwen models and fixed issue with passing PATH variable

* don't need to have qwen in the model alias routing example

* fixed base_url for qwen

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-01 21:57:58 -07:00
Adil Hafeez
cd563c2706
release 0.3.15 (#579) 2025-09-30 13:44:11 -07:00
Salman Paracha
045a5e9751
adding support for moonshot and z-ai (#578)
* adding support for moonshot and z-ai

* Revert unwanted changes to arch_config.yaml

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-09-30 12:24:06 -07:00
Adil Hafeez
7df1b8cdb0
release 0.3.14 (#577) 2025-09-29 23:11:43 -07:00
Adil Hafeez
7ce8d44d8e
release 0.3.13 (#572) 2025-09-19 11:26:49 -07:00
Salman Paracha
fbe82351c0
Salmanap/fix docs new providers model alias (#571)
* fixed docs and added ollama as a first-class LLM provider

* matching the LLM routing section on the README.md to the docs

* updated the section on preference-based routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
2025-09-19 10:19:57 -07:00
Salman Paracha
8d0b468345
draft commit to add support for xAI, TogehterAI, AzureOpenAI (#570)
* draft commit to add support for xAI, LambdaAI, TogehterAI, AzureOpenAI

* fixing failing tests and updating rederend config file

* Update arch_config_with_aliases.yaml

* adding the AZURE_API_KEY to the GH workflow for e2e

* fixing GH secerts

* adding valdiating for azure_openai

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
2025-09-18 18:36:30 -07:00
Adil Hafeez
118f60eea7
release 0.3.12 (#567) 2025-09-16 11:56:05 -07:00
Adil Hafeez
1e8c81d8f6
release 0.3.11 (#565) 2025-09-11 18:44:18 -07:00
Adil Hafeez
1fdde8181a
release 0.3.10 (#555) 2025-08-13 14:50:10 -07:00
Adil Hafeez
ad4cea227f
release 0.3.9 (#552) 2025-08-12 13:43:43 -07:00
Adil Hafeez
2639323dab
release 0.3.8 (#550) 2025-08-11 14:12:17 -07:00
Salman Paracha
62a092fa63
consistent messaging (#546)
* consistent messaging

* updating README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-28 11:45:07 -07:00
Musa
d215724864
Update llm_provider.rst (#543) 2025-07-27 09:26:12 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 (#542) 2025-07-25 19:08:24 -07:00
Adil Hafeez
92a425facd
release 0.3.6 (#536) 2025-07-22 12:48:20 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 (#534) 2025-07-21 10:09:22 -07:00
Salman Paracha
5e65572573
updating the messaging to call ourselves the edge and AI gateway for … (#527)
* updating the messaging to call ourselves the edge and AI gateway for agents

* updating README to tidy up some language

* updating README to tidy up some language

* updating README to tidy up some language

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-12 03:25:09 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 (#525)
* release 0.3.4

* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
a7fddf30f9
better model names (#517) 2025-07-11 16:42:16 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 (#519) 2025-07-08 00:59:33 -07:00
Salman Paracha
1963020c21
updating the REAMDE to reflect preference based routing and clean up … (#512) 2025-07-01 12:30:30 -07:00
Salman Paracha
1bbd35f1ab
pushing docs updated (#508)
* pushing docs updated

* Fixed README.md logo

* Fixed README.md logo

* Fixed README.md spacing

* fixed tag line

* LLM router doc fixes

* minor logo and branding changes

* minor changes to the README

* minor changes to the README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-06-17 08:16:42 -07:00
Adil Hafeez
7baec20772
release 0.3.2 (#507) 2025-06-13 17:02:20 -07:00
Adil Hafeez
96b583c819
make model required in readme and rst files (#503) 2025-06-05 20:14:13 -07:00
Adil Hafeez
0f139baf13
use consistent version across all arch_config files (#497) 2025-05-31 01:11:14 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 (#495) 2025-05-30 17:47:59 -07:00
CTran
6a01eea813
LLM Router api doc (#492)
* Create router.rst

* add doc

* update api

* update api

* Update docs/source/guides/llm_router.rst

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update docs/source/guides/llm_router.rst

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* fix whitespace

* Update llm_router.rst

* remove faeture and align examples

* remove faeture and align examples

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2025-05-30 16:15:26 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 (#483) 2025-05-23 09:52:23 -07:00
Adil Hafeez
9c803f4d69
release 0.2.8 (#472) 2025-04-21 17:02:36 -07:00
Adil Hafeez
00fb1be8a0
release 0.2.7 (#469) 2025-04-16 13:55:24 -07:00
Adil Hafeez
c7c0553427
release 0.2.6 (#463) 2025-04-15 14:50:09 -07:00
Mat Sylvia
e7b0de2a72
Tweak readme docs for minor nits (#461)
Co-authored-by: darkdatter <msylvia@tradestax.io>
2025-04-12 23:52:20 -07:00
Adil Hafeez
4d2d8bd7a1
release 0.2.5 (#457) 2025-04-06 01:24:01 -07:00
Joseph D Alchemist
8ba1f71430
remove typo (#456) 2025-04-03 11:34:57 -07:00