Commit graph

176 commits

Author SHA1 Message Date
Adil Hafeez
92a8782332
add support for model_providers 2025-09-30 12:18:29 -07:00
Adil Hafeez
2cebc0c85f
Merge branch 'main' into adil/agent_format 2025-09-30 11:39:34 -07:00
Adil Hafeez
7df1b8cdb0
release 0.3.14 (#577) 2025-09-29 23:11:43 -07:00
Salman Paracha
f00870dccb
adding support for claude code routing (#575)
* fixed for claude code routing. first commit

* removing redundant enum tags for cache_control

* making sure that claude code can run via the archgw cli

* fixing broken config

* adding a README.md and updated the cli to use more of our defined patterns for params

* fixed config.yaml

* minor fixes to make sure PR is clean. Ready to ship

* adding claude-sonnet-4-5 to the config

* fixes based on PR

* fixed alias for README

* fixed 400 error handling tests, now that we write temperature to 1.0 for GPT-5

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-257.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-09-29 19:23:08 -07:00
Adil Hafeez
090f613583 update dockerfile to tail logs on console 2025-09-25 14:14:55 -07:00
Adil Hafeez
4d5d7e13d0 update port 2025-09-25 13:17:01 -07:00
Adil Hafeez
6dd99a6c05
some more changes 2025-09-23 10:19:05 -07:00
Adil Hafeez
be33311002
add ollama fix 2025-09-22 15:25:52 -07:00
Adil Hafeez
ed0772fdd0
fix tests 2025-09-22 15:00:30 -07:00
Adil Hafeez
b69a3d5bad
add more changes 2025-09-22 14:48:43 -07:00
Adil Hafeez
4f31edfaf5
more changes 2025-09-22 14:46:33 -07:00
Adil Hafeez
ac68e802d8
Merge branch 'main' into adil/agent_format 2025-09-22 14:39:06 -07:00
Adil Hafeez
7ce8d44d8e
release 0.3.13 (#572) 2025-09-19 11:26:49 -07:00
Salman Paracha
fbe82351c0
Salmanap/fix docs new providers model alias (#571)
* fixed docs and added ollama as a first-class LLM provider

* matching the LLM routing section on the README.md to the docs

* updated the section on preference-based routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
2025-09-19 10:19:57 -07:00
Salman Paracha
8d0b468345
draft commit to add support for xAI, TogehterAI, AzureOpenAI (#570)
* draft commit to add support for xAI, LambdaAI, TogehterAI, AzureOpenAI

* fixing failing tests and updating rederend config file

* Update arch_config_with_aliases.yaml

* adding the AZURE_API_KEY to the GH workflow for e2e

* fixing GH secerts

* adding valdiating for azure_openai

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
2025-09-18 18:36:30 -07:00
Adil Hafeez
124b910746
fix schema validation 2025-09-17 01:10:06 -07:00
Adil Hafeez
c173074864
more changes 2025-09-17 00:57:26 -07:00
Adil Hafeez
6cfce60501
add more changes 2025-09-16 15:25:47 -07:00
Adil Hafeez
c1757bec88
Merge branch 'main' into adil/agent_format 2025-09-16 14:54:43 -07:00
Adil Hafeez
118f60eea7
release 0.3.12 (#567) 2025-09-16 11:56:05 -07:00
Salman Paracha
4eb2b410c5
adding support for model aliases in archgw (#566)
* adding support for model aliases in archgw

* fixed PR based on feedback

* removing README. Not relevant for PR

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>
2025-09-16 11:12:08 -07:00
Adil Hafeez
e57000000d
more changes 2025-09-15 16:01:48 -07:00
Adil Hafeez
a016212588
more changes 2025-09-15 13:18:52 -07:00
Adil Hafeez
4e9df76452
Merge branch 'main' into adil/agent_format 2025-09-12 10:26:20 -07:00
Adil Hafeez
1e8c81d8f6
release 0.3.11 (#565) 2025-09-11 18:44:18 -07:00
Adil Hafeez
093834bb05
fix more 2025-09-11 15:55:25 -07:00
Adil Hafeez
9da4628cd0
fix tests 2025-09-10 23:42:23 -07:00
Adil Hafeez
9120e5b7ba
fix tests 2025-09-10 17:10:15 -07:00
Adil Hafeez
d327aaa92c
more updates 2025-09-10 16:38:07 -07:00
Adil Hafeez
36cfa55233
add file format 2025-09-10 13:48:51 -07:00
Adil Hafeez
5df01d73f0
Merge branch 'main' into adil/tmobile_work 2025-09-10 10:47:33 -07:00
Salman Paracha
fb0581fd39
add support for v1/messages and transformations (#558)
* pushing draft PR

* transformations are working. Now need to add some tests next

* updated tests and added necessary response transformations for Anthropics' message response object

* fixed bugs for integration tests

* fixed doc tests

* fixed serialization issues with enums on response

* adding some debug logs to help

* fixed issues with non-streaming responses

* updated the stream_context to update response bytes

* the serialized bytes length must be set in the response side

* fixed the debug statement that was causing the integration tests for wasm to fail

* fixing json parsing errors

* intentionally removing the headers

* making sure that we convert the raw bytes to the correct provider type upstream

* fixing non-streaming responses to tranform correctly

* /v1/messages works with transformations to and from /v1/chat/completions

* updating the CLI and demos to support anthropic vs. claude

* adding the anthropic key to the preference based routing tests

* fixed test cases and added more structured logs

* fixed integration tests and cleaned up logs

* added python client tests for anthropic and openai

* cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo

* fixing the tests. python dependency order was broken

* updated the openAI client to fix demos

* removed the raw response debug statement

* fixed the dup cloning issue and cleaned up the ProviderRequestType enum and traits

* fixing logs

* moved away from string literals to consts

* fixed streaming from Anthropic Client to OpenAI

* removed debug statement that would likely trip up integration tests

* fixed integration tests for llm_gateway

* cleaned up test cases and removed unnecessary crates

* fixing comments from PR

* fixed bug whereby we were sending an OpenAIChatCompletions request object to llm_gateway even though the request may have been AnthropicMessages

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-9.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-10.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-41.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>
2025-09-10 07:40:30 -07:00
Adil Hafeez
c3a0874829
add more 2025-09-07 23:26:45 -07:00
Adil Hafeez
22c84fb689
pending tmo work 2025-09-07 23:26:00 -07:00
Adil Hafeez
1fdde8181a
release 0.3.10 (#555) 2025-08-13 14:50:10 -07:00
Adil Hafeez
359b6fc4ab
update base image to python3.13 (#554) 2025-08-13 14:20:46 -07:00
Adil Hafeez
ad4cea227f
release 0.3.9 (#552) 2025-08-12 13:43:43 -07:00
Adil Hafeez
950c9b443c
fix cve_2025-6020 by removing libpam (#551)
* fix cve_2025-6020 by removing libpam

* Apply suggestions from code review

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-08-12 13:20:04 -07:00
Adil Hafeez
2639323dab
release 0.3.8 (#550) 2025-08-11 14:12:17 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 (#542) 2025-07-25 19:08:24 -07:00
Adil Hafeez
92a425facd
release 0.3.6 (#536) 2025-07-22 12:48:20 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 (#534) 2025-07-21 10:09:22 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 (#525)
* release 0.3.4

* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
a7fddf30f9
better model names (#517) 2025-07-11 16:42:16 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 (#519) 2025-07-08 00:59:33 -07:00
Adil Hafeez
147908ba7e
make arch-router cluster optional (#518) 2025-07-08 00:33:40 -07:00
Adil Hafeez
7baec20772
release 0.3.2 (#507) 2025-06-13 17:02:20 -07:00
Adil Hafeez
341b14c1be
upgrade envoy to 1.34 (#506) 2025-06-11 15:17:43 -07:00
Adil Hafeez
aa9d747fa9
add support for gemini (#505) 2025-06-11 15:15:00 -07:00
Adil Hafeez
6c53510f49
Introduce hermesllm library to handle llm message translation (#501) 2025-06-10 12:53:27 -07:00