Commit graph

483 commits

Author SHA1 Message Date
Salman Paracha
879c8eeff3 fixed streaming from Anthropic Client to OpenAI 2025-09-06 23:16:29 -07:00
Salman Paracha
06c71c1392 moved away from string literals to consts 2025-09-06 10:47:05 -07:00
Salman Paracha
2de75d18db fixing logs 2025-09-05 21:52:46 -07:00
Salman Paracha
e8881c7b8a fixed the dup cloning issue and cleaned up the ProviderRequestType enum and traits 2025-09-05 21:18:55 -07:00
Salman Paracha
c8b59aeda7 removed the raw response debug statement 2025-09-05 12:52:01 -07:00
Salman Paracha
79e391067d updated the openAI client to fix demos 2025-09-05 12:35:22 -07:00
Salman Paracha
ccd2a590bb fixing the tests. python dependency order was broken 2025-09-05 11:23:49 -07:00
Salman Paracha
2895a07088 cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo 2025-09-05 09:09:17 -07:00
Salman Paracha
04208e9cd1 added python client tests for anthropic and openai 2025-09-04 22:36:32 -07:00
Salman Paracha
00c543667d fixed integration tests and cleaned up logs 2025-09-04 21:49:54 -07:00
Salman Paracha
ee52c608f7 fixed test cases and added more structured logs 2025-09-04 19:28:47 -07:00
Salman Paracha
e218b1c380 adding the anthropic key to the preference based routing tests 2025-09-04 18:55:53 -07:00
Salman Paracha
412b5b450a updating the CLI and demos to support anthropic vs. claude 2025-09-04 15:16:21 -07:00
Salman Paracha
ecf453ed70 /v1/messages works with transformations to and from /v1/chat/completions 2025-09-04 15:13:53 -07:00
Salman Paracha
2813a8cfa5 fixing non-streaming responses to transform correctly 2025-09-02 17:42:02 -07:00
Salman Paracha
d4dfbe600f making sure that we convert the raw bytes to the correct provider type upstream 2025-09-02 16:19:45 -07:00
Salman Paracha
c55979307e intentionally removing the headers 2025-08-30 23:00:04 -07:00
Salman Paracha
7c4174a821 fixing json parsing errors 2025-08-30 12:52:59 -07:00
Salman Paracha
041a9eda3a fixed the debug statement that was causing the integration tests for wasm to fail 2025-08-29 18:33:18 -07:00
Salman Paracha
0a0d2c95a3 the serialized bytes length must be set in the response side 2025-08-29 18:18:32 -07:00
Salman Paracha
e7238fb7fd updated the stream_context to update response bytes 2025-08-28 22:55:12 -07:00
Salman Paracha
9f6d2464f6 fixed issues with non-streaming responses 2025-08-24 18:52:48 -07:00
Salman Paracha
77491b4a69 adding some debug logs to help 2025-08-24 14:11:21 -07:00
Salman Paracha
0b41496c45 fixed serialization issues with enums on response 2025-08-24 13:12:15 -07:00
Salman Paracha
2fa983d50d fixed doc tests 2025-08-23 16:48:06 -07:00
Salman Paracha
7345657612 fixed bugs for integration tests 2025-08-23 16:37:52 -07:00
Salman Paracha
9f3a6f71a3 updated tests and added necessary response transformations for Anthropic's message response object 2025-08-22 16:06:48 -07:00
Salman Paracha
e73a9eb61c transformations are working. Now need to add some tests next 2025-08-22 14:36:46 -07:00
Salman Paracha
0aa9243093 pushing draft PR 2025-08-21 22:24:07 -07:00
Salman Paracha
85f10fc75a
Update README.md 2025-08-21 08:53:39 -07:00
Salman Paracha
f169090c3d
Update README.md 2025-08-21 06:40:56 -07:00
Salman Paracha
bcd973ebf3
Update README.md 2025-08-21 06:33:33 -07:00
Salman Paracha
95d28df725
updating readme and see how it flows (#556)
* updating readme and see how it flows

* fixed links
---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
2025-08-21 06:29:47 -07:00
Salman Paracha
89ab51697a
updating the implementation of /v1/chat/completions to use the generi… (#548)
* updating the implementation of /v1/chat/completions to use the generic provider interfaces

* saving changes, although we will need a small re-factor after this as well

* more refactoring changes, getting close

* more refactoring changes to avoid unnecessary re-direction and duplication

* more clean up

* more refactoring

* more refactoring to clean code and make stream_context.rs work

* removing unnecessary trait implementations

* some more clean-up

* fixed bugs

* fixing test cases, and making sure all references to the ChatCompletions* objects point to the new types

* refactored changes to support enum dispatch

* removed the dependency on try_streaming_from_bytes into a try_from trait implementation

* updated readme based on new usage

* updated code based on code review comments

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-2.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
2025-08-20 12:55:29 -07:00
Adil Hafeez
1fdde8181a
release 0.3.10 (#555) 2025-08-13 14:50:10 -07:00
Adil Hafeez
359b6fc4ab
update base image to python3.13 (#554) 2025-08-13 14:20:46 -07:00
Adil Hafeez
5908ca6a16
publish to ghcr (#553) 2025-08-13 10:35:26 -07:00
Adil Hafeez
ad4cea227f
release 0.3.9 (#552) 2025-08-12 13:43:43 -07:00
Adil Hafeez
950c9b443c
fix cve_2025-6020 by removing libpam (#551)
* fix cve_2025-6020 by removing libpam

* Apply suggestions from code review

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-08-12 13:20:04 -07:00
Adil Hafeez
2639323dab
release 0.3.8 (#550) 2025-08-11 14:12:17 -07:00
Adil Hafeez
202409cc9a
update torch==2.6.0 (#526) 2025-08-11 13:23:40 -07:00
Salman Paracha
93ff4d7b1f
pushing new apis module for hermes (#547) 2025-08-07 12:42:09 -07:00
Salman Paracha
62a092fa63
consistent messaging (#546)
* consistent messaging

* updating README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-28 11:45:07 -07:00
Matt Kafonek
5d5762ac20
swap out python to sys.executable to support uv tool install workflows (#544) 2025-07-28 00:48:35 -07:00
Musa
d215724864
Update llm_provider.rst (#543) 2025-07-27 09:26:12 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 (#542) 2025-07-25 19:08:24 -07:00
Adil Hafeez
04c7e5a175
bug fix - allow image content to pass through (#539)
fixes https://github.com/katanemo/archgw/issues/535
2025-07-25 01:22:06 -07:00
Adil Hafeez
92a425facd
release 0.3.6 (#536) 2025-07-22 12:48:20 -07:00
Adil Hafeez
d341f4365b
In request path use same format for usage preferences as arch_config (#533) 2025-07-21 18:31:19 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 (#534) 2025-07-21 10:09:22 -07:00