Salman Paracha
ccd2a590bb
fixing the tests. python dependency order was broken
2025-09-05 11:23:49 -07:00
Salman Paracha
2895a07088
cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo
2025-09-05 09:09:17 -07:00
Salman Paracha
04208e9cd1
added python client tests for anthropic and openai
2025-09-04 22:36:32 -07:00
Salman Paracha
00c543667d
fixed integration tests and cleaned up logs
2025-09-04 21:49:54 -07:00
Salman Paracha
ee52c608f7
fixed test cases and added more structured logs
2025-09-04 19:28:47 -07:00
Salman Paracha
e218b1c380
adding the anthropic key to the preference based routing tests
2025-09-04 18:55:53 -07:00
Salman Paracha
412b5b450a
updating the CLI and demos to support anthropic vs. claude
2025-09-04 15:16:21 -07:00
Salman Paracha
ecf453ed70
/v1/messages works with transformations to and from /v1/chat/completions
2025-09-04 15:13:53 -07:00
Salman Paracha
2813a8cfa5
fixing non-streaming responses to tranform correctly
2025-09-02 17:42:02 -07:00
Salman Paracha
d4dfbe600f
making sure that we convert the raw bytes to the correct provider type upstream
2025-09-02 16:19:45 -07:00
Salman Paracha
c55979307e
intentionally removing the headers
2025-08-30 23:00:04 -07:00
Salman Paracha
7c4174a821
fixing json parsing errors
2025-08-30 12:52:59 -07:00
Salman Paracha
041a9eda3a
fixed the debug statement that was causing the integration tests for wasm to fail
2025-08-29 18:33:18 -07:00
Salman Paracha
0a0d2c95a3
the serialized bytes length must be set in the response side
2025-08-29 18:18:32 -07:00
Salman Paracha
e7238fb7fd
updated the stream_context to update response bytes
2025-08-28 22:55:12 -07:00
Salman Paracha
9f6d2464f6
fixed issues with non-streaming responses
2025-08-24 18:52:48 -07:00
Salman Paracha
77491b4a69
adding some debug logs to help
2025-08-24 14:11:21 -07:00
Salman Paracha
0b41496c45
fixed serialization issues with enums on response
2025-08-24 13:12:15 -07:00
Salman Paracha
2fa983d50d
fixed doc tests
2025-08-23 16:48:06 -07:00
Salman Paracha
7345657612
fixed bugs for integration tests
2025-08-23 16:37:52 -07:00
Salman Paracha
9f3a6f71a3
updated tests and added necessary response transformations for Anthropics' message response object
2025-08-22 16:06:48 -07:00
Salman Paracha
e73a9eb61c
transformations are working. Now need to add some tests next
2025-08-22 14:36:46 -07:00
Salman Paracha
0aa9243093
pushing draft PR
2025-08-21 22:24:07 -07:00
Salman Paracha
85f10fc75a
Update README.md
2025-08-21 08:53:39 -07:00
Salman Paracha
f169090c3d
Update README.md
2025-08-21 06:40:56 -07:00
Salman Paracha
bcd973ebf3
Update README.md
2025-08-21 06:33:33 -07:00
Salman Paracha
95d28df725
updating readme and see how it flows ( #556 )
...
* updating readme and see how it flows
* fixed links
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
2025-08-21 06:29:47 -07:00
Salman Paracha
89ab51697a
updating the implementation of /v1/chat/completions to use the generi… ( #548 )
...
* updating the implementation of /v1/chat/completions to use the generic provider interfaces
* saving changes, although we will need a small re-factor after this as well
* more refactoring changes, getting close
* more refactoring changes to avoid unecessary re-direction and duplication
* more clean up
* more refactoring
* more refactoring to clean code and make stream_context.rs work
* removing unecessary trait implemenations
* some more clean-up
* fixed bugs
* fixing test cases, and making sure all references to the ChatCOmpletions* objects point to the new types
* refactored changes to support enum dispatch
* removed the dependency on try_streaming_from_bytes into a try_from trait implementation
* updated readme based on new usage
* updated code based on code review comments
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-2.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
2025-08-20 12:55:29 -07:00
Adil Hafeez
1fdde8181a
release 0.3.10 ( #555 )
2025-08-13 14:50:10 -07:00
Adil Hafeez
359b6fc4ab
update base image to python3.13 ( #554 )
2025-08-13 14:20:46 -07:00
Adil Hafeez
5908ca6a16
publish to ghrc ( #553 )
2025-08-13 10:35:26 -07:00
Adil Hafeez
ad4cea227f
release 0.3.9 ( #552 )
2025-08-12 13:43:43 -07:00
Adil Hafeez
950c9b443c
fix cve_2025-6020 by removing libpam ( #551 )
...
* fix cve_2025-6020 by removing libpam
* Apply suggestions from code review
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-08-12 13:20:04 -07:00
Adil Hafeez
2639323dab
release 0.3.8 ( #550 )
2025-08-11 14:12:17 -07:00
Adil Hafeez
202409cc9a
update torch==2.6.0 ( #526 )
2025-08-11 13:23:40 -07:00
Salman Paracha
93ff4d7b1f
pushing new apis module for hermes ( #547 )
2025-08-07 12:42:09 -07:00
Salman Paracha
62a092fa63
consistent messaging ( #546 )
...
* consistent messaging
* updating README
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-28 11:45:07 -07:00
Matt Kafonek
5d5762ac20
swap out python to sys.executable to support uv tool install workflows ( #544 )
2025-07-28 00:48:35 -07:00
Musa
d215724864
Update llm_provider.rst ( #543 )
2025-07-27 09:26:12 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 ( #542 )
2025-07-25 19:08:24 -07:00
Adil Hafeez
04c7e5a175
bug fix - allow image content to pass through ( #539 )
...
fixes https://github.com/katanemo/archgw/issues/535
2025-07-25 01:22:06 -07:00
Adil Hafeez
92a425facd
release 0.3.6 ( #536 )
2025-07-22 12:48:20 -07:00
Adil Hafeez
d341f4365b
In request path use same format for usage preferences as arch_config ( #533 )
2025-07-21 18:31:19 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 ( #534 )
2025-07-21 10:09:22 -07:00
Adil Hafeez
83f4d33434
refactor logging in brightstaff ( #532 )
...
refactor logs, move unnecessary info log statements to debug and start logging latest chat completion message to log
2025-07-17 16:00:04 -07:00
Adil Hafeez
f819ee3507
pass model name in header when a route is selected when using usage preferences ( #531 )
2025-07-17 13:41:58 -07:00
Salman Paracha
2340a45353
chatgpt.com updated its backend api path. fixing ( #530 )
2025-07-14 21:20:23 -07:00
Salman Paracha
5e65572573
updating the messaging to call ourselves the edge and AI gateway for … ( #527 )
...
* updating the messaging to call ourselves the edge and AI gateway for agents
* updating README to tidy up some language
* updating README to tidy up some language
* updating README to tidy up some language
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-07-12 03:25:09 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 ( #525 )
...
* release 0.3.4
* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
a7fddf30f9
better model names ( #517 )
2025-07-11 16:42:16 -07:00