Commit graph

497 commits

Author SHA1 Message Date
Adil Hafeez
0c76f2e6e7
more changes 2025-09-18 11:08:51 -07:00
Adil Hafeez
a73c8d1dc1
more changes 2025-09-17 18:11:37 -07:00
Adil Hafeez
a3f93de85d
add retry 2025-09-17 17:53:39 -07:00
Adil Hafeez
71658ddbd9
Update docker-push-main.yml 2025-09-17 14:32:31 -07:00
Adil Hafeez
06069bdee4
publish image 2025-09-17 13:03:37 -07:00
Adil Hafeez
9758354e33
cargo clippy and fmt 2025-09-17 11:15:42 -07:00
Adil Hafeez
0c6600ac47
cargo fmt brightstaff 2025-09-17 11:08:08 -07:00
Adil Hafeez
2229f0d4d4
add tests and some refactor 2025-09-17 11:03:51 -07:00
Adil Hafeez
08471d8adf
add streaming 2025-09-17 09:39:10 -07:00
Adil Hafeez
4588787427
disable rust integration tests 2025-09-17 01:11:36 -07:00
Adil Hafeez
124b910746
fix schema validation 2025-09-17 01:10:06 -07:00
Adil Hafeez
c173074864
more changes 2025-09-17 00:57:26 -07:00
Adil Hafeez
6cfce60501
add more changes 2025-09-16 15:25:47 -07:00
Adil Hafeez
c1757bec88
Merge branch 'main' into adil/agent_format 2025-09-16 14:54:43 -07:00
Adil Hafeez
3eb6af8829
add default implementation for common openai types (#568) 2025-09-16 12:48:07 -07:00
Adil Hafeez
709a25686f
pending changes 2025-09-16 12:02:19 -07:00
Adil Hafeez
118f60eea7
release 0.3.12 (#567) 2025-09-16 11:56:05 -07:00
Adil Hafeez
71e4435500
more changes 2025-09-16 11:23:34 -07:00
Salman Paracha
4eb2b410c5
adding support for model aliases in archgw (#566)
* adding support for model aliases in archgw

* fixed PR based on feedback

* removing README. Not relevant for PR

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>
2025-09-16 11:12:08 -07:00
Adil Hafeez
e57000000d
more changes 2025-09-15 16:01:48 -07:00
Adil Hafeez
a016212588
more changes 2025-09-15 13:18:52 -07:00
Salman Paracha
f5f33f6de2
fixed csv parsing 2025-09-12 11:52:36 -07:00
Adil Hafeez
4e9df76452
Merge branch 'main' into adil/agent_format 2025-09-12 10:26:20 -07:00
Adil Hafeez
1e8c81d8f6
release 0.3.11 (#565) 2025-09-11 18:44:18 -07:00
Adil Hafeez
093834bb05
fix more 2025-09-11 15:55:25 -07:00
Salman Paracha
32838584cf
changes to the agents 2025-09-11 13:25:09 -07:00
Adil Hafeez
c1e142f55f
push demo changes 2025-09-11 11:35:51 -07:00
Adil Hafeez
34fd8d5f82
add fixed rendered file 2025-09-11 00:00:33 -07:00
Adil Hafeez
9da4628cd0
fix tests 2025-09-10 23:42:23 -07:00
Adil Hafeez
9120e5b7ba
fix tests 2025-09-10 17:10:15 -07:00
Adil Hafeez
33ef00f997
remove file 2025-09-10 16:39:41 -07:00
Adil Hafeez
d327aaa92c
more updates 2025-09-10 16:38:07 -07:00
Adil Hafeez
965de92642
rename 2025-09-10 13:59:05 -07:00
Adil Hafeez
6db2208678
only add file format 2025-09-10 13:51:32 -07:00
Adil Hafeez
36cfa55233
add file format 2025-09-10 13:48:51 -07:00
Adil Hafeez
5df01d73f0
Merge branch 'main' into adil/tmobile_work 2025-09-10 10:47:33 -07:00
Salman Paracha
fb0581fd39
add support for v1/messages and transformations (#558)
* pushing draft PR

* transformations are working. Now need to add some tests next

* updated tests and added necessary response transformations for Anthropic's message response object

* fixed bugs for integration tests

* fixed doc tests

* fixed serialization issues with enums on response

* adding some debug logs to help

* fixed issues with non-streaming responses

* updated the stream_context to update response bytes

* the serialized bytes length must be set in the response side

* fixed the debug statement that was causing the integration tests for wasm to fail

* fixing json parsing errors

* intentionally removing the headers

* making sure that we convert the raw bytes to the correct provider type upstream

* fixing non-streaming responses to transform correctly

* /v1/messages works with transformations to and from /v1/chat/completions

* updating the CLI and demos to support anthropic vs. claude

* adding the anthropic key to the preference based routing tests

* fixed test cases and added more structured logs

* fixed integration tests and cleaned up logs

* added python client tests for anthropic and openai

* cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo

* fixing the tests. python dependency order was broken

* updated the openAI client to fix demos

* removed the raw response debug statement

* fixed the dup cloning issue and cleaned up the ProviderRequestType enum and traits

* fixing logs

* moved away from string literals to consts

* fixed streaming from Anthropic Client to OpenAI

* removed debug statement that would likely trip up integration tests

* fixed integration tests for llm_gateway

* cleaned up test cases and removed unnecessary crates

* fixing comments from PR

* fixed bug whereby we were sending an OpenAIChatCompletions request object to llm_gateway even though the request may have been AnthropicMessages

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-9.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-10.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-41.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>
2025-09-10 07:40:30 -07:00
Adil Hafeez
c3a0874829
add more 2025-09-07 23:26:45 -07:00
Adil Hafeez
22c84fb689
pending tmo work 2025-09-07 23:26:00 -07:00
Salman Paracha
bb71d041a0
Fix formatting in README.md
Fixed formatting issues in the README.md file.
2025-08-29 09:58:24 -07:00
Salman Paracha
c698f2cba2
Improve clarity of routing and orchestration section
Reworded the section on routing and orchestration for clarity and conciseness.
2025-08-29 09:50:03 -07:00
Salman Paracha
dd4e6a7497
Improve clarity of routing and orchestration section 2025-08-29 09:44:52 -07:00
Salman Paracha
8d1046fb3d
Enhance README with detailed routing and orchestration issues 2025-08-29 09:43:32 -07:00
Salman Paracha
85f10fc75a
Update README.md 2025-08-21 08:53:39 -07:00
Salman Paracha
f169090c3d
Update README.md 2025-08-21 06:40:56 -07:00
Salman Paracha
bcd973ebf3
Update README.md 2025-08-21 06:33:33 -07:00
Salman Paracha
95d28df725
updating readme and see how it flows (#556)
* updating readme and see how it flows

* fixed links

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
2025-08-21 06:29:47 -07:00
Salman Paracha
89ab51697a
updating the implementation of /v1/chat/completions to use the generi… (#548)
* updating the implementation of /v1/chat/completions to use the generic provider interfaces

* saving changes, although we will need a small re-factor after this as well

* more refactoring changes, getting close

* more refactoring changes to avoid unnecessary re-direction and duplication

* more clean up

* more refactoring

* more refactoring to clean code and make stream_context.rs work

* removing unnecessary trait implementations

* some more clean-up

* fixed bugs

* fixing test cases, and making sure all references to the ChatCOmpletions* objects point to the new types

* refactored changes to support enum dispatch

* removed the dependency on try_streaming_from_bytes into a try_from trait implementation

* updated readme based on new usage

* updated code based on code review comments

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-2.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
2025-08-20 12:55:29 -07:00
Adil Hafeez
1fdde8181a
release 0.3.10 (#555) 2025-08-13 14:50:10 -07:00
Adil Hafeez
359b6fc4ab
update base image to python3.13 (#554) 2025-08-13 14:20:46 -07:00