Commit graph

90 commits

Author SHA1 Message Date
Adil Hafeez
77cdc7f6ef
Revert "release 0.4.1 (#666)" (#669)
This reverts commit 77df5160d8.
2025-12-30 15:28:30 -08:00
Adil Hafeez
77df5160d8
release 0.4.1 (#666) 2025-12-28 14:29:19 -08:00
Adil Hafeez
6eb9d07e7e
use uv to publish planoai to pypi (#665) 2025-12-28 14:14:27 -08:00
Adil Hafeez
4c4bc99e31
set cli package to planoai instead of cli (#664) 2025-12-27 23:49:41 -08:00
Adil Hafeez
053e2b3a74
use uv instead of poetry (#663) 2025-12-26 11:21:42 -08:00
Adil Hafeez
47b2658b28
update rust to 1.92 (#661) 2025-12-25 20:46:49 -08:00
Adil Hafeez
5557372789
publish planoai package from gh action (#657)
* publish planoai package from gh action

* fix comments
2025-12-25 18:18:09 -08:00
Adil Hafeez
88d14a205b
restructure cli (#656) 2025-12-25 14:55:29 -08:00
Adil Hafeez
e7ce00b5a7
rename cli to plano (#647) 2025-12-23 18:37:58 -08:00
Salman Paracha
a79f55f313
Improve end to end tracing (#628)
* adding canonical tracing support via bright-staff

* improved formatting for tools in the traces

* removing anthropic from the currency exchange demo

* using Envoy to transport traces, not calling OTEL directly

* moving otel collcetor cluster outside tracing if/else

* minor fixes to not write to the OTEL collector if tracing is disabled

* fixed PR comments and added more trace attributes

* more fixes based on PR comments

* more clean up based on PR comments

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2025-12-11 15:21:57 -08:00
Adil Hafeez
8adb9795d8
release 0.3.22 (#629) 2025-12-11 11:20:19 -08:00
Adil Hafeez
09c0b999b2
release 0.3.21 (#626) 2025-12-03 17:12:34 -08:00
Salman Paracha
a448c6e9cb
Add support for v1/responses API (#622)
* making first commit. still need to work on streaming respones

* making first commit. still need to work on streaming respones

* stream buffer implementation with tests

* adding grok API keys to workflow

* fixed changes based on code review

* adding support for bedrock models

* fixed issues with translation to claude code

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2025-12-03 14:58:26 -08:00
Adil Hafeez
b01a81927d
release 0.3.20 (#620) 2025-11-22 19:29:04 -08:00
Salman Paracha
d37af7605c
removing model_server. buh bye (#619) 2025-11-22 15:04:41 -08:00
Salman Paracha
88c2bd1851
removing model_server python module to brightstaff (function calling) (#615)
* adding function_calling functionality via rust

* fixed rendered YAML file

* removed model_server from envoy.template and forwarding traffic to bright_staff

* fixed bugs in function_calling.rs that were breaking tests. All good now

* updating e2e test to clean up disk usage

* removing Arch* models to be used as a default model if one is not specified

* if the user sets arch-function base_url we should honor it

* fixing demos as we needed to pin to a particular version of huggingface_hub else the chatbot ui wouldn't build

* adding a constant for Arch-Function model name

* fixing some edge cases with calls made to Arch-Function

* fixed JSON parsing issues in function_calling.rs

* fixed bug where the raw response from Arch-Function was re-encoded

* removed debug from supervisord.conf

* commenting out disk cleanup

* adding back disk space

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2025-11-22 12:55:00 -08:00
Adil Hafeez
126b029345
release 0.3.18 (#611) 2025-10-31 12:24:49 -07:00
Branch Vincent
0a7e932837
support python 3.14 (#605)
* add python 3.14 to ci

* allow torch 2.9 for python 3.14
2025-10-30 09:17:31 -07:00
Adil Hafeez
f26bb05d35
release 0.3.17 (#604) 2025-10-24 17:52:15 -07:00
Branch Vincent
662546481a
move pytest to dev deps and migrate to poetry 2 (#602)
* move pytest to dev deps

* migrate to poetry 2 and standard metadata
2025-10-24 15:58:54 -07:00
Adil Hafeez
6d70545459
release 0.3.16 (#596) 2025-10-22 14:43:33 -07:00
Salman Paracha
9407ae6af7
Add support for Amazon Bedrock Converse and ConverseStream (#588)
* first commit to get Bedrock Converse API working. Next commit support for streaming and binary frames

* adding translation from BedrockBinaryFrameDecoder to AnthropicMessagesEvent

* Claude Code works with Amazon Bedrock

* added tests for openai streaming from bedrock

* PR comments fixed

* adding support for bedrock in docs as supported provider

* cargo fmt

* revertted to chatgpt models for claude code routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com>
2025-10-22 11:31:21 -07:00
Adil Hafeez
c4a4f3f554
resume publishing to github container registry 2025-10-14 15:17:10 -07:00
Adil Hafeez
c6c8119936 disable push to github docker repository from main 2025-10-03 12:20:02 -07:00
Adil Hafeez
43fceffd93
remove proxy-wasm integration tests (#580)
We have coverage in e2e tests.
2025-09-30 18:15:18 -07:00
Adil Hafeez
cd563c2706
release 0.3.15 (#579) 2025-09-30 13:44:11 -07:00
Adil Hafeez
7df1b8cdb0
release 0.3.14 (#577) 2025-09-29 23:11:43 -07:00
Adil Hafeez
7ce8d44d8e
release 0.3.13 (#572) 2025-09-19 11:26:49 -07:00
Salman Paracha
8d0b468345
draft commit to add support for xAI, TogehterAI, AzureOpenAI (#570)
* draft commit to add support for xAI, LambdaAI, TogehterAI, AzureOpenAI

* fixing failing tests and updating rederend config file

* Update arch_config_with_aliases.yaml

* adding the AZURE_API_KEY to the GH workflow for e2e

* fixing GH secerts

* adding valdiating for azure_openai

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
2025-09-18 18:36:30 -07:00
Adil Hafeez
118f60eea7
release 0.3.12 (#567) 2025-09-16 11:56:05 -07:00
Adil Hafeez
1e8c81d8f6
release 0.3.11 (#565) 2025-09-11 18:44:18 -07:00
Salman Paracha
fb0581fd39
add support for v1/messages and transformations (#558)
* pushing draft PR

* transformations are working. Now need to add some tests next

* updated tests and added necessary response transformations for Anthropics' message response object

* fixed bugs for integration tests

* fixed doc tests

* fixed serialization issues with enums on response

* adding some debug logs to help

* fixed issues with non-streaming responses

* updated the stream_context to update response bytes

* the serialized bytes length must be set in the response side

* fixed the debug statement that was causing the integration tests for wasm to fail

* fixing json parsing errors

* intentionally removing the headers

* making sure that we convert the raw bytes to the correct provider type upstream

* fixing non-streaming responses to tranform correctly

* /v1/messages works with transformations to and from /v1/chat/completions

* updating the CLI and demos to support anthropic vs. claude

* adding the anthropic key to the preference based routing tests

* fixed test cases and added more structured logs

* fixed integration tests and cleaned up logs

* added python client tests for anthropic and openai

* cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo

* fixing the tests. python dependency order was broken

* updated the openAI client to fix demos

* removed the raw response debug statement

* fixed the dup cloning issue and cleaned up the ProviderRequestType enum and traits

* fixing logs

* moved away from string literals to consts

* fixed streaming from Anthropic Client to OpenAI

* removed debug statement that would likely trip up integration tests

* fixed integration tests for llm_gateway

* cleaned up test cases and removed unnecessary crates

* fixing comments from PR

* fixed bug whereby we were sending an OpenAIChatCompletions request object to llm_gateway even though the request may have been AnthropicMessages

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-9.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-10.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-41.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>
2025-09-10 07:40:30 -07:00
Adil Hafeez
1fdde8181a
release 0.3.10 (#555) 2025-08-13 14:50:10 -07:00
Adil Hafeez
5908ca6a16
publish to ghrc (#553) 2025-08-13 10:35:26 -07:00
Adil Hafeez
ad4cea227f
release 0.3.9 (#552) 2025-08-12 13:43:43 -07:00
Adil Hafeez
2639323dab
release 0.3.8 (#550) 2025-08-11 14:12:17 -07:00
Adil Hafeez
202409cc9a
update torch==2.6.0 (#526) 2025-08-11 13:23:40 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 (#542) 2025-07-25 19:08:24 -07:00
Adil Hafeez
92a425facd
release 0.3.6 (#536) 2025-07-22 12:48:20 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 (#534) 2025-07-21 10:09:22 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 (#525)
* release 0.3.4

* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
a7fddf30f9
better model names (#517) 2025-07-11 16:42:16 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 (#519) 2025-07-08 00:59:33 -07:00
Adil Hafeez
7baec20772
release 0.3.2 (#507) 2025-06-13 17:02:20 -07:00
Adil Hafeez
2e47d41a8c
Add ARCH_API_KEY in preference based routing demo (#498) 2025-05-31 01:52:25 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 (#495) 2025-05-30 17:47:59 -07:00
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing (#491) 2025-05-30 02:14:37 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 (#483) 2025-05-23 09:52:23 -07:00
Adil Hafeez
27c0f2fdce
Introduce brightstaff a new terminal service for llm routing (#477) 2025-05-19 09:59:22 -07:00
Adil Hafeez
9c803f4d69
release 0.2.8 (#472) 2025-04-21 17:02:36 -07:00