Commit graph

129 commits

Author SHA1 Message Date
Salman Paracha
45d4718961 final changes to orchestration 2025-12-23 16:31:34 -08:00
Salman Paracha
8cfefa38a2 orchestration guide changes 2025-12-23 16:31:34 -08:00
Adil Hafeez
e3f7ae1cbe
fix reference to filter type 2025-12-23 16:22:03 -08:00
Adil Hafeez
c3ace314ce
bring back prompt targets 2025-12-23 16:15:02 -08:00
Adil Hafeez
5b5312a7c1
update refernce config 2025-12-23 15:46:35 -08:00
Adil Hafeez
0533987a2f
add input_guards and update prompt guards section 2025-12-23 15:18:10 -08:00
Musa
bbadd61de0 Create custom.css 2025-12-23 14:22:33 -08:00
Musa
b228f152f7 feat(docs): observability updated + logo inversion issue 2025-12-23 14:22:13 -08:00
Musa
ec17252403 feat(docs): refresh routing models 2025-12-23 14:02:20 -08:00
Musa
9c515116cb feat(docs): refresh monitoring page 2025-12-23 12:41:24 -08:00
Musa
217048fabc feat(docs): refresh deployment page 2025-12-23 12:39:21 -08:00
Musa
a381bd46bb feat(docs): refresh llm providers 2025-12-23 12:34:37 -08:00
Musa
ef4158010a feat(docs): include llms.txt 2025-12-23 12:09:25 -08:00
Musa
0d605c685d fix(demo): claude/openai misconfig in docs 2025-12-23 11:08:10 -08:00
Salman Paracha
d9f948276e Merge branch 'main' into salmanap/plano-docs-cli 2025-12-22 18:06:26 -08:00
Adil Hafeez
15fbb6c3af
plano orchestration using plano orchestration 4b model (#637) 2025-12-22 18:05:49 -08:00
Salman Paracha
a70a965975 updating the types of filters we support 2025-12-22 17:47:51 -08:00
Salman Paracha
a275d888a7 remove more references of Arch 2025-12-22 16:34:46 -08:00
Salman Paracha
0890e5c5ed removed more references of the word Arch 2025-12-22 16:15:01 -08:00
Salman Paracha
cdfd8d5726 fixing overview section 2025-12-22 15:47:27 -08:00
Salman Paracha
75867ff789 updated the google analytics tag for docs.planoai.dev 2025-12-22 15:11:13 -08:00
Salman Paracha
3802350213 minor changes to the overview section 2025-12-22 14:21:13 -08:00
Salman Paracha
e0404d305c several improvements to docs. TODOS: Tracing and Filters 2025-12-21 22:10:32 -08:00
Salman Paracha
41ae9e8d06 removed prompt guards altogether from our repo - use filters 2025-12-20 10:50:39 -08:00
Musa
205c3afbf3 docs: include new orchestration guide 2025-12-19 18:23:11 -08:00
Salman Paracha
28fd430efd updating plano docs, README and CLI 2025-12-19 17:45:51 -08:00
Salman Paracha
d5a273f740
enable state management for v1/responses (#631)
* first commit with tests to enable state mamangement via memory

* fixed logs to follow the conversational flow a bit better

* added support for supabase

* added the state_storage_v1_responses flag, and use that to store state appropriately

* cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo

* fixed mixed inputs from openai v1/responses api (#632)

* fixed mixed inputs from openai v1/responses api

* removing tracing from model-alias-rouing

* handling additional input types from openairs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>

* resolving PR comments

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2025-12-17 12:18:38 -08:00
Adil Hafeez
8adb9795d8
release 0.3.22 (#629) 2025-12-11 11:20:19 -08:00
Adil Hafeez
09c0b999b2
release 0.3.21 (#626) 2025-12-03 17:12:34 -08:00
Adil Hafeez
b01a81927d
release 0.3.20 (#620) 2025-11-22 19:29:04 -08:00
Salman Paracha
d37af7605c
removing model_server. buh bye (#619) 2025-11-22 15:04:41 -08:00
Salman Paracha
88c2bd1851
removing model_server python module to brightstaff (function calling) (#615)
* adding function_calling functionality via rust

* fixed rendered YAML file

* removed model_server from envoy.template and forwarding traffic to bright_staff

* fixed bugs in function_calling.rs that were breaking tests. All good now

* updating e2e test to clean up disk usage

* removing Arch* models to be used as a default model if one is not specified

* if the user sets arch-function base_url we should honor it

* fixing demos as we needed to pin to a particular version of huggingface_hub else the chatbot ui wouldn't build

* adding a constant for Arch-Function model name

* fixing some edge cases with calls made to Arch-Function

* fixed JSON parsing issues in function_calling.rs

* fixed bug where the raw response from Arch-Function was re-encoded

* removed debug from supervisord.conf

* commenting out disk cleanup

* adding back disk space

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>
2025-11-22 12:55:00 -08:00
Adil Hafeez
126b029345
release 0.3.18 (#611) 2025-10-31 12:24:49 -07:00
Salman Paracha
cdfcfb9169
support base_url path for model providers (#608)
* adding support for base_url

* updated docs

* fixed tests for config generator

* making fixes based on PR comments

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-29 17:08:07 -07:00
Adil Hafeez
f26bb05d35
release 0.3.17 (#604) 2025-10-24 17:52:15 -07:00
Adil Hafeez
6d70545459
release 0.3.16 (#596) 2025-10-22 14:43:33 -07:00
Salman Paracha
7a6f87de3e
fixed test and docs for deployment (#595)
* fixed test and docs for deployment

* updating the main logo image

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-22 14:13:16 -07:00
Salman Paracha
9407ae6af7
Add support for Amazon Bedrock Converse and ConverseStream (#588)
* first commit to get Bedrock Converse API working. Next commit support for streaming and binary frames

* adding translation from BedrockBinaryFrameDecoder to AnthropicMessagesEvent

* Claude Code works with Amazon Bedrock

* added tests for openai streaming from bedrock

* PR comments fixed

* adding support for bedrock in docs as supported provider

* cargo fmt

* revertted to chatgpt models for claude code routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com>
2025-10-22 11:31:21 -07:00
Adil Hafeez
96e0732089
add support for agents (#564) 2025-10-14 14:01:11 -07:00
Salman Paracha
6a06d9ac97
add claude code router to the README (#586)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-05 13:38:39 -07:00
Salman Paracha
03d8cc1894
fixing docs (#584)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-01 22:26:54 -07:00
Salman Paracha
226139e907
adding support for Qwen models and fixed issue with passing PATH vari… (#583)
* adding support for Qwen models and fixed issue with passing PATH variable

* don't need to have qwen in the model alias routing example

* fixed base_url for qwen

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-10-01 21:57:58 -07:00
Adil Hafeez
cd563c2706
release 0.3.15 (#579) 2025-09-30 13:44:11 -07:00
Salman Paracha
045a5e9751
adding support for moonshot and z-ai (#578)
* adding support for moonshot and z-ai

* Revert unwanted changes to arch_config.yaml

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>
2025-09-30 12:24:06 -07:00
Adil Hafeez
7df1b8cdb0
release 0.3.14 (#577) 2025-09-29 23:11:43 -07:00
Adil Hafeez
7ce8d44d8e
release 0.3.13 (#572) 2025-09-19 11:26:49 -07:00
Salman Paracha
fbe82351c0
Salmanap/fix docs new providers model alias (#571)
* fixed docs and added ollama as a first-class LLM provider

* matching the LLM routing section on the README.md to the docs

* updated the section on preference-based routing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
2025-09-19 10:19:57 -07:00
Salman Paracha
8d0b468345
draft commit to add support for xAI, TogehterAI, AzureOpenAI (#570)
* draft commit to add support for xAI, LambdaAI, TogehterAI, AzureOpenAI

* fixing failing tests and updating rederend config file

* Update arch_config_with_aliases.yaml

* adding the AZURE_API_KEY to the GH workflow for e2e

* fixing GH secerts

* adding valdiating for azure_openai

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>
2025-09-18 18:36:30 -07:00
Adil Hafeez
118f60eea7
release 0.3.12 (#567) 2025-09-16 11:56:05 -07:00
Adil Hafeez
1e8c81d8f6
release 0.3.11 (#565) 2025-09-11 18:44:18 -07:00