plano

mirror of https://github.com/katanemo/plano.git synced 2026-04-26 01:06:25 +02:00

Author	SHA1	Message	Date
Adil Hafeez	d8b4c800e6	release 0.4.4 (#713 )	2026-01-28 20:45:10 -08:00
Adil Hafeez	062825f26e	add envoy retries (#712 ) * add envoy retries * add missing file * fix tests --------- Co-authored-by: Adil Hafeez <adil.hafeez10@t-mobile.com>	2026-01-28 20:31:01 -08:00
Salman Paracha	2a36dd7376	fixing the build scripts for documentation (#711 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2026-01-28 18:55:35 -08:00
Salman Paracha	2941392ed1	Adding support for wildcard models in the model_providers config (#696 ) * cleaning up plano cli commands * adding support for wildcard model providers * fixing compile errors * fixing bugs related to default model provider, provider hint and duplicates in the model provider list * fixed cargo fmt issues * updating tests to always include the model id * using default for the prompt_gateway path * fixed the model name, as gpt-5-mini-2025-08-07 wasn't in the config * making sure that all aliases and models match the config * fixed the config generator to allow for base_url providers LLMs to include wildcard models * re-ran the models list utility and added a shell script to run it * updating docs to mention wildcard model providers * updated provider_models.json to yaml, added that file to our docs for reference * updating the build docs to use the new root-based build --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2026-01-28 17:47:33 -08:00
Adil Hafeez	da5cbc29b7	release 0.4.3 (#701 )	2026-01-18 00:07:46 -08:00
Tang Quoc Thai	4d53297c17	feat: add passthrough_auth option for forwarding client Authorization header (#687 ) * feat: add passthrough_auth option for forwarding client Authorization header * fix tests * Update comment to reflect upstream forwarding * Apply suggestions from code review --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com> Co-authored-by: Adil Hafeez <adil@katanemo.com>	2026-01-14 15:06:28 -08:00
Adil Hafeez	ab391f96c7	don't include internal models in /v1/models endpoint (#685 )	2026-01-09 16:57:41 -08:00
Salman Paracha	40b9780774	tweaks to web and docs to align to 0.4.2 (#680 ) * tweaks to web and docs to align to 0.4.2 * made our release banner clickable --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2026-01-07 13:51:40 -08:00
Adil Hafeez	b7fba7a97f	release 0.4.2 (#679 )	2026-01-07 13:02:06 -08:00
Salman Paracha	b4543ba56c	Introduce signals change (#655 ) * adding support for signals * reducing false positives for signals like positive interaction * adding docs. Still need to fix the messages list, but waiting on PR #621 * Improve frustration detection: normalize contractions and refine punctuation * Further refine test cases with longer messages * minor doc changes * fixing echo statement for build * fixing the messages construction and using the trait for signals * update signals docs * fixed some minor doc changes * added more tests and fixed documentation. PR 100% ready * made fixes based on PR comments * Optimize latency 1. replace sliding window approach with trigram containment check 2. add code to pre-compute ngrams for patterns * removed some debug statements to make tests easier to read * PR comments to make ObservableStreamProcessor accept optional Vec<Messagges> * fixed PR comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local> Co-authored-by: MeiyuZhong <mariazhong9612@gmail.com> Co-authored-by: nehcgs <54548843+nehcgs@users.noreply.github.com>	2026-01-07 11:20:44 -08:00
Salman Paracha	a764cac869	updated readme with a snippet of code to go along with the descriptio… (#674 ) * updated readme with a snippet of code to go along with the description of the project * updated readme --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2026-01-03 23:22:04 -08:00
Adil Hafeez	f054dbdbe9	update quick start to elevate gateway/proxy example (#671 )	2026-01-02 10:21:38 -08:00
Adil Hafeez	41aa4abaeb	release 0.4.1 (#670 )	2026-01-01 23:39:18 -08:00
Adil Hafeez	77cdc7f6ef	Revert "release 0.4.1 (#666 )" (#669 ) This reverts commit `77df5160d8`.	2025-12-30 15:28:30 -08:00
Adil Hafeez	77df5160d8	release 0.4.1 (#666 )	2025-12-28 14:29:19 -08:00
Adil Hafeez	053e2b3a74	use uv instead of poetry (#663 )	2025-12-26 11:21:42 -08:00
Adil Hafeez	88d14a205b	restructure cli (#656 )	2025-12-25 14:55:29 -08:00
Adil Hafeez	f6fe7b84dc	use new config format for llm routing quick start guide	2025-12-23 19:42:19 -08:00
Adil Hafeez	e8170f76ca	rename to planoai (#650 )	2025-12-23 19:26:51 -08:00
Adil Hafeez	e7ce00b5a7	rename cli to plano (#647 )	2025-12-23 18:37:58 -08:00
Salman Paracha	e224cba3e3	Update docs to Plano (#639 )	2025-12-23 17:14:50 -08:00
Adil Hafeez	15fbb6c3af	plano orchestration using plano orchestration 4b model (#637 )	2025-12-22 18:05:49 -08:00
Salman Paracha	d5a273f740	enable state management for v1/responses (#631 ) * first commit with tests to enable state management via memory * fixed logs to follow the conversational flow a bit better * added support for supabase * added the state_storage_v1_responses flag, and use that to store state appropriately * cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo * fixed mixed inputs from openai v1/responses api (#632) * fixed mixed inputs from openai v1/responses api * removing tracing from model-alias-routing * handling additional input types from openairs --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local> * resolving PR comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2025-12-17 12:18:38 -08:00
Adil Hafeez	8adb9795d8	release 0.3.22 (#629 )	2025-12-11 11:20:19 -08:00
Adil Hafeez	09c0b999b2	release 0.3.21 (#626 )	2025-12-03 17:12:34 -08:00
Adil Hafeez	b01a81927d	release 0.3.20 (#620 )	2025-11-22 19:29:04 -08:00
Salman Paracha	d37af7605c	removing model_server. buh bye (#619 )	2025-11-22 15:04:41 -08:00
Salman Paracha	88c2bd1851	removing model_server python module to brightstaff (function calling) (#615 ) * adding function_calling functionality via rust * fixed rendered YAML file * removed model_server from envoy.template and forwarding traffic to bright_staff * fixed bugs in function_calling.rs that were breaking tests. All good now * updating e2e test to clean up disk usage * removing Arch* models to be used as a default model if one is not specified * if the user sets arch-function base_url we should honor it * fixing demos as we needed to pin to a particular version of huggingface_hub else the chatbot ui wouldn't build * adding a constant for Arch-Function model name * fixing some edge cases with calls made to Arch-Function * fixed JSON parsing issues in function_calling.rs * fixed bug where the raw response from Arch-Function was re-encoded * removed debug from supervisord.conf * commenting out disk cleanup * adding back disk space --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2025-11-22 12:55:00 -08:00
Adil Hafeez	126b029345	release 0.3.18 (#611 )	2025-10-31 12:24:49 -07:00
Salman Paracha	cdfcfb9169	support base_url path for model providers (#608 ) * adding support for base_url * updated docs * fixed tests for config generator * making fixes based on PR comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-10-29 17:08:07 -07:00
Adil Hafeez	f26bb05d35	release 0.3.17 (#604 )	2025-10-24 17:52:15 -07:00
Adil Hafeez	6d70545459	release 0.3.16 (#596 )	2025-10-22 14:43:33 -07:00
Salman Paracha	7a6f87de3e	fixed test and docs for deployment (#595 ) * fixed test and docs for deployment * updating the main logo image --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-10-22 14:13:16 -07:00
Salman Paracha	9407ae6af7	Add support for Amazon Bedrock Converse and ConverseStream (#588 ) * first commit to get Bedrock Converse API working. Next commit support for streaming and binary frames * adding translation from BedrockBinaryFrameDecoder to AnthropicMessagesEvent * Claude Code works with Amazon Bedrock * added tests for openai streaming from bedrock * PR comments fixed * adding support for bedrock in docs as supported provider * cargo fmt * reverted to chatgpt models for claude code routing --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local> Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com>	2025-10-22 11:31:21 -07:00
Adil Hafeez	96e0732089	add support for agents (#564 )	2025-10-14 14:01:11 -07:00
Salman Paracha	6a06d9ac97	add claude code router to the README (#586 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-10-05 13:38:39 -07:00
Salman Paracha	03d8cc1894	fixing docs (#584 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-10-01 22:26:54 -07:00
Salman Paracha	226139e907	adding support for Qwen models and fixed issue with passing PATH vari… (#583 ) * adding support for Qwen models and fixed issue with passing PATH variable * don't need to have qwen in the model alias routing example * fixed base_url for qwen --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-10-01 21:57:58 -07:00
Adil Hafeez	cd563c2706	release 0.3.15 (#579 )	2025-09-30 13:44:11 -07:00
Salman Paracha	045a5e9751	adding support for moonshot and z-ai (#578 ) * adding support for moonshot and z-ai * Revert unwanted changes to arch_config.yaml --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-09-30 12:24:06 -07:00
Adil Hafeez	7df1b8cdb0	release 0.3.14 (#577 )	2025-09-29 23:11:43 -07:00
Adil Hafeez	7ce8d44d8e	release 0.3.13 (#572 )	2025-09-19 11:26:49 -07:00
Salman Paracha	fbe82351c0	Salmanap/fix docs new providers model alias (#571 ) * fixed docs and added ollama as a first-class LLM provider * matching the LLM routing section on the README.md to the docs * updated the section on preference-based routing --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>	2025-09-19 10:19:57 -07:00
Salman Paracha	8d0b468345	draft commit to add support for xAI, TogetherAI, AzureOpenAI (#570 ) * draft commit to add support for xAI, LambdaAI, TogetherAI, AzureOpenAI * fixing failing tests and updating rendered config file * Update arch_config_with_aliases.yaml * adding the AZURE_API_KEY to the GH workflow for e2e * fixing GH secrets * adding validation for azure_openai --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>	2025-09-18 18:36:30 -07:00
Adil Hafeez	118f60eea7	release 0.3.12 (#567 )	2025-09-16 11:56:05 -07:00
Adil Hafeez	1e8c81d8f6	release 0.3.11 (#565 )	2025-09-11 18:44:18 -07:00
Adil Hafeez	1fdde8181a	release 0.3.10 (#555 )	2025-08-13 14:50:10 -07:00
Adil Hafeez	ad4cea227f	release 0.3.9 (#552 )	2025-08-12 13:43:43 -07:00
Adil Hafeez	2639323dab	release 0.3.8 (#550 )	2025-08-11 14:12:17 -07:00
Salman Paracha	62a092fa63	consistent messaging (#546 ) * consistent messaging * updating README --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>	2025-07-28 11:45:07 -07:00

120 commits