plano

mirror of https://github.com/katanemo/plano.git synced 2026-04-25 00:36:34 +02:00

Author	SHA1	Message	Date
Adil Hafeez	f4d65e2469	stream access logs and improve access log format (#581 )	2025-09-30 18:46:13 -07:00
Adil Hafeez	43fceffd93	remove proxy-wasm integration tests (#580 ) We have coverage in e2e tests.	2025-09-30 18:15:18 -07:00
Adil Hafeez	cd563c2706	release 0.3.15 (#579 )	2025-09-30 13:44:11 -07:00
Salman Paracha	045a5e9751	adding support for moonshot and z-ai (#578 ) * adding support for moonshot and z-ai * Revert unwanted changes to arch_config.yaml --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-09-30 12:24:06 -07:00
Adil Hafeez	7df1b8cdb0	release 0.3.14 (#577 )	2025-09-29 23:11:43 -07:00
Salman Paracha	cf23aefddd	fixing README for claude code and adding a helper script to show model selection (#576 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-09-29 21:20:52 -07:00
Salman Paracha	f00870dccb	adding support for claude code routing (#575 ) * fixed for claude code routing. first commit * removing redundant enum tags for cache_control * making sure that claude code can run via the archgw cli * fixing broken config * adding a README.md and updated the cli to use more of our defined patterns for params * fixed config.yaml * minor fixes to make sure PR is clean. Ready to ship * adding claude-sonnet-4-5 to the config * fixes based on PR * fixed alias for README * fixed 400 error handling tests, now that we write temperature to 1.0 for GPT-5 --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-257.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-09-29 19:23:08 -07:00
Salman Paracha	03c2cf6f0d	fixed changes related to max_tokens and processing http error codes like 400 properly (#574 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-257.local>	2025-09-25 17:00:37 -07:00
Adil Hafeez	7ce8d44d8e	release 0.3.13 (#572 )	2025-09-19 11:26:49 -07:00
Salman Paracha	fbe82351c0	Salmanap/fix docs new providers model alias (#571 ) * fixed docs and added ollama as a first-class LLM provider * matching the LLM routing section on the README.md to the docs * updated the section on preference-based routing --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>	2025-09-19 10:19:57 -07:00
Salman Paracha	8d0b468345	draft commit to add support for xAI, TogehterAI, AzureOpenAI (#570 ) * draft commit to add support for xAI, LambdaAI, TogehterAI, AzureOpenAI * fixing failing tests and updating rederend config file * Update arch_config_with_aliases.yaml * adding the AZURE_API_KEY to the GH workflow for e2e * fixing GH secerts * adding valdiating for azure_openai --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-167.local>	2025-09-18 18:36:30 -07:00
Salman Paracha	b56311f458	adding code snippets in a single place for newsletter (#569 ) * adding code snippets in a single place for newsletter * fixing README and run_demo.sh * renaming branch --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>	2025-09-17 01:06:06 -07:00
Adil Hafeez	3eb6af8829	add default implementation for common openai types (#568 )	2025-09-16 12:48:07 -07:00
Adil Hafeez	118f60eea7	release 0.3.12 (#567 )	2025-09-16 11:56:05 -07:00
Salman Paracha	4eb2b410c5	adding support for model aliases in archgw (#566 ) * adding support for model aliases in archgw * fixed PR based on feedback * removing README. Not relevant for PR --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>	2025-09-16 11:12:08 -07:00
Adil Hafeez	1e8c81d8f6	release 0.3.11 (#565 )	2025-09-11 18:44:18 -07:00
Salman Paracha	fb0581fd39	add support for v1/messages and transformations (#558 ) * pushing draft PR * transformations are working. Now need to add some tests next * updated tests and added necessary response transformations for Anthropics' message response object * fixed bugs for integration tests * fixed doc tests * fixed serialization issues with enums on response * adding some debug logs to help * fixed issues with non-streaming responses * updated the stream_context to update response bytes * the serialized bytes length must be set in the response side * fixed the debug statement that was causing the integration tests for wasm to fail * fixing json parsing errors * intentionally removing the headers * making sure that we convert the raw bytes to the correct provider type upstream * fixing non-streaming responses to tranform correctly * /v1/messages works with transformations to and from /v1/chat/completions * updating the CLI and demos to support anthropic vs. claude * adding the anthropic key to the preference based routing tests * fixed test cases and added more structured logs * fixed integration tests and cleaned up logs * added python client tests for anthropic and openai * cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo * fixing the tests. python dependency order was broken * updated the openAI client to fix demos * removed the raw response debug statement * fixed the dup cloning issue and cleaned up the ProviderRequestType enum and traits * fixing logs * moved away from string literals to consts * fixed streaming from Anthropic Client to OpenAI * removed debug statement that would likely trip up integration tests * fixed integration tests for llm_gateway * cleaned up test cases and removed unnecessary crates * fixing comments from PR * fixed bug whereby we were sending an OpenAIChatCompletions request object to llm_gateway even though the request may have been AnthropicMessages --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-9.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-10.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-41.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>	2025-09-10 07:40:30 -07:00
Salman Paracha	bb71d041a0	Fix formatting in README.md Fixed formatting issues in the README.md file.	2025-08-29 09:58:24 -07:00
Salman Paracha	c698f2cba2	Improve clarity of routing and orchestration section Reworded the section on routing and orchestration for clarity and conciseness.	2025-08-29 09:50:03 -07:00
Salman Paracha	dd4e6a7497	Improve clarity of routing and orchestration section	2025-08-29 09:44:52 -07:00
Salman Paracha	8d1046fb3d	Enhance README with detailed routing and orchestration issues	2025-08-29 09:43:32 -07:00
Salman Paracha	85f10fc75a	Update README.md	2025-08-21 08:53:39 -07:00
Salman Paracha	f169090c3d	Update README.md	2025-08-21 06:40:56 -07:00
Salman Paracha	bcd973ebf3	Update README.md	2025-08-21 06:33:33 -07:00
Salman Paracha	95d28df725	updating readme and see how it flows (#556 ) * updating readme and see how it flows * fixed links --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>	2025-08-21 06:29:47 -07:00
Salman Paracha	89ab51697a	updating the implementation of /v1/chat/completions to use the generi… (#548 ) * updating the implementation of /v1/chat/completions to use the generic provider interfaces * saving changes, although we will need a small re-factor after this as well * more refactoring changes, getting close * more refactoring changes to avoid unecessary re-direction and duplication * more clean up * more refactoring * more refactoring to clean code and make stream_context.rs work * removing unecessary trait implemenations * some more clean-up * fixed bugs * fixing test cases, and making sure all references to the ChatCOmpletions* objects point to the new types * refactored changes to support enum dispatch * removed the dependency on try_streaming_from_bytes into a try_from trait implementation * updated readme based on new usage * updated code based on code review comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-2.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>	2025-08-20 12:55:29 -07:00
Adil Hafeez	1fdde8181a	release 0.3.10 (#555 )	2025-08-13 14:50:10 -07:00
Adil Hafeez	359b6fc4ab	update base image to python3.13 (#554 )	2025-08-13 14:20:46 -07:00
Adil Hafeez	5908ca6a16	publish to ghrc (#553 )	2025-08-13 10:35:26 -07:00
Adil Hafeez	ad4cea227f	release 0.3.9 (#552 )	2025-08-12 13:43:43 -07:00
Adil Hafeez	950c9b443c	fix cve_2025-6020 by removing libpam (#551 ) * fix cve_2025-6020 by removing libpam * Apply suggestions from code review Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-08-12 13:20:04 -07:00
Adil Hafeez	2639323dab	release 0.3.8 (#550 )	2025-08-11 14:12:17 -07:00
Adil Hafeez	202409cc9a	update torch==2.6.0 (#526 )	2025-08-11 13:23:40 -07:00
Salman Paracha	93ff4d7b1f	pushing new apis module for hermes (#547 )	2025-08-07 12:42:09 -07:00
Salman Paracha	62a092fa63	consistent messaging (#546 ) * consistent messaging * updating README --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>	2025-07-28 11:45:07 -07:00
Matt Kafonek	5d5762ac20	swap out python to sys.executable to support uv tool install workflows (#544 )	2025-07-28 00:48:35 -07:00
Musa	d215724864	Update llm_provider.rst (#543 )	2025-07-27 09:26:12 -07:00
Adil Hafeez	ac3fb4cb5b	release 0.3.7 (#542 )	2025-07-25 19:08:24 -07:00
Adil Hafeez	04c7e5a175	bug fix - allow image content to pass through (#539 ) fixes https://github.com/katanemo/archgw/issues/535	2025-07-25 01:22:06 -07:00
Adil Hafeez	92a425facd	release 0.3.6 (#536 )	2025-07-22 12:48:20 -07:00
Adil Hafeez	d341f4365b	In request path use same format for usage preferences as arch_config (#533 )	2025-07-21 18:31:19 -07:00
Adil Hafeez	79a62fffe8	release 0.3.5 (#534 )	2025-07-21 10:09:22 -07:00
Adil Hafeez	83f4d33434	refactor logging in brightstaff (#532 ) refactor logs, move unnecessary info log statements to debug and start logging latest chat completion message to log	2025-07-17 16:00:04 -07:00
Adil Hafeez	f819ee3507	pass model name in header when a route is selected when using usage preferences (#531 )	2025-07-17 13:41:58 -07:00
Salman Paracha	2340a45353	chatgpt.com updated its backend api path. fixing (#530 )	2025-07-14 21:20:23 -07:00
Salman Paracha	5e65572573	updating the messaging to call ourselves the edge and AI gateway for … (#527 ) * updating the messaging to call ourselves the edge and AI gateway for agents * updating README to tidy up some language * updating README to tidy up some language * updating README to tidy up some language --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>	2025-07-12 03:25:09 -07:00
Adil Hafeez	14f19f02a7	release 0.3.4 (#525 ) * release 0.3.4 * update lock file	2025-07-11 17:24:21 -07:00
Adil Hafeez	a7fddf30f9	better model names (#517 )	2025-07-11 16:42:16 -07:00
Salman Paracha	4e2355965b	removing the tabs permission and reverting to window.postMessage (#524 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>	2025-07-11 12:07:32 -07:00
Salman Paracha	5ea51c8cbd	added the tabs permission again (#521 )	2025-07-08 17:50:41 -07:00

... 3 4 5 6 7 ...

675 commits