plano

mirror of https://github.com/katanemo/plano.git synced 2026-06-17 15:25:17 +02:00

Author	SHA1	Message	Date
Salman Paracha	e8881c7b8a	fixed the dup cloning issue and cleaned up the ProviderRequestType enum and traits	2025-09-05 21:18:55 -07:00
Salman Paracha	c8b59aeda7	removed the raw response debug statement	2025-09-05 12:52:01 -07:00
Salman Paracha	2895a07088	cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo	2025-09-05 09:09:17 -07:00
Salman Paracha	00c543667d	fixed integration tests and cleaned up logs	2025-09-04 21:49:54 -07:00
Salman Paracha	ee52c608f7	fixed test cases and added more structured logs	2025-09-04 19:28:47 -07:00
Salman Paracha	ecf453ed70	/v1/messages works with transformations to and from /v1/chat/completions	2025-09-04 15:13:53 -07:00
Salman Paracha	2813a8cfa5	fixing non-streaming responses to tranform correctly	2025-09-02 17:42:02 -07:00
Salman Paracha	d4dfbe600f	making sure that we convert the raw bytes to the correct provider type upstream	2025-09-02 16:19:45 -07:00
Salman Paracha	c55979307e	intentionally removing the headers	2025-08-30 23:00:04 -07:00
Salman Paracha	7c4174a821	fixing json parsing errors	2025-08-30 12:52:59 -07:00
Salman Paracha	041a9eda3a	fixed the debug statement that was causing the integration tests for wasm to fail	2025-08-29 18:33:18 -07:00
Salman Paracha	0a0d2c95a3	the serialized bytes length must be set in the response side	2025-08-29 18:18:32 -07:00
Salman Paracha	e7238fb7fd	updated the stream_context to update response bytes	2025-08-28 22:55:12 -07:00
Salman Paracha	9f6d2464f6	fixed issues with non-streaming responses	2025-08-24 18:52:48 -07:00
Salman Paracha	77491b4a69	adding some debug logs to help	2025-08-24 14:11:21 -07:00
Salman Paracha	0b41496c45	fixed serialization issues with enums on response	2025-08-24 13:12:15 -07:00
Salman Paracha	2fa983d50d	fixed doc tests	2025-08-23 16:48:06 -07:00
Salman Paracha	7345657612	fixed bugs for integration tests	2025-08-23 16:37:52 -07:00
Salman Paracha	9f3a6f71a3	updated tests and added necessary response transformations for Anthropics' message response object	2025-08-22 16:06:48 -07:00
Salman Paracha	e73a9eb61c	transformations are working. Now need to add some tests next	2025-08-22 14:36:46 -07:00
Salman Paracha	0aa9243093	pushing draft PR	2025-08-21 22:24:07 -07:00
Salman Paracha	89ab51697a	updating the implementation of /v1/chat/completions to use the generi… (#548 ) * updating the implementation of /v1/chat/completions to use the generic provider interfaces * saving changes, although we will need a small re-factor after this as well * more refactoring changes, getting close * more refactoring changes to avoid unecessary re-direction and duplication * more clean up * more refactoring * more refactoring to clean code and make stream_context.rs work * removing unecessary trait implemenations * some more clean-up * fixed bugs * fixing test cases, and making sure all references to the ChatCOmpletions* objects point to the new types * refactored changes to support enum dispatch * removed the dependency on try_streaming_from_bytes into a try_from trait implementation * updated readme based on new usage * updated code based on code review comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-2.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local>	2025-08-20 12:55:29 -07:00
Salman Paracha	93ff4d7b1f	pushing new apis module for hermes (#547 )	2025-08-07 12:42:09 -07:00
Adil Hafeez	04c7e5a175	bug fix - allow image content to pass through (#539 ) fixes https://github.com/katanemo/archgw/issues/535	2025-07-25 01:22:06 -07:00
Adil Hafeez	d341f4365b	In request path use same format for usage preferences as arch_config (#533 )	2025-07-21 18:31:19 -07:00
Adil Hafeez	83f4d33434	refactor logging in brightstaff (#532 ) refactor logs, move unnecessary info log statements to debug and start logging latest chat completion message to log	2025-07-17 16:00:04 -07:00
Adil Hafeez	f819ee3507	pass model name in header when a route is selected when using usage preferences (#531 )	2025-07-17 13:41:58 -07:00
Adil Hafeez	a7fddf30f9	better model names (#517 )	2025-07-11 16:42:16 -07:00
Adil Hafeez	147908ba7e	make arch-router cluster optional (#518 )	2025-07-08 00:33:40 -07:00
Adil Hafeez	00dc95e034	Add support for updating model preferences (#510 )	2025-07-02 14:08:19 -07:00
Salman Paracha	a03aef8b4c	local support for Arch-Router via Ollama (#509 ) * local support for Arch-Router via Ollama * fixed issue withe non-local yaml config --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>	2025-06-20 17:32:14 -07:00
Adil Hafeez	7baec20772	release 0.3.2 (#507 )	2025-06-13 17:02:20 -07:00
Adil Hafeez	aa9d747fa9	add support for gemini (#505 )	2025-06-11 15:15:00 -07:00
Adil Hafeez	e734b76086	fix build break	2025-06-10 13:07:35 -07:00
Adil Hafeez	fb34dbdf6c	remove unnecessary rust files	2025-06-10 12:54:31 -07:00
Adil Hafeez	6c53510f49	Introduce hermesllm library to handle llm message translation (#501 )	2025-06-10 12:53:27 -07:00
Adil Hafeez	0d190a6e5c	update code to use new json based system prompt for routing (#493 )	2025-05-30 17:40:46 -07:00
Adil Hafeez	8d12a9a6e0	add arch provider (#494 )	2025-05-30 17:12:52 -07:00
Adil Hafeez	176f039bbc	fix model warning and use openwebui for preference based router demo	2025-05-30 12:29:56 -07:00
Adil Hafeez	fffa837a06	separate out currency exchange and preference based routing (#491 )	2025-05-30 02:14:37 -07:00
Adil Hafeez	470cdf9843	use provider_name as model_id /v1/models api (#490 )	2025-05-29 11:23:18 -07:00
Adil Hafeez	9c4733590f	add support for openwebui (#487 )	2025-05-28 19:08:00 -07:00
Adil Hafeez	d29eba4102	trim conversation if it exceed max limit of what router model can handle (#488 )	2025-05-27 20:28:22 -07:00
Adil Hafeez	99dd900a34	fix panic in brightstaff (#485 ) make router section optional in arch_config	2025-05-23 09:37:25 -07:00
Adil Hafeez	d050dfb85a	When router usage is defined ensure that router model is defined too (#481 )	2025-05-23 08:46:12 -07:00
Adil Hafeez	218e9c540d	Add support for json based content types in Message (#480 )	2025-05-23 00:51:53 -07:00
Adil Hafeez	f5e77bbe65	add support for claude and add first class support for groq and deepseek (#479 )	2025-05-22 22:55:46 -07:00
Adil Hafeez	27c0f2fdce	Introduce brightstaff a new terminal service for llm routing (#477 )	2025-05-19 09:59:22 -07:00
Shuguang Chen	7d4b261a68	Integrate Arch-Function-Chat (#449 )	2025-04-15 14:39:12 -07:00
Salman Paracha	f31aa59fac	fixed issue with groq LLMs that require the openai in the /v1/chat/co… (#460 ) * fixed issue with groq LLMs that require the openai in the /v1/chat/completions path. My first change * updated the GH actions with keys for Groq * adding missing groq API keys * add llama-3.2-3b-preview to the model based on addin groq to the demo --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2025-04-13 14:00:16 -07:00

1 2

100 commits