plano

mirror of https://github.com/katanemo/plano.git synced 2026-04-27 01:36:33 +02:00

Author	SHA1	Message	Date
Adil Hafeez	0d190a6e5c	update code to use new json based system prompt for routing (#493 )	2025-05-30 17:40:46 -07:00
Adil Hafeez	176f039bbc	fix model warning and use openwebui for preference based router demo	2025-05-30 12:29:56 -07:00
Adil Hafeez	fffa837a06	separate out currency exchange and preference based routing (#491 )	2025-05-30 02:14:37 -07:00
Adil Hafeez	9c4733590f	add support for openwebui (#487 )	2025-05-28 19:08:00 -07:00
Adil Hafeez	79cbcb5fe1	add claude-4 in llm_routing demo (#486 )	2025-05-23 10:21:21 -07:00
Adil Hafeez	218e9c540d	Add support for json based content types in Message (#480 )	2025-05-23 00:51:53 -07:00
Adil Hafeez	f5e77bbe65	add support for claude and add first class support for groq and deepseek (#479 )	2025-05-22 22:55:46 -07:00
Adil Hafeez	27c0f2fdce	Introduce brightstaff a new terminal service for llm routing (#477 )	2025-05-19 09:59:22 -07:00
Adil Hafeez	2e346143dd	use separate host port for chat ui and for app_server (#473 ) We were using same port for both chatui and app_server which was causing conflict. This code change updates host port for app_server to 18083 and updates arch_config	2025-04-23 14:05:48 -07:00
Shuguang Chen	7d4b261a68	Integrate Arch-Function-Chat (#449 )	2025-04-15 14:39:12 -07:00
Salman Paracha	f31aa59fac	fixed issue with groq LLMs that require the openai in the /v1/chat/co… (#460 ) * fixed issue with groq LLMs that require the openai in the /v1/chat/completions path. My first change * updated the GH actions with keys for Groq * adding missing groq API keys * add llama-3.2-3b-preview to the model based on addin groq to the demo --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2025-04-13 14:00:16 -07:00
Mat Sylvia	e7b0de2a72	Tweak readme docs for minor nits (#461 ) Co-authored-by: darkdatter <msylvia@tradestax.io>	2025-04-12 23:52:20 -07:00
Adil Hafeez	eb48f3d5bb	use passed in model name in chat completion request (#445 )	2025-03-21 15:56:17 -07:00
Adil Hafeez	84cd1df7bf	add preliminary support for llm agents (#432 )	2025-03-19 15:21:34 -07:00
Adil Hafeez	d2cb1427fb	add hurl tests for currency exchange demo (#435 )	2025-03-17 14:21:41 -07:00
Adil Hafeez	5526314b3c	fix ollama demo - add egress endpoint for llm routing (#433 )	2025-03-17 11:50:14 -07:00
Salman Paracha	ffb8566c36	updating the network agent to work agin (#425 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2025-03-08 14:29:11 -08:00
Adil Hafeez	e8dc7f18d3	start using base_url in place of endpoint (#430 )	2025-03-05 17:20:04 -08:00
Adil Hafeez	ed3845040e	add demo for deepseek (#426 )	2025-03-05 14:08:06 -08:00
Salman Paracha	2e2680c98c	Salanap/java debug ai (#416 ) * stashing changes on my local branch * updated the java demo with debug points and jaeger tracing --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2025-03-04 12:49:34 -08:00
Shuguang Chen	e77fc47225	Handle intent matching better in arch gateway (#391 )	2025-03-04 12:49:13 -08:00
Adil Hafeez	e40b13be05	Update arch_config and add tests for arch config file (#407 )	2025-02-14 19:28:10 -08:00
Adil Hafeez	2f6c4348fd	update jaeger (#411 )	2025-02-14 14:55:41 -08:00
Salman Paracha	28ee14a75c	updated the spotify bearer authorization README and fixed main README… (#402 ) * updated the spotify bearer authorization README and fixed main README links * minor fixes to SPOTIFY README --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2025-02-10 17:56:28 -08:00
Salman Paracha	6ad70d9737	Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400 )	2025-02-10 11:23:04 -08:00
Adil Hafeez	8de6eacfbd	spotify demo with optimized context window code change (#397 )	2025-02-07 19:14:15 -08:00
Salman Paracha	b3c95a6698	refactor demos (#398 )	2025-02-07 18:45:42 -08:00
Adil Hafeez	2bd61d628c	add ability to specify custom http headers in api endpoint (#386 )	2025-02-06 11:48:09 -08:00
Adil Hafeez	962727f244	Infer port from protocol if port is not specified and add ability to override hostname in clusters def (#389 )	2025-02-03 14:51:59 -08:00
Adil Hafeez	38f7691163	add support for custom llm with ssl support (#380 ) * add support for custom llm with ssl support Add support for using custom llm that are served through https protocol. * add instructions on how to add custom inference endpoint * fix formatting * add more details * Apply suggestions from code review Co-authored-by: Salman Paracha <salman.paracha@gmail.com> * Apply suggestions from code review * fix precommit --------- Co-authored-by: Salman Paracha <salman.paracha@gmail.com>	2025-01-24 17:14:24 -08:00
Adil Hafeez	2c67fa3bc0	Fix llm_routing provider element (#382 ) * Fix llm_routing provider element We replaced provider with provider_interface to make it more clear to developers about provider api/backend being used. During that upgrade we removed support for mistral in provider to encourage developers to start using provider_interface. But this demo was not updated to use provider_interface as it was using mistral. This code change fixes it by replacing provider with provider_interface. Signed-off-by: Adil Hafeez <adil.hafeez@gmail.com> * fix the path * move * add more details * fix * Apply suggestions from code review * fix * fix --------- Signed-off-by: Adil Hafeez <adil.hafeez@gmail.com>	2025-01-24 16:34:11 -08:00
Guy Stone	84af476c75	Add run_demo.sh for llm_routing demo (#381 ) * Create run_demo.sh for llm_routing demo * Update run_demo.sh	2025-01-24 14:41:51 -08:00
Adil Hafeez	6740a09952	add docker-compose file for honeycomb tracing (#377 )	2025-01-22 14:02:59 -08:00
Aayush	fcd8cfb9fc	add in honeycomb support for weather-forecast demo (#345 )	2025-01-21 17:15:27 -08:00
Adil Hafeez	07ef3149b8	add support for using custom upstream llm (#365 )	2025-01-17 18:25:55 -08:00
Adil Hafeez	8407edae99	only test currency exchange in demo test (#348 )	2024-12-21 11:33:08 -08:00
Shuguang Chen	ba7279becb	Use intent model from archfc to pick prompt gateway (#328 )	2024-12-20 13:25:01 -08:00
Adil Hafeez	93d3d349a2	fix code bug (#340 )	2024-12-06 17:20:59 -08:00
Adil Hafeez	a54db1a098	update getting started guide and add llm gateway and prompt gateway samples (#330 )	2024-12-06 14:37:33 -08:00
Aayush	9d8fe02729	fix the README for the weather_forecasting demo (#336 ) * README fix * add missing colon	2024-12-06 14:02:41 -08:00
Aayush	885acc899f	322 add support for pydantic logfire for llm agent tracing (#329 ) * set up otel-collector and implement sending to logfire * moved rest of the files for the demo into the folder * update docker-compose.yaml and run_demo.sh to properly check for LOGFIRE_API_KEY * refactor weather_forecast demo to only be one demo * add a default docker-compose for e2e tests * update based on requested changes * fix replace comma with colon in readme * remove weather_forecast_service folder, and make logfire demo fail instantly if no key is set * remove the unused weather forecast service folder * Changed stop_demo to only stop one file at a time * update readme with new demo stopping setup * Revert changes to end behavior * fix silly formatting mistake	2024-12-06 13:44:22 -08:00
Peter Jausovec	f5cdafb7c8	update alertmanager version to v2, remove the merge artifacts (#309 ) Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>	2024-11-27 11:41:31 -08:00
Adil Hafeez	0ff3d43008	remove dependency on docker-compose when starting up archgw (#305 )	2024-11-26 13:13:02 -08:00
Adil Hafeez	726f1a3185	add schema change to use enum in arch_config (#304 )	2024-11-25 17:51:25 -08:00
Adil Hafeez	9c6fcdb771	use fix prompt guards (#303 )	2024-11-25 17:16:35 -08:00
Adil Hafeez	d3c17c7abd	move custom tracer to llm filter (#267 )	2024-11-15 10:44:01 -08:00
Aayush	1d229cba8f	Add in tpot (#269 ) * add in tpot and tokens per second * add in debug logs for new stats and update integration tests * update shared dashboard to include new stats	2024-11-14 15:03:08 -08:00
Adil Hafeez	31749bfc74	move grafana and prometheus to shared (#265 )	2024-11-12 15:23:30 -08:00
Aayush	5993e36f22	Update arch stats (#250 )	2024-11-12 15:03:26 -08:00
Adil Hafeez	30647fd508	Add service to stream custom otel traces to otel-collector (#262 )	2024-11-12 11:09:40 -08:00

1 2 3

144 commits