plano

mirror of https://github.com/katanemo/plano.git synced 2026-06-17 15:25:17 +02:00

Author	SHA1	Message	Date
Adil Hafeez	53d11ae235	add --docker flag to E2E tests and demo scripts	2026-03-03 15:08:50 -08:00
Adil Hafeez	473996d35d	Overhaul demos directory: cleanup, restructure, and standardize configs (#760 )	2026-02-17 03:09:28 -08:00
Adil Hafeez	c3591bcbf3	Upgrade CI, Docker, and demos to Python 3.14 (#759 ) Update all GitHub Actions workflows and Dockerfiles to use Python 3.14 as the default version. Remove the upper bound on requires-python in model_choice_with_test_harness to allow 3.14+. The CLI's requires-python stays at >=3.10 for broad compatibility.	2026-02-15 10:22:33 -08:00
Adil Hafeez	ba651aaf71	Rename all arch references to plano (#745 ) * Rename all arch references to plano across the codebase Complete rebrand from "Arch"/"archgw" to "Plano" including: - Config files: arch_config_schema.yaml, workflow, demo configs - Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_* - Python CLI: variables, functions, file paths, docker mounts - Rust crates: config paths, log messages, metadata keys - Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore - Docker Compose: volume mounts and env vars across all demos/tests - GitHub workflows: job/step names - Shell scripts: log messages - Demos: Python code, READMEs, VS Code configs, Grafana dashboard - Docs: RST includes, code comments, config references - Package metadata: package.json, pyproject.toml, uv.lock External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Update remaining arch references in docs - Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_* - Update label references in request_lifecycle.rst - Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml - Update config YAML comments: "Arch creates/uses" → "Plano creates/uses" - Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst - Update arch_config_schema.yaml reference in provider_models.py - Rename arch_agent_router → plano_agent_router in config example Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix remaining arch references found in second pass - config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE, arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs - config/test_passthrough.yaml: container mount path - tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml) - cli/planoai/core.py: comment and log message - crates/brightstaff/src/tracing/constants.rs: doc comment - tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages, arch_state/arch_messages variables renamed - tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages - demos/shared/test_runner/{common,test_demos}.py: same renames - tests/e2e/test_model_alias_routing.py: docstring - .dockerignore: archgw_modelserver → plano_modelserver - demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name Note: x-arch-* HTTP header values and Rust constant names intentionally preserved for backwards compatibility with existing deployments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 15:16:56 -08:00
Adil Hafeez	053e2b3a74	use uv instead of poetry (#663 )	2025-12-26 11:21:42 -08:00
Adil Hafeez	e8170f76ca	rename to planoai (#650 )	2025-12-23 19:26:51 -08:00
Adil Hafeez	e7ce00b5a7	rename cli to plano (#647 )	2025-12-23 18:37:58 -08:00
Salman Paracha	88c2bd1851	removing model_server python module to brightstaff (function calling) (#615 ) * adding function_calling functionality via rust * fixed rendered YAML file * removed model_server from envoy.template and forwarding traffic to bright_staff * fixed bugs in function_calling.rs that were breaking tests. All good now * updating e2e test to clean up disk usage * removing Arch* models to be used as a default model if one is not specified * if the user sets arch-function base_url we should honor it * fixing demos as we needed to pin to a particular version of huggingface_hub else the chatbot ui wouldn't build * adding a constant for Arch-Function model name * fixing some edge cases with calls made to Arch-Function * fixed JSON parsing issues in function_calling.rs * fixed bug where the raw response from Arch-Function was re-encoded * removed debug from supervisord.conf * commenting out disk cleanup * adding back disk space --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2025-11-22 12:55:00 -08:00
Salman Paracha	fb0581fd39	add support for v1/messages and transformations (#558 ) * pushing draft PR * transformations are working. Now need to add some tests next * updated tests and added necessary response transformations for Anthropics' message response object * fixed bugs for integration tests * fixed doc tests * fixed serialization issues with enums on response * adding some debug logs to help * fixed issues with non-streaming responses * updated the stream_context to update response bytes * the serialized bytes length must be set in the response side * fixed the debug statement that was causing the integration tests for wasm to fail * fixing json parsing errors * intentionally removing the headers * making sure that we convert the raw bytes to the correct provider type upstream * fixing non-streaming responses to tranform correctly * /v1/messages works with transformations to and from /v1/chat/completions * updating the CLI and demos to support anthropic vs. claude * adding the anthropic key to the preference based routing tests * fixed test cases and added more structured logs * fixed integration tests and cleaned up logs * added python client tests for anthropic and openai * cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo * fixing the tests. python dependency order was broken * updated the openAI client to fix demos * removed the raw response debug statement * fixed the dup cloning issue and cleaned up the ProviderRequestType enum and traits * fixing logs * moved away from string literals to consts * fixed streaming from Anthropic Client to OpenAI * removed debug statement that would likely trip up integration tests * fixed integration tests for llm_gateway * cleaned up test cases and removed unnecessary crates * fixing comments from PR * fixed bug whereby we were sending an OpenAIChatCompletions request object to llm_gateway even though the request may have been AnthropicMessages --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-4.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-9.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-10.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-41.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-136.local>	2025-09-10 07:40:30 -07:00
Adil Hafeez	aff389d342	don't run docker compose up for preference based router e2e demo tests (#499 )	2025-05-31 01:16:17 -07:00
Adil Hafeez	fffa837a06	separate out currency exchange and preference based routing (#491 )	2025-05-30 02:14:37 -07:00
Adil Hafeez	f5e77bbe65	add support for claude and add first class support for groq and deepseek (#479 )	2025-05-22 22:55:46 -07:00
Adil Hafeez	27c0f2fdce	Introduce brightstaff a new terminal service for llm routing (#477 )	2025-05-19 09:59:22 -07:00
Shuguang Chen	7d4b261a68	Integrate Arch-Function-Chat (#449 )	2025-04-15 14:39:12 -07:00
Adil Hafeez	eb48f3d5bb	use passed in model name in chat completion request (#445 )	2025-03-21 15:56:17 -07:00
Adil Hafeez	d2cb1427fb	add hurl tests for currency exchange demo (#435 )	2025-03-17 14:21:41 -07:00
Adil Hafeez	2f6c4348fd	update jaeger (#411 )	2025-02-14 14:55:41 -08:00
Salman Paracha	b3c95a6698	refactor demos (#398 )	2025-02-07 18:45:42 -08:00
Aayush	fcd8cfb9fc	add in honeycomb support for weather-forecast demo (#345 )	2025-01-21 17:15:27 -08:00
Aayush	885acc899f	322 add support for pydantic logfire for llm agent tracing (#329 ) * set up otel-collector and implement sending to logfire * moved rest of the files for the demo into the folder * update docker-compose.yaml and run_demo.sh to properly check for LOGFIRE_API_KEY * refactor weather_forecast demo to only be one demo * add a default docker-compose for e2e tests * update based on requested changes * fix replace comma with colon in readme * remove weather_forecast_service folder, and make logfire demo fail instantly if no key is set * remove the unused weather forecast service folder * Changed stop_demo to only stop one file at a time * update readme with new demo stopping setup * Revert changes to end behavior * fix silly formatting mistake	2024-12-06 13:44:22 -08:00
Peter Jausovec	f5cdafb7c8	update alertmanager version to v2, remove the merge artifacts (#309 ) Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>	2024-11-27 11:41:31 -08:00
Adil Hafeez	d3c17c7abd	move custom tracer to llm filter (#267 )	2024-11-15 10:44:01 -08:00
Aayush	1d229cba8f	Add in tpot (#269 ) * add in tpot and tokens per second * add in debug logs for new stats and update integration tests * update shared dashboard to include new stats	2024-11-14 15:03:08 -08:00
Adil Hafeez	31749bfc74	move grafana and prometheus to shared (#265 )	2024-11-12 15:23:30 -08:00
Adil Hafeez	30647fd508	Add service to stream custom otel traces to otel-collector (#262 )	2024-11-12 11:09:40 -08:00
Adil Hafeez	d87105882b	update rust toolchain to 1.82 (#255 ) * update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13 * use python:3.12	2024-11-12 10:35:14 -08:00
Adil Hafeez	6b62662e01	update docs with weather_forecast path (#253 )	2024-11-08 10:00:15 -08:00
Adil Hafeez	a72bb804eb	add support for jaeger tracing (#229 )	2024-11-07 22:11:00 -06:00
Salman Paracha	dab7a44053	several fixes to demos (#238 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2024-10-30 18:38:18 -07:00
Salman Paracha	bb882fb59b	Updated hr_agent to be full stack: gradio + fastAPI (#235 ) * commiting to remove * fix * updating hr_agent --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local> Co-authored-by: Adil Hafeez <adil@katanemo.com>	2024-10-30 15:05:34 -07:00
Salman Paracha	bb9a774a72	moving chatbot-ui in demos and out of root project structure (#228 ) Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>	2024-10-29 12:05:29 -07:00

31 commits