plano

mirror of https://github.com/katanemo/plano.git synced 2026-06-08 14:55:14 +02:00

Author	SHA1	Message	Date
Troy Mitchell	ed5e1d69d4	retry: add retry orchestrator coordinating all components Implement RetryOrchestrator as the top-level coordinator that: - Manages the full retry lifecycle per request - Integrates backoff, error detection, provider selection - Handles request deduplication via content hashing - Supports both same-provider retry and cross-provider failover - Emits structured attempt records for observability Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 17:05:03 +08:00
Troy Mitchell	52c71fe23f	retry: add provider selector with failover logic Implement ProviderSelector that determines the next provider for retry attempts based on: - Failover provider list with priority ordering - Latency-blocked provider filtering - Retry-After header honoring - Round-robin and priority-based selection strategies Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 17:05:03 +08:00
Troy Mitchell	d6a9ada93a	retry: add configuration validation for retry policies Implement validate_retry_policy() that checks retry policy configuration for errors and warnings including: - Invalid max_retries/timeout ranges - Conflicting backoff and jitter settings - Missing or invalid provider references - Latency threshold consistency checks Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 17:05:03 +08:00
Troy Mitchell	46b6324819	retry: add state managers for latency blocking and retry-after Add three state management components: - LatencyBlockStateManager: tracks providers blocked due to high latency with configurable block duration and scope - LatencyTriggerCounter: counts consecutive latency threshold breaches before triggering provider blocking - RetryAfterStateManager: honors Retry-After headers with per-provider/model/endpoint blocking scope Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 17:05:03 +08:00
Troy Mitchell	47a3e8a8e6	retry: add error response builder for retry exhaustion Implement RetryErrorResponseBuilder that constructs structured JSON error responses when all retry attempts are exhausted, including per-attempt error details and provider information. Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 17:05:03 +08:00
Troy Mitchell	ea5617612a	retry: add error detector for HTTP response classification Implement ErrorDetector that classifies HTTP responses into: - Retryable errors (5xx, 429, timeouts) - Non-retryable errors (4xx client errors) - Successful responses Supports configurable status code matching and latency-based error detection with measurement strategies (TTFB/total). Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 17:05:03 +08:00
Troy Mitchell	3c7fefac81	retry: add backoff calculator with jitter strategies Implement BackoffCalculator supporting: - Exponential backoff with configurable base/max delay - Full, equal, and decorrelated jitter strategies - Per-provider and per-status-code backoff overrides - Comprehensive unit tests for all strategies Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 17:05:03 +08:00
Troy Mitchell	5a2d0aa52e	retry: add core retry types and module structure Add the retry module with core type definitions including: - RequestContext, RequestSignature for request deduplication - RetryExhaustedError, AllProvidersExhaustedError for error handling - AttemptError, AttemptErrorType for attempt tracking - ValidationError, ValidationWarning for config validation - Helper functions for provider extraction and hashing Wire up pub mod retry in lib.rs. Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 17:05:03 +08:00
Troy Mitchell	6853e4d88f	common: add sha2, dashmap, tokio runtime dependencies for retry module Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 15:31:57 +08:00
Troy Mitchell	388fbff8e6	common: add RetryPolicy proptest and YAML pattern tests Add comprehensive tests for retry policy configuration: - proptest: round-trip serialization, default invariants, status code expansion (single, range, full range) - YAML pattern tests covering 17 real-world configuration patterns: multi-provider failover, same-provider model downgrade, backoff on multiple error types, per-status-code strategy customization, timeout-specific config, no-retry, backoff scopes (model/provider/ global), high-latency blocking, retry-after handling, fallback models list, mixed integer and range codes Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 15:25:21 +08:00
Troy Mitchell	a58a283e20	common: add RetryPolicy configuration types Add retry policy configuration types to support automatic retry and failover for LLM requests: - RetryPolicy: top-level config with fallback_models, default_strategy, default_max_attempts, and per-status-code overrides - BackoffConfig: exponential backoff with base_ms, max_ms, jitter, and scope (per-model, per-provider, or global) - RetryAfterConfig: Retry-After header handling with block scope and duration limits - HighLatencyConfig: latency-based blocking with threshold, measurement type, and trigger conditions - LatencyTriggerConfig: min_triggers and trigger_window for debouncing - RetryStrategy enum: same_model, same_provider, different_provider - StatusCodeEntry: flexible status code matching (single, range, list) Also add retry_policy field to GatewayConfig with Default impl. Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 15:22:47 +08:00
Troy Mitchell	2548aa71cb	common: add proptest dev-dependency for configuration tests Signed-off-by: Troy Mitchell <i@troy-y.org>	2026-04-28 15:20:21 +08:00
Musa	2954ae258f	fix(config): accept `vercel` and `openrouter` as provider_interface values (#915 ) The Python CLI (#902) and the JSON schema both allow `vercel` and `openrouter` as `provider_interface`, and `hermesllm::ProviderId` knows how to dispatch them — but `crates/common::LlmProviderType` was never extended to deserialize them. As a result, `planoai up` with no user config (which synthesizes both providers via `cli/planoai/defaults.py`) caused brightstaff to crash on startup with: unknown variant `vercel`, expected one of `anthropic`, ..., `digitalocean` Add the missing enum variants and Display arms, plus a regression test that asserts both round-trip through serde and resolve through `to_provider_id()` (the exact path that previously panicked at parse).	2026-04-24 16:32:00 -07:00
Musa	897fda2deb	fix(routing): auto-migrate v0.3.0 inline routing_preferences to v0.4.0 top-level (#912 ) * fix(routing): auto-migrate v0.3.0 inline routing_preferences to v0.4.0 top-level Lift inline routing_preferences under each model_provider into the top-level routing_preferences list with merged models[] and bump version to v0.4.0, with a deprecation warning. Existing v0.3.0 demo configs (Claude Code, Codex, preference_based_routing, etc.) keep working unchanged. Schema flags the inline shape as deprecated but still accepts it. Docs and skills updated to canonical top-level multi-model form. * test(common): bump reference config assertion to v0.4.0 The rendered reference config was bumped to v0.4.0 when its inline routing_preferences were lifted to the top level; align the configuration deserialization test with that change. * fix(config_generator): bump version to v0.4.0 up front in migration Move the v0.3.0 -> v0.4.0 version bump to the top of migrate_inline_routing_preferences so it runs unconditionally, including for configs that already declare top-level routing_preferences at v0.3.0. Previously the bump only fired when inline migration produced entries, leaving top-level v0.3.0 configs rejected by brightstaff's v0.4.0 gate. Tests updated to cover the new behavior and to confirm we never downgrade newer versions. * fix(config_generator): gate routing_preferences migration on version < v0.4.0 Short-circuit the migration when the config already declares v0.4.0 or newer. Anything at v0.4.0+ is assumed to be on the canonical top-level shape and is passed through untouched, including stray inline preferences (which are the author's bug to fix). Only v0.3.0 and older configs are rewritten and bumped.	2026-04-24 12:31:44 -07:00
Musa	78dc4edad9	Add first-class ChatGPT subscription provider support (#881 ) * Add first-class ChatGPT subscription provider support * Address PR feedback: move uuid import to top, reuse parsed config in up() * Add ChatGPT token watchdog for seamless long-lived sessions * Address PR feedback: error on stream=false for ChatGPT, fix auth file permissions * Replace ChatGPT watchdog/restart with passthrough_auth --------- Co-authored-by: Musa Malik <musam@uw.edu>	2026-04-23 15:34:44 -07:00
Adil Hafeez	6701195a5d	add overrides.disable_signals to skip CPU-heavy signal analysis (#906 )	2026-04-23 11:38:29 -07:00
Adil Hafeez	1f701258cb	Zero-config planoai up: pass-through proxy with auto-detected providers (#890 )	2026-04-17 13:11:12 -07:00
Adil Hafeez	90b926c2ce	use plano-orchestrator for LLM routing, remove arch-router (#886 )	2026-04-15 16:41:42 -07:00
Musa	980faef6be	Redis-backed session cache for cross-replica model affinity (#879 ) * add pluggable session cache with Redis backend * add Redis session affinity demos (Docker Compose and Kubernetes) * address PR review feedback on session cache * document Redis session cache backend for model affinity * sync rendered config reference with session_cache addition * add tenant-scoped Redis session cache keys and remove dead log_affinity_hit - Add tenant_header to SessionCacheConfig; when set, cache keys are scoped as plano:affinity:{tenant_id}:{session_id} for multi-tenant isolation - Thread tenant_id through RouterService, routing_service, and llm handlers - Use Cow<'_, str> in session_key to avoid allocation when no tenant is set - Remove unused log_affinity_hit (logging was already inlined at call sites) * remove session_affinity_redis and session_affinity_redis_k8s demos	2026-04-13 19:30:47 -07:00
Adil Hafeez	8dedf0bec1	Model affinity for consistent model selection in agentic loops (#827 )	2026-04-08 17:32:02 -07:00
Musa	978b1ea722	Add first-class Xiaomi provider support (#863 ) * feat(provider): add xiaomi as first-class provider * feat(demos): add xiaomi mimo integration demo * refactor(demos): remove Xiaomi MiMo integration demo and update documentation * updating model list and adding the xiamoi models --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-389.local>	2026-04-04 09:58:36 -07:00
Musa	f68c21f8df	Handle null prefer in inline routing policy (#856 ) * Handle null prefer in inline routing policy * Use serde defaulting for null selection preference * Add tests for default selection policy behavior in routing preferences	2026-03-31 17:41:25 -07:00
Adil Hafeez	d8f4fd76e3	replace production panics with graceful error handling in common crate (#844 )	2026-03-31 14:28:11 -07:00
Adil Hafeez	af98c11a6d	restructure model_metrics_sources to type + provider (#855 )	2026-03-30 17:12:20 -07:00
Adil Hafeez	e5751d6b13	model routing: cost/latency ranking with ranked fallback list (#849 )	2026-03-30 13:46:52 -07:00
Adil Hafeez	1f23c573bf	add output filter chain (#822 )	2026-03-18 17:58:20 -07:00
Adil Hafeez	bc059aed4d	Unified overrides for custom router and orchestrator models (#820 ) * support configurable orchestrator model via orchestration config section * add self-hosting docs and demo for Plano-Orchestrator * list all Plano-Orchestrator model variants in docs * use overrides for custom routing and orchestration model * update docs * update orchestrator model name * rename arch provider to plano, use llm_routing_model and agent_orchestration_model * regenerate rendered config reference	2026-03-15 09:36:11 -07:00
Musa	2bde21ff57	add Custom Trace Attributes to extend observability (#708 ) * add custom trace attributes * refactor: prefix custom trace attributes and update schema handlers tests configs * refactor: rename custom_attribute_prefixes to span_attribute_header_prefixes in configuration and related handlers * docs: add section on custom span attributes * refactor: update tracing configuration to use span attributes and adjust related handlers * docs: custom span attributes section to include static attributes and clarify configuration * add custom trace attributes * refactor: prefix custom trace attributes and update schema handlers tests configs * refactor: rename custom_attribute_prefixes to span_attribute_header_prefixes in configuration and related handlers * docs: add section on custom span attributes * refactor: update tracing configuration to use span attributes and adjust related handlers * docs: custom span attributes section to include static attributes and clarify configuration * refactor: remove TraceCollector usage and enhance logging with structured attributes * refactor: custom trace attribute extraction to improve clarity --------- Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-25 16:27:20 -08:00
Syed A. Hashmi	54bc8e5e52	[ISSUE 706]: Standardize returned errors from Plano (#772 ) * [ISSUE 706]: Standardize returned errors from Plano * Standardized errors in chat completion	2026-02-24 14:34:33 -08:00
Adil Hafeez	ba651aaf71	Rename all arch references to plano (#745 ) * Rename all arch references to plano across the codebase Complete rebrand from "Arch"/"archgw" to "Plano" including: - Config files: arch_config_schema.yaml, workflow, demo configs - Environment variables: ARCH_CONFIG_* → PLANO_CONFIG_* - Python CLI: variables, functions, file paths, docker mounts - Rust crates: config paths, log messages, metadata keys - Docker/build: Dockerfile, supervisord, .dockerignore, .gitignore - Docker Compose: volume mounts and env vars across all demos/tests - GitHub workflows: job/step names - Shell scripts: log messages - Demos: Python code, READMEs, VS Code configs, Grafana dashboard - Docs: RST includes, code comments, config references - Package metadata: package.json, pyproject.toml, uv.lock External URLs (docs.archgw.com, github.com/katanemo/archgw) left as-is. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Update remaining arch references in docs - Rename RST cross-reference labels: arch_access_logging, arch_overview_tracing, arch_overview_threading → plano_* - Update label references in request_lifecycle.rst - Rename arch_config_state_storage_example.yaml → plano_config_state_storage_example.yaml - Update config YAML comments: "Arch creates/uses" → "Plano creates/uses" - Update "the Arch gateway" → "the Plano gateway" in configuration_reference.rst - Update arch_config_schema.yaml reference in provider_models.py - Rename arch_agent_router → plano_agent_router in config example Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix remaining arch references found in second pass - config/docker-compose.dev.yaml: ARCH_CONFIG_FILE → PLANO_CONFIG_FILE, arch_config.yaml → plano_config.yaml, archgw_logs → plano_logs - config/test_passthrough.yaml: container mount path - tests/e2e/docker-compose.yaml: source file path (was still arch_config.yaml) - cli/planoai/core.py: comment and log message - crates/brightstaff/src/tracing/constants.rs: doc comment - tests/{e2e,archgw}/common.py: get_arch_messages → get_plano_messages, arch_state/arch_messages variables renamed - tests/{e2e,archgw}/test_prompt_gateway.py: updated imports and usages - demos/shared/test_runner/{common,test_demos}.py: same renames - tests/e2e/test_model_alias_routing.py: docstring - .dockerignore: archgw_modelserver → plano_modelserver - demos/use_cases/claude_code_router/pretty_model_resolution.sh: container name Note: x-arch-* HTTP header values and Rust constant names intentionally preserved for backwards compatibility with existing deployments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 15:16:56 -08:00
Adil Hafeez	46de89590b	use standard tracing and logging in brightstaff (#721 )	2026-02-09 13:33:27 -08:00
Adil Hafeez	e41aa0a617	upgrade rust to 1.93.0 and fix pre-commit (#720 )	2026-02-02 11:03:12 -08:00
Salman Paracha	2941392ed1	Adding support for wildcard models in the model_providers config (#696 ) * cleaning up plano cli commands * adding support for wildcard model providers * fixing compile errors * fixing bugs related to default model provider, provider hint and duplicates in the model provider list * fixed cargo fmt issues * updating tests to always include the model id * using default for the prompt_gateway path * fixed the model name, as gpt-5-mini-2025-08-07 wasn't in the config * making sure that all aliases and models match the config * fixed the config generator to allow for base_url providers LLMs to include wildcard models * re-ran the models list utility and added a shell script to run it * updating docs to mention wildcard model providers * updated provider_models.json to yaml, added that file to our docs for reference * updating the build docs to use the new root-based build --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2026-01-28 17:47:33 -08:00
Tang Quoc Thai	4d53297c17	feat: add passthrough_auth option for forwarding client Authorization header (#687 ) * feat: add passthrough_auth option for forwarding client Authorization header * fix tests * Update comment to reflect upstream forwarding * Apply suggestions from code review --------- Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com> Co-authored-by: Adil Hafeez <adil@katanemo.com>	2026-01-14 15:06:28 -08:00
Adil Hafeez	ab391f96c7	don't include internal models in /v1/models endpoint (#685 )	2026-01-09 16:57:41 -08:00
Adil Hafeez	11fb4cd633	remove unnecessary clones from code (#682 )	2026-01-08 15:11:05 -08:00
Adil Hafeez	57327ba667	ensure that request id is consistent (#677 ) * ensure that request id is consistent * remove test debug/info statements	2026-01-07 08:44:41 -08:00
Adil Hafeez	ca95ffb63d	cargo clippy (#660 )	2025-12-25 21:08:37 -08:00
Salman Paracha	e224cba3e3	Update docs to Plano (#639 )	2025-12-23 17:14:50 -08:00
Adil Hafeez	15fbb6c3af	plano orchestration using plano orchestration 4b model (#637 )	2025-12-22 18:05:49 -08:00
Adil Hafeez	2f9121407b	Use mcp tools for filter chain (#621 ) * agents framework demo * more changes * add more changes * pending changes * fix tests * fix more * rebase with main and better handle error from mcp * add trace for filters * add test for client error, server error and for mcp error * update schema validate code and rename kind => type in agent_filter * fix agent description and pre-commit * fix tests * add provider specific request parsing in agents chat * fix precommit and tests * cleanup demo * update readme * fix pre-commit * refactor tracing * fix fmt * fix: handle MessageContent enum in responses API conversion - Update request.rs to handle new MessageContent enum structure from main - MessageContent can now be Text(String) or Items(Vec<InputContent>) - Handle new InputItem variants (ItemReference, FunctionCallOutput) - Fixes compilation error after merging latest main (#632) * address pr feedback * fix span * fix build * update openai version	2025-12-17 17:30:14 -08:00
Shuguang Chen	cb82a83c7b	orchestration integration (#623 ) * orchestration integration * Convert compact json to spaced json	2025-12-17 17:20:19 -08:00
Salman Paracha	d5a273f740	enable state management for v1/responses (#631 ) * first commit with tests to enable state mamangement via memory * fixed logs to follow the conversational flow a bit better * added support for supabase * added the state_storage_v1_responses flag, and use that to store state appropriately * cleaned up logs and fixed issue with connectivity for llm gateway in weather forecast demo * fixed mixed inputs from openai v1/responses api (#632) * fixed mixed inputs from openai v1/responses api * removing tracing from model-alias-rouing * handling additional input types from openairs --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local> * resolving PR comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2025-12-17 12:18:38 -08:00
Salman Paracha	a79f55f313	Improve end to end tracing (#628 ) * adding canonical tracing support via bright-staff * improved formatting for tools in the traces * removing anthropic from the currency exchange demo * using Envoy to transport traces, not calling OTEL directly * moving otel collcetor cluster outside tracing if/else * minor fixes to not write to the OTEL collector if tracing is disabled * fixed PR comments and added more trace attributes * more fixes based on PR comments * more clean up based on PR comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2025-12-11 15:21:57 -08:00
Salman Paracha	a448c6e9cb	Add support for v1/responses API (#622 ) * making first commit. still need to work on streaming respones * making first commit. still need to work on streaming respones * stream buffer implementation with tests * adding grok API keys to workflow * fixed changes based on code review * adding support for bedrock models * fixed issues with translation to claude code --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2025-12-03 14:58:26 -08:00
Salman Paracha	88c2bd1851	removing model_server python module to brightstaff (function calling) (#615 ) * adding function_calling functionality via rust * fixed rendered YAML file * removed model_server from envoy.template and forwarding traffic to bright_staff * fixed bugs in function_calling.rs that were breaking tests. All good now * updating e2e test to clean up disk usage * removing Arch* models to be used as a default model if one is not specified * if the user sets arch-function base_url we should honor it * fixing demos as we needed to pin to a particular version of huggingface_hub else the chatbot ui wouldn't build * adding a constant for Arch-Function model name * fixing some edge cases with calls made to Arch-Function * fixed JSON parsing issues in function_calling.rs * fixed bug where the raw response from Arch-Function was re-encoded * removed debug from supervisord.conf * commenting out disk cleanup * adding back disk space --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local> Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-342.local>	2025-11-22 12:55:00 -08:00
Salman Paracha	cdfcfb9169	support base_url path for model providers (#608 ) * adding support for base_url * updated docs * fixed tests for config generator * making fixes based on PR comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-10-29 17:08:07 -07:00
Salman Paracha	566e7b9c09	fixed bug in Bedrock translation code and dramatically improved tracing for outbound LLM traffic (#601 ) * dramatically improve LLM traces and fixed bug with Bedrock translation from claude code * addressing comments --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local>	2025-10-24 14:07:05 -07:00
Salman Paracha	9407ae6af7	Add support for Amazon Bedrock Converse and ConverseStream (#588 ) * first commit to get Bedrock Converse API working. Next commit support for streaming and binary frames * adding translation from BedrockBinaryFrameDecoder to AnthropicMessagesEvent * Claude Code works with Amazon Bedrock * added tests for openai streaming from bedrock * PR comments fixed * adding support for bedrock in docs as supported provider * cargo fmt * revertted to chatgpt models for claude code routing --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-288.local> Co-authored-by: Adil Hafeez <adil.hafeez@gmail.com>	2025-10-22 11:31:21 -07:00
Adil Hafeez	96e0732089	add support for agents (#564 )	2025-10-14 14:01:11 -07:00

105 commits