//! Shared constants for the `common` crate: HTTP header keys, chat message
//! roles, request timeouts, internal cluster names, and API route paths.
// --- Rate limiting ---

/// Header used to select which rate-limit rule applies to a request.
pub const RATELIMIT_SELECTOR_HEADER_KEY: &str = "x-arch-ratelimit-selector";

// --- Chat message roles (OpenAI-compatible) ---

pub const SYSTEM_ROLE: &str = "system";
pub const USER_ROLE: &str = "user";
pub const TOOL_ROLE: &str = "tool";
pub const ASSISTANT_ROLE: &str = "assistant";

// --- Request timeouts, in milliseconds ---

pub const ARCH_FC_REQUEST_TIMEOUT_MS: u64 = 30000; // 30 seconds
pub const DEFAULT_TARGET_REQUEST_TIMEOUT_MS: u64 = 30000; // 30 seconds
pub const API_REQUEST_TIMEOUT_MS: u64 = 30000; // 30 seconds
pub const MODEL_SERVER_REQUEST_TIMEOUT_MS: u64 = 30000; // 30 seconds

// --- Upstream service names ---

/// Name of the model-server upstream.
pub const MODEL_SERVER_NAME: &str = "bright_staff";

// --- Routing / metadata headers (x-arch-*) ---

/// Header carrying the resolved LLM provider for upstream routing.
pub const ARCH_ROUTING_HEADER: &str = "x-arch-llm-provider";
/// Header hinting which LLM provider the client prefers.
pub const ARCH_PROVIDER_HINT_HEADER: &str = "x-arch-llm-provider-hint";
/// Header marking a request as a streaming request.
pub const ARCH_IS_STREAMING_HEADER: &str = "x-arch-streaming-request";

/// JSON key for the message list in chat-completion payloads.
pub const MESSAGES_KEY: &str = "messages";

// --- API route paths ---

pub const CHAT_COMPLETIONS_PATH: &str = "/v1/chat/completions";
pub const MESSAGES_PATH: &str = "/v1/messages";
pub const HEALTHZ_PATH: &str = "/healthz";

// --- State / function-calling headers ---

pub const X_ARCH_STATE_HEADER: &str = "x-arch-state";
pub const X_ARCH_API_RESPONSE: &str = "x-arch-api-response-message";
pub const X_ARCH_TOOL_CALL: &str = "x-arch-tool-call-message";
pub const X_ARCH_FC_MODEL_RESPONSE: &str = "x-arch-fc-model-response";

// --- Model identifiers ---

/// Name of the Arch function-calling model.
pub const ARCH_FC_MODEL_NAME: &str = "Arch-Function";
/// Prefix identifying Arch-family models.
pub const ARCH_MODEL_PREFIX: &str = "Arch";

// --- Tracing / request identification ---

pub const REQUEST_ID_HEADER: &str = "x-request-id";
/// W3C Trace Context propagation header.
pub const TRACE_PARENT_HEADER: &str = "traceparent";

// --- Internal cluster routing ---

pub const ARCH_INTERNAL_CLUSTER_NAME: &str = "arch_internal";
/// Header naming the upstream host a request should be forwarded to.
pub const ARCH_UPSTREAM_HOST_HEADER: &str = "x-arch-upstream";

// --- User-facing templates ---

/// Prefix for the follow-up prompt sent when required parameters are missing
/// (hallucination guard); the missing-detail list is appended by the caller.
pub const HALLUCINATION_TEMPLATE: &str =
    "It seems I'm missing some information. Could you provide the following details ";

// --- OpenTelemetry export ---

/// Cluster name of the OTEL collector's HTTP endpoint.
pub const OTEL_COLLECTOR_HTTP: &str = "opentelemetry_collector_http";
/// OTLP/HTTP trace-export path.
pub const OTEL_POST_PATH: &str = "/v1/traces";

// --- Misc routing / retry headers ---

/// Header carrying the selected LLM route.
pub const LLM_ROUTE_HEADER: &str = "x-arch-llm-route";
/// Envoy header capping the number of upstream retries.
pub const ENVOY_RETRY_HEADER: &str = "x-envoy-max-retries";