Update README.md

This commit is contained in:
Salman Paracha 2025-03-19 13:51:14 -07:00 committed by GitHub
parent 18becb9e4b
commit 6624e4efd9
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -39,10 +39,10 @@ And you think to youself, can't I move faster by focusing on higher-level object
**Core Features**:
- `Routing`. Engineered with purpose-built [LLMs](https://huggingface.co/collections/katanemo/arch-function-66f209a693ea8df14317ad68) for blazng fast (<100ms) routing and hand-off decisions to downstream agents.
- `Fast ⚡ Function Calling`: For common agentic scenarios, expose tools as APIs and let Arch detect, clarify and convert prompts to structured APIs
- `Guardrails`: Centralizes guardrails to prevent jailbreak attempts and harmful outcomes, and ensure safe user interactions without writing a single line of code.
- `Unified Access to LLMs`: Arch centralizes calls to LLMs used by your applications, offering smart retries, automatic cutover, and resilient upstream connections for continuous availability.
- `Observability`: Arch uses the W3C Trace Context standard to enable complete request tracing across applications, ensuring compatibility with observability tools, and provides metrics to monitor latency, token usage, and error rates, helping optimize AI application performance.
- `Fast ⚡ Function Calling`: For common agentic scenarios, expose tools as APIs and let Arch instantly clarify and convert prompts to structured APIs.
- `Guardrails`: Centralizes guardrails to prevent jailbreak attempts and harmful outcomes, and ensure safe user interactions.
- `Unified Access to LLMs`: Arch centralizes calls to LLMs, offering smart retries, automatic cutover, and resilient upstream connections for continuous availability.
- `Observability`: Arch uses the W3C Trace Context standard to enable request tracing, ensuring compatibility with observability tools, and provides metrics to monitor latency, token usage, and error rates.
- `Built on Envoy`: Arch runs alongside application servers as a separate containerized process, and builds on top of [Envoy's](https://envoyproxy.io) proven HTTP management and scalability features to handle ingress and egress traffic related to prompts and LLMs.
**High-Level Sequence Diagram**: