Update README.md

Salman Paracha 2025-03-19 13:56:25 -07:00 committed by GitHub
parent 3fcb63d357
commit 668a8e4793
GPG key ID: B5690EEEBB952194

@@ -38,12 +38,12 @@ And you think to yourself, can't I move faster by focusing on higher-level object
**Core Features**:
- `Routing`: Engineered with purpose-built [LLMs](https://huggingface.co/collections/katanemo/arch-function-66f209a693ea8df14317ad68) for blazing fast (<100ms) routing and hand-off decisions to downstream agents.
- `Fast ⚡ Function Calling`: For common agentic scenarios, let Arch clarify and convert prompts to tools and APIs.
- `Guardrails`: Centrally configure and prevent jailbreaks and harmful outcomes, and ensure safe user interactions.
- `Unified Access to LLMs`: Centralize calls to LLMs with smart retries, automatic cutover, and resilient upstream connections for continuous availability.
- `Observability`: W3C Trace Context-compatible request tracing, ensuring compatibility with popular observability tools, plus metrics to monitor latency, token usage, and error rates.
- `Built on Envoy`: Arch runs alongside application servers as a separate containerized process, and builds on top of [Envoy's](https://envoyproxy.io) proven HTTP management and scalability features to handle ingress and egress traffic related to prompts and LLMs.
- `🚦 Routing`: Engineered with purpose-built [LLMs](https://huggingface.co/collections/katanemo/arch-function-66f209a693ea8df14317ad68) for blazing fast (<100ms) routing and hand-off decisions to downstream agents.
- `⚡ Function Calling`: Improve responsiveness of common agentic scenarios by letting Arch clarify and convert prompts to tools/APIs.
- `Guardrails`: Centrally configure and prevent jailbreaks and harmful outcomes, and ensure safe user interactions.
- `🔗 Unified Access to LLMs`: Centralize calls to LLMs with smart retries, automatic cutover, and resilient upstream connections for continuous availability.
- `🕵 Observability`: W3C Trace Context-compatible request tracing, ensuring compatibility with popular observability tools, plus metrics to monitor latency, token usage, and error rates.
- `🧱 Built on Envoy`: Arch runs alongside application servers as a separate containerized process, and builds on top of [Envoy's](https://envoyproxy.io) proven HTTP management and scalability features to handle ingress and egress traffic related to prompts and LLMs.
**High-Level Sequence Diagram**:
![alt text](docs/source/_static/img/arch_network_diagram_high_level.png)