mirror of
https://github.com/katanemo/plano.git
synced 2026-06-17 15:25:17 +02:00
Update README.md
This commit is contained in:
parent
3fcb63d357
commit
668a8e4793
1 changed files with 6 additions and 6 deletions
12
README.md
12
README.md
|
|
@ -38,12 +38,12 @@ And you think to youself, can't I move faster by focusing on higher-level object
|
|||
|
||||
**Core Features**:
|
||||
|
||||
- `Routing`. Engineered with purpose-built [LLMs](https://huggingface.co/collections/katanemo/arch-function-66f209a693ea8df14317ad68) for blazng fast (<100ms) routing and hand-off decisions to downstream agents.
|
||||
- `Fast ⚡ Function Calling`: For common agentic scenarios let Arch clarfiy and convert prompts to tools and APIs.
|
||||
- `Guardrails`: Centrally configure and prevent jailbreaks and harmful outcomes, and ensure safe user interactions.
|
||||
- `Unified Access to LLMs`: Centralize= calls to LLMs with smart retries, automatic cutover, and resilient upstream connections for continuous availability.
|
||||
- `Observability`: W3C Trace Context compatiblerequest tracing, ensuring compatibility with popular observability tools, and provides metrics to monitor latency, token usage, and error rates.
|
||||
- `Built on Envoy`: Arch runs alongside application servers as a separate containerized process, and builds on top of [Envoy's](https://envoyproxy.io) proven HTTP management and scalability features to handle ingress and egress traffic related to prompts and LLMs.
|
||||
- `🚦 Routing`. Engineered with purpose-built [LLMs](https://huggingface.co/collections/katanemo/arch-function-66f209a693ea8df14317ad68) for blazng fast (<100ms) routing and hand-off decisions to downstream agents.
|
||||
- `⚡ Function Calling`: Improve responsiveness of common agentic scenarios by letting Arch clarfiy and convert prompts to tools/APIs.
|
||||
- `⛨ Guardrails`: Centrally configure and prevent jailbreaks and harmful outcomes, and ensure safe user interactions.
|
||||
- `🔗 Unified Access to LLMs`: Centralize= calls to LLMs with smart retries, automatic cutover, and resilient upstream connections for continuous availability.
|
||||
- `🕵 Observability`: W3C Trace Context compatiblerequest tracing, ensuring compatibility with popular observability tools, and provides metrics to monitor latency, token usage, and error rates.
|
||||
- `🧱 Built on Envoy`: Arch runs alongside application servers as a separate containerized process, and builds on top of [Envoy's](https://envoyproxy.io) proven HTTP management and scalability features to handle ingress and egress traffic related to prompts and LLMs.
|
||||
|
||||
**High-Level Sequence Diagram**:
|
||||

|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue