mirror of
https://github.com/katanemo/plano.git
synced 2026-06-17 15:25:17 +02:00
Update README.md
This commit is contained in:
parent
8635555243
commit
18becb9e4b
1 changed files with 6 additions and 6 deletions
12
README.md
12
README.md
|
|
@ -38,12 +38,12 @@ And you think to youself, can't I move faster by focusing on higher-level object
|
|||
|
||||
**Core Features**:
|
||||
|
||||
- `Agent Routing`. Engineered with purpose-built [LLMs](https://huggingface.co/collections/katanemo/arch-function-66f209a693ea8df14317ad68) for blazng fast (<100ms) routing and hand-off decisions to downstream agents.
|
||||
- `Blazing Fast ⚡ Function Calling`: For common agentic scenarios, expose tools as APIs and let Arch detect, clarify and convert prompts to structured APIs
|
||||
- `Prompt [Guardrails](https://huggingface.co/collections/katanemo/arch-guard-6702bdc08b889e4bce8f446d)`: Centralizes guardrails to prevent jailbreak attempts and harmful outcomes, and ensure safe user interactions without writing a single line of code.
|
||||
- `Unified Access to (any) LLM`: Arch centralizes calls to LLMs used by your applications, offering smart retries, automatic cutover, and resilient upstream connections for continuous availability.
|
||||
- `Rich Observability`: Arch uses the W3C Trace Context standard to enable complete request tracing across applications, ensuring compatibility with observability tools, and provides metrics to monitor latency, token usage, and error rates, helping optimize AI application performance.
|
||||
- `Built on [Envoy](https://envoyproxy.io)`: Arch runs alongside application servers as a separate containerized process, and builds on top of Envoy's proven HTTP management and scalability features to handle ingress and egress traffic related to prompts and LLMs.
|
||||
- `Routing`. Engineered with purpose-built [LLMs](https://huggingface.co/collections/katanemo/arch-function-66f209a693ea8df14317ad68) for blazng fast (<100ms) routing and hand-off decisions to downstream agents.
|
||||
- `Fast ⚡ Function Calling`: For common agentic scenarios, expose tools as APIs and let Arch detect, clarify and convert prompts to structured APIs
|
||||
- `Guardrails`: Centralizes guardrails to prevent jailbreak attempts and harmful outcomes, and ensure safe user interactions without writing a single line of code.
|
||||
- `Unified Access to LLMs`: Arch centralizes calls to LLMs used by your applications, offering smart retries, automatic cutover, and resilient upstream connections for continuous availability.
|
||||
- `Observability`: Arch uses the W3C Trace Context standard to enable complete request tracing across applications, ensuring compatibility with observability tools, and provides metrics to monitor latency, token usage, and error rates, helping optimize AI application performance.
|
||||
- `Built on Envoy`: Arch runs alongside application servers as a separate containerized process, and builds on top of [Envoy's](https://envoyproxy.io) proven HTTP management and scalability features to handle ingress and egress traffic related to prompts and LLMs.
|
||||
|
||||
**High-Level Sequence Diagram**:
|
||||

|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue