mirror of
https://github.com/katanemo/plano.git
synced 2026-04-26 01:06:25 +02:00
Tweak readme docs for minor nits (#461)
Co-authored-by: darkdatter <msylvia@tradestax.io>
This commit is contained in:
parent
4d2d8bd7a1
commit
e7b0de2a72
13 changed files with 38 additions and 38 deletions
|
|
@ -13,7 +13,7 @@ Past the thrill of an AI demo, have you found yourself hitting these walls? You
|
|||
- You're **trapped in tedious prompting work** to clarify inputs and user intents?
|
||||
- You're **wasting cycles** choosing and integrating **code for observability** instead of it just happening transparently?
|
||||
|
||||
And you think to youself, can't I move faster by focusing on higher-level objectives in a language and framework agnostic way? Well, you can!
|
||||
And you think to yourself, can't I move faster by focusing on higher-level objectives in a language and framework agnostic way? Well, you can!
|
||||
|
||||
.. figure:: /_static/img/arch_network_diagram_high_level.png
|
||||
:width: 100%
|
||||
|
|
@ -35,7 +35,7 @@ Arch takes a dependency on Envoy and is a self-contained process that is designe
|
|||
Arch uses Envoy's HTTP connection management subsystem, HTTP L7 filtering and telemetry capabilities to extend the functionality exclusively for prompts and LLMs.
|
||||
This gives Arch several advantages:
|
||||
|
||||
* Arch builds on Envoy's proven success. Envoy is used at masssive scale by the leading technology companies of our time including `AirBnB <https://www.airbnb.com>`_, `Dropbox <https://www.dropbox.com>`_, `Google <https://www.google.com>`_, `Reddit <https://www.reddit.com>`_, `Stripe <https://www.stripe.com>`_, etc. Its battle tested and scales linearly with usage and enables developers to focus on what really matters: application features and business logic.
|
||||
* Arch builds on Envoy's proven success. Envoy is used at massive scale by the leading technology companies of our time including `AirBnB <https://www.airbnb.com>`_, `Dropbox <https://www.dropbox.com>`_, `Google <https://www.google.com>`_, `Reddit <https://www.reddit.com>`_, `Stripe <https://www.stripe.com>`_, etc. Its battle tested and scales linearly with usage and enables developers to focus on what really matters: application features and business logic.
|
||||
|
||||
* Arch works with any application language. A single Arch deployment can act as gateway for AI applications written in Python, Java, C++, Go, Php, etc.
|
||||
|
||||
|
|
@ -54,7 +54,7 @@ These LLMs are designed to be best-in-class for critical prompt-related tasks li
|
|||
With prompt guardrails you can prevent ``jailbreak attempts`` present in user's prompts without having to write a single line of code.
|
||||
To learn more about how to configure guardrails available in Arch, read :ref:`Prompt Guard <prompt_guard>`.
|
||||
|
||||
**Traffic Management:** Arch offers several capabilities for LLM calls originating from your applications, including smart retries on errors from upstream LLMs, and automatic cutover to other LLMs configured in Arch for continuous availability and disaster recovery scenarios.
|
||||
**Traffic Management:** Arch offers several capabilities for LLM calls originating from your applications, including smart retries on errors from upstream LLMs, and automatic cut-over to other LLMs configured in Arch for continuous availability and disaster recovery scenarios.
|
||||
Arch extends Envoy's `cluster subsystem <https://www.envoyproxy.io/docs/envoy/latest/intro/arch_overview/upstream/cluster_manager>`_ to manage upstream connections to LLMs so that you can build resilient AI applications.
|
||||
|
||||
**Front/edge Gateway:** There is substantial benefit in using the same software at the edge (observability, traffic shaping algorithms, applying guardrails, etc.) as for outbound LLM inference use cases.
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue