Doc Update (#129)

* init update * Update terminology.rst * fix the branch to create an index.html, and fix pre-commit issues * Doc update * made several changes to the docs after Shuguang's revision * fixing pre-commit issues * fixed the reference file to the final prompt config file * added google analytics --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2026-06-05 14:45:15 +02:00 · 2024-10-06 16:54:34 -07:00 · 2024-10-06 16:54:34 -07:00 · 5c7567584d
commit 5c7567584d
parent 2a7b95582c
49 changed files with 1185 additions and 609 deletions
--- a/docs/source/concepts/tech_overview/terminology.rst
+++ b/docs/source/concepts/tech_overview/terminology.rst
@ -0,0 +1,46 @@
+.. _arch_terminology:
+
+Terminology
+============
+
+A few definitions before we dive into the main architecture documentation. Arch borrows from Envoy's terminology
+to keep things consistent in logs, traces and in code.
+
+**Downstream(Ingress)**: An downstream client (web application, etc.) connects to Arch, sends prompts, and receives responses.
+
+**Upstream(Egress)**: An upstream host that receives connections and prompts from Arch, and returns context or responses for a prompt
+
+.. image:: /_static/img/network-topology-ingress-egress.jpg
+   :width: 100%
+   :align: center
+
+**Listener**: A listener is a named network location (e.g., port, address, path etc.) that Arch listens on to process prompts
+before forwarding them to your application server endpoints. rch enables you to configure one listener for downstream connections
+(like port 80, 443) and creates a separate internal listener for calls that initiate from your application code to LLMs.
+
+.. Note::
+
+   When you start Arch, you specify a listener address/port that you want to bind downstream. But, Arch uses are predefined port
+   that you can use (``127.0.0.1:10000``) to proxy egress calls originating from your application to LLMs (API-based or hosted).
+   For more details, check out :ref:`LLM providers <llm_provider>`
+
+**Instance**: An instance of the Arch gateway. When you start Arch it creates at most two processes. One to handle Layer 7
+networking operations (auth, tls, observability, etc) and the second process to serve models that enable it to make smart
+decisions on how to accept, handle and forward prompts. The second process is optional, as the model serving sevice could be
+hosted on a different network (an API call). But these two processes are considered a single instance of Arch.
+
+**Prompt Targets**: Arch offers a primitive called ``prompt_targets`` to help separate business logic from undifferentiated
+work in building generative AI apps. Prompt targets are endpoints that receive prompts that are processed by Arch.
+For example, Arch enriches incoming prompts with metadata like knowing when a request is a follow-up or clarifying prompt
+so that you can build faster, more accurate retrieval (RAG) apps. To support agentic apps, like scheduling travel plans or
+sharing comments on a document - via prompts, Bolt uses its function calling abilities to extract critical information from
+the incoming prompt (or a set of prompts) needed by a downstream backend API or function call before calling it directly.
+
+**Error Targets**: Error targets are those endpoints that receive forwarded errors from Arch when issues arise,
+such as failing to properly call a function/API, detecting violations of guardrails, or encountering other processing errors.
+These errors are communicated to the application via headers (X-Arch-[ERROR-TYPE]), allowing it to handle the errors gracefully
+and take appropriate actions.
+
+**Model Serving**: Arch is a set of **two** self-contained processes that are designed to run alongside your application servers
+(or on a separate hostconnected via a network).The  **model serving** process helps Arch make intelligent decisions about the
+incoming prompts. The model server is designed to call the (fast) purpose-built LLMs in Arch.