updated all demo READMes and minor doc changes (#154)

* updated all demo READMes and minor doc changes * minor typo fixes * updated main Readme * fixed README and docs * fixed README and docs --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2026-05-01 20:03:40 +02:00 · 2024-10-08 23:58:55 -07:00 · 2024-10-08 23:58:55 -07:00 · 42d4a28e13
commit 42d4a28e13
parent b63a01fe82
22 changed files with 324 additions and 1455 deletions
--- a/docs/source/build_with_arch/rag.rst
+++ b/docs/source/build_with_arch/rag.rst
@ -6,16 +6,45 @@ RAG Application
 The following section describes how Arch can help you build faster, smarter and more accurate
 Retrieval-Augmented Generation (RAG) applications.

-Intent-drift Detection
----------------------
-Developers struggle to handle ``follow-up`` or ``clarification`` questions.
-Specifically, when users ask for changes or additions to previous responses their AI applications often generate entirely new responses instead of adjusting previous ones.
-Arch offers **intent-drift** tracking as a feature so that developers can know when the user has shifted away from a previous intent so that they can dramatically improve retrieval accuracy, lower overall token cost and  improve the speed of their responses back to users.
+Parameter Extraction for RAG
+----------------------------
+
+To build RAG (Retrieval-Augmented Generation) applications, you can configure prompt targets with parameters,
+enabling Arch to retrieve critical information in a structured way for processing. This approach improves the
+retrieval quality and speed of your application. By extracting parameters from the conversation, you can pull
+the appropriate chunks from a vector database or SQL-like data store to enhance accuracy. With Arch, you can
+streamline data retrieval and processing to build more efficient and precise RAG applications.
+
+Step 1: Define Prompt Targets
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+.. literalinclude:: includes/rag/prompt_targets.yaml
+    :language: yaml
+    :caption: Prompt Targets
+    :linenos:
+
+Step 2: Process Request Parameters in Flask
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+Once the prompt targets are configured as above, handling those parameters is
+
+.. literalinclude:: includes/rag/parameter_handling.py
+    :language: python
+    :caption: Parameter handling with Flask
+    :linenos:
+
+[Coming Soon] `Drift Detection via Arch Intent-Markers <https://github.com/orgs/katanemo/projects/1/views/1?pane=issue&itemId=82697909>`_
+-----------------------------------------------------------------------------------------------------------------------------------------
+Developers struggle to efficiently handle ``follow-up`` or ``clarification`` questions. Specifically, when users ask for
+changes or additions to previous responses their AI applications often generate entirely new responses instead of adjusting
+previous ones.Arch offers **intent** tracking as a feature so that developers can know when the user has shifted away from a
+previous intent so that they can dramatically improve retrieval accuracy, lower overall token cost and  improve the speed of
+their responses back to users.

 Arch uses its built-in lightweight NLI and embedding models to know if the user has steered away from an active intent.
 Arch's intent-drift detection mechanism is based on its' :ref:`prompt_targets <prompt_target>` primtive. Arch tries to match an incoming
 prompt to one of the prompt_targets configured in the gateway. Once it detects that the user has moved away from an active
-active intent, Arch adds the ``x-arch-intent-drift`` headers to the request before sending it your application servers.
+active intent, Arch adds the ``x-arch-intent-marker`` headers to the request before sending it your application servers.

 .. literalinclude:: includes/rag/intent_detection.py
    :language: python
@ -61,30 +90,3 @@ Step 3: Get Messages based on latest drift
 You can used the last set of messages that match to an intent to prompt an LLM, use it with an vector-DB for
 improved retrieval, etc. With Arch and a few lines of code, you can improve the retrieval accuracy, lower overall
 token cost and dramatically improve the speed of their responses back to users.
-
-Parameter Extraction for RAG
----------------------------
-
-To build RAG (Retrieval-Augmented Generation) applications, you can configure prompt targets with parameters,
-enabling Arch to retrieve critical information in a structured way for processing. This approach improves the
-retrieval quality and speed of your application. By extracting parameters from the conversation, you can pull
-the appropriate chunks from a vector database or SQL-like data store to enhance accuracy. With Arch, you can
-streamline data retrieval and processing to build more efficient and precise RAG applications.
-
-Step 1: Define Prompt Targets
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-
-.. literalinclude:: includes/rag/prompt_targets.yaml
-    :language: yaml
-    :caption: Prompt Targets
-    :linenos:
-
-Step 2: Process Request Parameters in Flask
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-
-Once the prompt targets are configured as above, handling those parameters is
-
-.. literalinclude:: includes/rag/parameter_handling.py
-    :language: python
-    :caption: Parameter handling with Flask
-    :linenos: