V1 docs push (#86)

* updated docs (again)

* updated the LLMs section, prompt processing section and the RAG section of the docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
This commit is contained in:
Salman Paracha 2024-09-25 23:43:34 -07:00 committed by GitHub
parent 3511798fa8
commit 48a2c1800c
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
13 changed files with 222 additions and 79 deletions

View file

@ -1,10 +1,12 @@
.. _arch_rag_guide:
Retrieval-Augmented (RAG)
====================================
=========================
The following section describes how Arch can help you build faster, smarter and more accurate
Retrieval-Augmented Generation (RAG) applications.
Intent-drift detection
Intent-drift Detection
----------------------
Developers struggle to handle `follow-up <https://www.reddit.com/r/ChatGPTPromptGenius/comments/17dzmpy/how_to_use_rag_with_conversation_history_for/?>`_
@ -65,8 +67,8 @@ You can used the last set of messages that match to an intent to prompt an LLM,
improved retrieval, etc. With Arch and a few lines of code, you can improve the retrieval accuracy, lower overall
token cost and dramatically improve the speed of their responses back to users.
Smarter retrival with parameter extraction
------------------------------------------
Parameter Extraction for RAG
----------------------------
To build RAG (Retrieval-Augmented Generation) applications, you can configure prompt targets with parameters,
enabling Arch to retrieve critical information in a structured way for processing. This approach improves the