mirror of
https://github.com/katanemo/plano.git
synced 2026-04-26 01:06:25 +02:00
Docs branch - v1 of our tech docs (#69)
* added the first set of docs for our technical docs * more docuemtnation changes * added support for prompt processing and updated life of a request * updated docs to including getting help sections and updated life of a request * committing local changes for getting started guide, sample applications, and full reference spec for prompt-config * updated configuration reference, added sample app skeleton, updated favico * fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local> Co-authored-by: Adil Hafeez <adil@katanemo.com>
This commit is contained in:
parent
233976a568
commit
80c554ce1a
34 changed files with 1040 additions and 0 deletions
|
|
@ -0,0 +1,6 @@
|
|||
Function Calling (Agentic) Apps
|
||||
===============================
|
||||
|
||||
Building something more than a summary/qa experience requires giving users access to you data and APIs - via prompts.
|
||||
Arch enables that use case by offering a capability called "Function Calling". Arch extracts critical imformation
|
||||
from a prompt and can match the intent of the user to an API or business function hosted in your application.
|
||||
27
docs/source/getting_started/sample_apps/rag.rst
Normal file
27
docs/source/getting_started/sample_apps/rag.rst
Normal file
|
|
@ -0,0 +1,27 @@
|
|||
Retrieval-Augmented Generation (RAG)
|
||||
====================================
|
||||
|
||||
The following section describes how Arch can help you build faster, more smarter Retrieval-Augmented Generation (RAG) applications.
|
||||
|
||||
Intent Markers (Multi-Turn Chat)
|
||||
----------------------------------
|
||||
|
||||
Developers struggle to handle follow-up questions, or clarifying questions from users in their AI applications. Specifically, when
|
||||
users ask for modifications or additions to previous responses, their AI applications often generates entirely new responses instead
|
||||
of adjusting the previous ones. Developers are facing challenges in maintaining context across interactions, despite using tools like
|
||||
ConversationBufferMemory and chat_history from Langchain.
|
||||
|
||||
There are several documented cases of this issue, `here <https://www.reddit.com/r/ChatGPTPromptGenius/comments/17dzmpy/how_to_use_rag_with_conversation_history_for/?>`_,
|
||||
`and here <https://www.reddit.com/r/LocalLLaMA/comments/18mqwg6/best_practice_for_rag_with_followup_chat/>`_ and `again here <https://www.reddit.com/r/LangChain/comments/1bajhg8/chat_with_rag_further_questions/>`_.
|
||||
|
||||
Arch helps developer with intent detection tracking. Arch uses its lightweight NLI and embedding-based intent detection models to know
|
||||
if the user's last prompt represents a new intent or not. This way developers can easily build an intent tracker and only use a subset of prompts
|
||||
to process from the history to improve the retrieval and speed of their applications.
|
||||
|
||||
.. literalinclude:: /_include/intent_detection.py
|
||||
:language: python
|
||||
:linenos:
|
||||
:lines: 77-
|
||||
:emphasize-lines: 15-22
|
||||
:caption: :download:`intent-detection-python-example.py </_include/intent_detection.py>`
|
||||
|
||||
Loading…
Add table
Add a link
Reference in a new issue