mirror of
https://github.com/katanemo/plano.git
synced 2026-04-25 00:36:34 +02:00
Salmanap/docs v1 push (#92)
* updated model serving, updated the config references, architecture docs and added the llm_provider section * several documentation changes to improve sections like life_of_a_request, model serving subsystem --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
This commit is contained in:
parent
8a4e11077c
commit
7168b14ed3
19 changed files with 375 additions and 119 deletions
|
|
@ -1,11 +1,11 @@
|
|||
Configuration Reference
|
||||
============================
|
||||
|
||||
The following is a complete reference of the prompt-conifg.yml that controls the behavior of an Arch gateway.
|
||||
We've kept things simple (less than 80 lines) and held off on exposing additional functionality (for e.g. suppporting
|
||||
push observability stats, managing prompt-endpoints as virtual cluster, exposing more load balancing options, etc). Our
|
||||
belief that the simple things, should be simple. So we offert good defaults for developers, so that they can spend more
|
||||
of their time in building features unique to their AI experience.
|
||||
The following is a complete reference of the ``prompt-conifg.yml`` that controls the behavior of a single instance of
|
||||
the Arch gateway. We've kept things simple (less than 80 lines) and held off on exposing additional functionality (for
|
||||
e.g. suppporting push observability stats, managing prompt-endpoints as virtual cluster, exposing more load balancing
|
||||
options, etc). Our belief that the simple things, should be simple. So we offert good defaults for developers, so
|
||||
that they can spend more of their time in building features unique to their AI experience.
|
||||
|
||||
.. literalinclude:: /_config/prompt-config-full-reference.yml
|
||||
:language: yaml
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue