mirror of
https://github.com/katanemo/plano.git
synced 2026-04-25 16:56:24 +02:00
* init update * Update terminology.rst * fix the branch to create an index.html, and fix pre-commit issues * Doc update * made several changes to the docs after Shuguang's revision * fixing pre-commit issues * fixed the reference file to the final prompt config file * added google analytics --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
9 lines
429 B
ReStructuredText
9 lines
429 B
ReStructuredText
.. _monitoring:
|
|
|
|
Monitoring
|
|
==========
|
|
|
|
Arch offers several monitoring metrics that help you understand three critical aspects of your application:
|
|
latency, token usage, and error rates by an upstream LLM provider. Latency measures the speed at which your
|
|
application is responding to users, which includes metrics like time to first token (TFT), time per output
|
|
token (TOT) metrics, and the total latency as perceived by users.
|