mirror of
https://github.com/katanemo/plano.git
synced 2026-06-17 15:25:17 +02:00
added latency histogram and ouput sequency length histogram to the wasm metrics. Updated stream context so that When the end_stream is recieved, it stores the time since request was sent as well as total number of tokens up till that point. |
||
|---|---|---|
| .. | ||
| common | ||
| llm_gateway | ||
| prompt_gateway | ||
| Cargo.lock | ||
| Cargo.toml | ||