plano/crates
aayushwhiz 8fb5c4eceb Add in Latency and output_sequence_length
added latency histogram and ouput sequency length histogram to the wasm
metrics. Updated stream context so that When the end_stream is recieved,
it stores the time since request was sent as well as total number of
tokens up till that point.
2024-11-08 18:09:37 -08:00
..
common add in time to first token stat 2024-11-08 18:09:37 -08:00
llm_gateway Add in Latency and output_sequence_length 2024-11-08 18:09:37 -08:00
prompt_gateway obfuscate auth header (#254) 2024-11-08 15:17:39 -06:00
Cargo.lock Cotran/hallucination (#208) 2024-10-22 12:52:01 -07:00
Cargo.toml Add cargo workspace to allow rust-analyzer to work correctly (#197) 2024-10-18 15:44:52 -04:00