plano/crates/llm_gateway
aayushwhiz 8fb5c4eceb Add in Latency and output_sequence_length
Added a latency histogram and an output sequence length histogram to the wasm
metrics. Updated the stream context so that when end_stream is received,
it stores the time since the request was sent as well as the total number of
tokens up to that point.
2024-11-08 18:09:37 -08:00
src Add in Latency and output_sequence_length 2024-11-08 18:09:37 -08:00
tests Add support for streaming and fixes few issues (see description) (#202) 2024-10-28 17:05:06 -07:00
Cargo.lock Code refactor and some improvements - see description (#194) 2024-10-18 12:53:44 -07:00
Cargo.toml split wasm filter (#186) 2024-10-16 14:20:26 -07:00
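
The latest commit message above describes recording two histograms when the end of a streamed response is seen. A minimal sketch of that idea follows, assuming a simplified in-memory histogram rather than the filter's actual metrics API; the names `Histogram`, `Metrics`, `StreamContext`, and `on_response_chunk` are hypothetical and are not taken from the crate's source.

```rust
use std::time::{Duration, Instant};

/// Hypothetical stand-in for a metrics histogram; it only stores raw samples.
#[derive(Debug, Default)]
struct Histogram {
    samples: Vec<u64>,
}

impl Histogram {
    fn record(&mut self, value: u64) {
        self.samples.push(value);
    }

    fn count(&self) -> usize {
        self.samples.len()
    }
}

/// The two histograms named in the commit message (field names are assumed).
#[derive(Debug, Default)]
struct Metrics {
    request_latency_ms: Histogram,
    output_sequence_length: Histogram,
}

/// Per-request state: when the request was sent and how many tokens
/// have been seen in the response so far.
struct StreamContext {
    start_time: Instant,
    response_tokens: u64,
}

impl StreamContext {
    fn new(start_time: Instant) -> Self {
        StreamContext { start_time, response_tokens: 0 }
    }

    /// Accumulate tokens for every chunk; only on the final chunk
    /// (end_of_stream == true) record latency and total token count.
    /// `now` is passed in so the logic can be exercised deterministically.
    fn on_response_chunk(
        &mut self,
        tokens_in_chunk: u64,
        end_of_stream: bool,
        now: Instant,
        metrics: &mut Metrics,
    ) {
        self.response_tokens += tokens_in_chunk;
        if end_of_stream {
            let latency: Duration = now.duration_since(self.start_time);
            metrics.request_latency_ms.record(latency.as_millis() as u64);
            metrics.output_sequence_length.record(self.response_tokens);
        }
    }
}
```

Passing the current time as a parameter, instead of calling `Instant::now()` inside the method, is a choice made for this sketch so that the end-of-stream behavior can be checked without depending on wall-clock timing.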