plano/crates/llm_gateway
aayushwhiz 6fc32b0152 update weather_forecast demo to spin up grafana and prometheus when using monitoring profile
has full dashboard with total requests, time per output token, time to
first token, total latency, output sequence length, and input sequence
length.
2024-11-11 17:00:48 -08:00
..
src fix after merge 2024-11-08 18:09:37 -08:00
tests update weather_forecast demo to spin up grafana and prometheus when using monitoring profile 2024-11-11 17:00:48 -08:00
Cargo.lock Code refactor and some improvements - see description (#194) 2024-10-18 12:53:44 -07:00
Cargo.toml split wasm filter (#186) 2024-10-16 14:20:26 -07:00