plano/crates
aayushwhiz 6fc32b0152 update weather_forecast demo to spin up grafana and prometheus when using monitoring profile
has full dashboard with total requests, time per output token, time to
first token, total latency, output sequence length, and input sequence
length.
2024-11-11 17:00:48 -08:00
common add in time to first token stat 2024-11-08 18:09:37 -08:00
llm_gateway update weather_forecast demo to spin up grafana and prometheus when using monitoring profile 2024-11-11 17:00:48 -08:00
prompt_gateway obfuscate auth header (#254) 2024-11-08 15:17:39 -06:00
Cargo.lock Cotran/hallucination (#208) 2024-10-22 12:52:01 -07:00
Cargo.toml Add cargo workspace to allow rust-analyzer to work correctly (#197) 2024-10-18 15:44:52 -04:00