plano/crates
aayushwhiz cb8e2a772b update stats to output input_sequence_length Histogram
Changes the enforce_ratelimit function to get the token count regardless
of whether a ratelimit is configured, allowing the metric to be saved. This
is essentially the token count of what is sent to OpenAI, which is
not the same as the tokens sent by the user, so rather than describing usage
statistics, it is more relevant to price or cost. Not yet sure if this is
the best way to go, but I'll use it for now.
2024-11-08 18:09:37 -08:00
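
The change described in the commit message above can be pictured with a minimal sketch: count tokens first, record the count in a histogram, and only then apply the ratelimit if one is configured. Everything here is an assumption made for illustration and is not taken from the repository: the `Histogram` and `RateLimit` types, the whitespace-based `count_tokens` placeholder (a real gateway would use a model tokenizer), and the signature of `enforce_ratelimit` are all hypothetical.

```rust
/// Hypothetical stand-in for a metrics histogram; it just stores raw observations.
#[derive(Default)]
struct Histogram {
    samples: Vec<u64>,
}

impl Histogram {
    fn record(&mut self, value: u64) {
        self.samples.push(value);
    }
}

/// Hypothetical ratelimit: a remaining token budget.
struct RateLimit {
    remaining_tokens: u64,
}

/// Placeholder tokenizer (assumption): splits on whitespace, not a real model tokenizer.
fn count_tokens(body: &str) -> u64 {
    body.split_whitespace().count() as u64
}

/// Counts tokens unconditionally and records them, then enforces the
/// ratelimit only if one is configured. Signature is illustrative only.
fn enforce_ratelimit(
    body: &str,
    limit: Option<&mut RateLimit>,
    input_sequence_length: &mut Histogram,
) -> Result<u64, String> {
    // The count is taken before the ratelimit check so the metric is
    // saved even when no ratelimit exists or the request is rejected.
    let tokens = count_tokens(body);
    input_sequence_length.record(tokens);

    if let Some(limit) = limit {
        if tokens > limit.remaining_tokens {
            return Err(format!(
                "ratelimit exceeded: {} tokens requested, {} remaining",
                tokens, limit.remaining_tokens
            ));
        }
        limit.remaining_tokens -= tokens;
    }
    Ok(tokens)
}

fn main() {
    let mut hist = Histogram::default();

    // No ratelimit configured: the token count is still recorded.
    let unlimited = enforce_ratelimit("hello from the gateway", None, &mut hist);
    println!("unlimited: {:?}", unlimited);

    // Ratelimit configured and exceeded: the count is recorded, the request is rejected.
    let mut limit = RateLimit { remaining_tokens: 3 };
    let limited = enforce_ratelimit("one two three four", Some(&mut limit), &mut hist);
    println!("limited: {:?}", limited);
    println!("recorded samples: {:?}", hist.samples);
}
```

Because the count is taken on the outgoing request body, the recorded value reflects what is sent upstream rather than what the user typed, which matches the commit message's note that the metric is closer to cost than to usage.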
common add in time to first token stat 2024-11-08 18:09:37 -08:00
llm_gateway update stats to output input_sequence_length Histogram 2024-11-08 18:09:37 -08:00
prompt_gateway obfuscate auth header (#254) 2024-11-08 15:17:39 -06:00
Cargo.lock Cotran/hallucination (#208) 2024-10-22 12:52:01 -07:00
Cargo.toml Add cargo workspace to allow rust-analyzer to work correctly (#197) 2024-10-18 15:44:52 -04:00