mirror of
https://github.com/katanemo/plano.git
synced 2026-06-17 15:25:17 +02:00
Changes the enforce_ratelimit function by getting token count regardless of if there is a ratelimit or not, allowing for metric to be saved. This essentially is the token count of what is sent to openai, but that is not the tokens being sent by user, so rather than info about usage statistics, it's more relavant to price or cost. Not yet sure if this is the best way to go, but i'll use it for now. |
||
|---|---|---|
| .. | ||
| common | ||
| llm_gateway | ||
| prompt_gateway | ||
| Cargo.lock | ||
| Cargo.toml | ||