trustgraph/trustgraph-flow/trustgraph/model/text_completion
cybermaggedon 0e03bc05a4
Refactor rate limit handling (#280)
* Refactored retry for rate limits into the base class
* ConsumerProducer is derived from Consumer to simplify code
* Added rate_limit_count metrics for rate limit events

* Add rate limit events to VertexAI and Google AI Studio

* Added Grafana rate limit dashboard

* Add rate limit handling to all LLMs
2025-01-27 17:04:49 +00:00
azure Refactor rate limit handling (#280) 2025-01-27 17:04:49 +00:00
azure_openai Refactor rate limit handling (#280) 2025-01-27 17:04:49 +00:00
claude Refactor rate limit handling (#280) 2025-01-27 17:04:49 +00:00
cohere Refactor rate limit handling (#280) 2025-01-27 17:04:49 +00:00
googleaistudio Refactor rate limit handling (#280) 2025-01-27 17:04:49 +00:00
llamafile Refactor rate limit handling (#280) 2025-01-27 17:04:49 +00:00
ollama Refactor rate limit handling (#280) 2025-01-27 17:04:49 +00:00
openai Refactor rate limit handling (#280) 2025-01-27 17:04:49 +00:00
__init__.py Feature/pkgsplit (#83) 2024-09-30 19:36:09 +01:00
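
The commit message above describes moving the rate-limit retry out of the individual LLM modules and into a shared base class, with a rate_limit_count metric recorded for each event. A minimal sketch of that pattern follows. It is not the TrustGraph implementation: RateLimitError, BaseCompletion, FlakyProvider, handle, complete and the backoff parameters are all hypothetical names chosen for illustration, and the counter is a plain integer standing in for whatever metrics client the project uses.

```python
import time


class RateLimitError(Exception):
    """Hypothetical: raised by a provider when the LLM API reports a rate limit."""


class BaseCompletion:
    """Hypothetical base class that owns the retry loop for every provider."""

    def __init__(self, max_retries=3, backoff_seconds=1.0, sleep=time.sleep):
        self.max_retries = max_retries
        self.backoff_seconds = backoff_seconds
        self.sleep = sleep  # injectable so the retry loop can run without waiting
        self.rate_limit_count = 0  # stand-in for a real metrics counter

    def complete(self, prompt):
        """Provider-specific call; subclasses raise RateLimitError when throttled."""
        raise NotImplementedError

    def handle(self, prompt):
        """Call complete(), retrying with exponential backoff on rate limits."""
        for attempt in range(self.max_retries + 1):
            try:
                return self.complete(prompt)
            except RateLimitError:
                # Count every rate limit event, including the final one.
                self.rate_limit_count += 1
                if attempt == self.max_retries:
                    raise
                self.sleep(self.backoff_seconds * 2 ** attempt)


class FlakyProvider(BaseCompletion):
    """Toy provider that is rate limited a fixed number of times, then succeeds."""

    def __init__(self, failures, **kwargs):
        super().__init__(**kwargs)
        self.failures = failures

    def complete(self, prompt):
        if self.failures > 0:
            self.failures -= 1
            raise RateLimitError("rate limited")
        return f"echo: {prompt}"


if __name__ == "__main__":
    provider = FlakyProvider(failures=2, sleep=lambda seconds: None)
    print(provider.handle("hello"))
    print(provider.rate_limit_count)
```

With this shape, each provider module only has to translate its own API's throttling response into the shared exception; the retry policy and the metric live in one place.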