Commit graph

11 commits

Author SHA1 Message Date
cybermaggedon
57663742e6
Fix bedrock: (#331)
- Fix missing await
- Fix missing error response
2025-03-27 15:17:08 +00:00
cybermaggedon
1db6dd5dfd
Support bedrock inference profiles (#314)
* Break out enums for different model types

* Add model detection for inference profiles in US and EU

* Encapsulate model handling, make it easier to manage
2025-03-15 12:39:15 +00:00
cybermaggedon
f350abb415
Maint/asyncio (#305)
* Move to asyncio services, even though everything is largely sync
2025-02-11 23:24:46 +00:00
cybermaggedon
d1e9577e7f
Fix rate limit handler, incomplete (#293) 2025-01-29 21:13:17 +00:00
cybermaggedon
701ec1e27e
Fix startup error on import (#292) 2025-01-29 19:11:08 +00:00
cybermaggedon
1543a0650d
Better aws integration (#291)
* - More AWS Boto3 settings (profile and session key)
- Align environment variable and profile setting names with AWS
  conventions.
Hopefully this should be able to run from an EC2 instance just with role
setting.

* Tweak naming to all make sense, added rate limit detect
2025-01-29 14:38:16 +00:00
cybermaggedon
0e03bc05a4
Refactor rate limit handling (#280)
* - Refactored retry for rate limits into the base class
- ConsumerProducer is derived from Consumer to simplify code
- Added rate_limit_count metrics for rate limit events

* Add rate limit events to VertexAI and Google AI Studio

* Added Grafana rate limit dashboard

* Add rate limit handling to all LLMs
2025-01-27 17:04:49 +00:00
cybermaggedon
65cda7b276
Implement system in text completion API (#137)
* Add system prompt to LLM invocation

* Added system parameter to LLMs

* Added to Bedrock and VertexAI
2024-11-05 22:46:17 +00:00
cybermaggedon
25983d1557
Fix Bedrock (#119) 2024-10-15 19:21:43 +01:00
cybermaggedon
86288339cf
Feature/environment var creds (#116)
- Change templates to interpolate environment variables in docker compose
- Change templates to invoke secrets for environment variable credentials in K8s configuration
- Update LLMs to pull in credentials from environment variables if not specified
2024-10-15 00:34:52 +01:00
cybermaggedon
9b91d5eee3
Feature/pkgsplit (#83)
* Starting to spawn base package
* More package hacking
* Bedrock and VertexAI
* Parquet split
* Updated templates
* Utils
2024-09-30 19:36:09 +01:00