|
|
cef71df3df
|
feat: add ctx-size for llama-swap models to dashboard
PR Tests / test (pull_request) Successful in 1m19s
NYX Security Scan / nyx-scan (pull_request) Successful in 6m15s
|
2026-06-15 19:09:55 +02:00 |
|
|
|
aa8baebac5
|
feat: add llama-swap as a backend
PR Tests / test (pull_request) Successful in 1m18s
NYX Security Scan / nyx-scan (pull_request) Successful in 6m19s
|
2026-06-14 16:34:31 +02:00 |
|
|
|
b28f175b61
|
feat: transparent openai responses api integration
|
2026-06-10 18:48:26 +02:00 |
|
|
|
3cd530586c
|
feat: cache backend clients per endpoint instead of building one (with a fresh SSL context) per request
Build and Publish Docker Image (Semantic Cache) / build (amd64, linux/amd64, docker-amd64) (push) Successful in 3m59s
Build and Publish Docker Image / build (amd64, linux/amd64, docker-amd64) (push) Successful in 1m25s
Build and Publish Docker Image / build (arm64, linux/arm64, docker-arm64) (push) Successful in 12m46s
Build and Publish Docker Image / merge (push) Successful in 33s
Build and Publish Docker Image (Semantic Cache) / build (arm64, linux/arm64, docker-arm64) (push) Successful in 19m56s
Build and Publish Docker Image (Semantic Cache) / merge (push) Successful in 33s
|
2026-06-07 09:55:54 +02:00 |
|
|
|
497c87b02e
|
refac: code deduplication for error handling and call sites
Build and Publish Docker Image (Semantic Cache) / build (amd64, linux/amd64, docker-amd64) (push) Successful in 4m2s
Build and Publish Docker Image / build (amd64, linux/amd64, docker-amd64) (push) Successful in 1m37s
Build and Publish Docker Image (Semantic Cache) / build (arm64, linux/arm64, docker-arm64) (push) Successful in 17m43s
Build and Publish Docker Image (Semantic Cache) / merge (push) Successful in 34s
Build and Publish Docker Image / build (arm64, linux/arm64, docker-arm64) (push) Successful in 12m47s
Build and Publish Docker Image / merge (push) Successful in 33s
|
2026-06-04 10:57:33 +02:00 |
|
|
|
d3b2ee3047
|
feat: surface an upstream ollama backend error transitively from a streaming generator
|
2026-06-04 10:33:47 +02:00 |
|
|
|
820e217da6
|
fix: Lightweight health/introspection probes no longer compete with long-lived streaming completions for the proxy pool's per-host connection slots
|
2026-05-28 09:54:53 +02:00 |
|
|
|
4b5a70e787
|
refac: modularize apis VII
|
2026-05-19 14:57:39 +02:00 |
|