dev-0.9.x-ref -> main #113

Merged
alpha-nerd merged 19 commits from dev-0.9.x-ref into main 2026-06-10 15:48:31 +02:00

19 commits

Author SHA1 Message Date
d163fea154
fix: remove aioresponses
All checks were successful
PR Tests / test (pull_request) Successful in 1m16s
NYX Security Scan / nyx-scan (pull_request) Successful in 6m1s
sec: bumb aiohttp 3.14

fix: tiktoken test issue by pre-cache the vocab file
2026-06-07 13:23:35 +02:00
3cd530586c
feat: cache backend clients per endpoint instead of building one (with a fresh SSL context) per request
All checks were successful
Build and Publish Docker Image (Semantic Cache) / build (amd64, linux/amd64, docker-amd64) (push) Successful in 3m59s
Build and Publish Docker Image / build (amd64, linux/amd64, docker-amd64) (push) Successful in 1m25s
Build and Publish Docker Image / build (arm64, linux/arm64, docker-arm64) (push) Successful in 12m46s
Build and Publish Docker Image / merge (push) Successful in 33s
Build and Publish Docker Image (Semantic Cache) / build (arm64, linux/arm64, docker-arm64) (push) Successful in 19m56s
Build and Publish Docker Image (Semantic Cache) / merge (push) Successful in 33s
2026-06-07 09:55:54 +02:00
1ce792c48b
feat: new load test added 2026-06-07 09:38:14 +02:00
75d204e7f3
feat: use SSE reconnect to prevent API Key modal to pop up in dashboard if no API Key is configured 2026-06-07 09:29:06 +02:00
497c87b02e
refac: code deduplication for error handling and call sites
All checks were successful
Build and Publish Docker Image (Semantic Cache) / build (amd64, linux/amd64, docker-amd64) (push) Successful in 4m2s
Build and Publish Docker Image / build (amd64, linux/amd64, docker-amd64) (push) Successful in 1m37s
Build and Publish Docker Image (Semantic Cache) / build (arm64, linux/arm64, docker-arm64) (push) Successful in 17m43s
Build and Publish Docker Image (Semantic Cache) / merge (push) Successful in 34s
Build and Publish Docker Image / build (arm64, linux/arm64, docker-arm64) (push) Successful in 12m47s
Build and Publish Docker Image / merge (push) Successful in 33s
2026-06-04 10:57:33 +02:00
2dceece0d6
feat: add test for ollama stream errors 2026-06-04 10:42:18 +02:00
d3b2ee3047
feat: surface an upstream ollama backend error transitively from a streaming generator 2026-06-04 10:33:47 +02:00
b754daf1af
feat: after closing the probe session, reset
All checks were successful
Build and Publish Docker Image (Semantic Cache) / build (amd64, linux/amd64, docker-amd64) (push) Successful in 3m52s
Build and Publish Docker Image / build (amd64, linux/amd64, docker-amd64) (push) Successful in 1m23s
Build and Publish Docker Image (Semantic Cache) / build (arm64, linux/arm64, docker-arm64) (push) Successful in 15m16s
Build and Publish Docker Image (Semantic Cache) / merge (push) Successful in 34s
Build and Publish Docker Image / build (arm64, linux/arm64, docker-arm64) (push) Successful in 11m59s
Build and Publish Docker Image / merge (push) Successful in 33s
2026-05-28 10:16:54 +02:00
820e217da6
fix: Lightweight health/introspection probes no longer compete with long-lived streaming completions for the proxy pool's per-host connection slots 2026-05-28 09:54:53 +02:00
13d796817f
feat: add authorization header to llama model endpoint fetch 2026-05-28 09:32:20 +02:00
4b5a70e787
refac: modularize apis VII 2026-05-19 14:57:39 +02:00
e74f5d1ba6
refac: request handling VI 2026-05-19 14:09:52 +02:00
8355bf9a1e
refac: modularize sse, routing, db and token handling V 2026-05-19 12:48:55 +02:00
3a9854c5db
refac: modularize backend IV 2026-05-19 12:05:51 +02:00
c88ba1e5a4
refac: modularize global states III 2026-05-19 11:18:06 +02:00
d2b31b6c7b
refac: modularize config II 2026-05-19 11:00:50 +02:00
90b6868f5a
refac: split into modules I 2026-05-19 10:05:27 +02:00
078855ba9a Merge pull request 'feat: completion errors on an endpoint:model key a caught, cached and rerouted (openai compatible endpoints)' (#87) from dev-0.9.x-completion-error-cache into dev-0.9.x
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/87
2026-05-19 07:40:40 +02:00
079b677e23
feat: completion errors on an endpoint:model key a caught, cached and rerouted (openai compatible endpoints)
All checks were successful
PR Tests / test (pull_request) Successful in 57s
2026-05-18 18:14:28 +02:00