Commit graph

262 commits

Author SHA1 Message Date
e7cd8d4d68
fix: usage locks now release before the subscriber queue awaits 2026-04-07 15:30:52 +02:00
2c87472483
fix: conditional to_thread for the image_transform to relieve threadpool pressure 2026-04-07 13:28:34 +02:00
81013ec3b1
fix: available_error_cache poisoning 2026-04-07 09:32:53 +02:00
5170162a80
fix: make image transform non-blocking 2026-04-07 09:18:12 +02:00
e912b71b5c
doc: update readme 2026-04-07 09:14:00 +02:00
6bc4157d65
feat: add workflow cleanup 2026-04-06 15:13:01 +02:00
28afa4e9c0
fix: missing requirement
fix: strip assistant prefill when ollama -> openai translation + openai guard
2026-04-06 11:32:47 +02:00
2aa475ce5e
fix: strip mediatypes for oci 2026-04-06 09:18:56 +02:00
ae509ff548
fix: we pray to god 2026-04-05 19:31:49 +02:00
9bf7a6c967
fix: disable summary 2026-04-05 19:01:17 +02:00
0368408832
fix: cache bug 2026-04-05 18:21:06 +02:00
3030b7126b
fix: tagging of imgs 2026-04-05 13:13:31 +02:00
ce8e9c42cf
fix: write cache using standard OCI image manifest format 2026-04-05 13:01:50 +02:00
cb464d30d0
fix: label versions with arch 2026-04-05 10:34:54 +02:00
0cfe6502fe
fix: less caching 2026-04-05 10:13:42 +02:00
2815faf827
fix: mergeblock missing container block 2026-04-05 09:50:44 +02:00
eaed006d39
fix: workflow once more 2026-04-05 09:32:58 +02:00
d4464be581
feat: add matrix build 2026-04-04 18:46:08 +02:00
403abd5357
fix: added PAT REGISTRY_TOKEN 2026-04-02 19:26:43 +02:00
0a69e56e61
fix: secrets 2026-04-02 18:42:52 +02:00
9a1dff4649
fix: buildkit container network access 2026-04-02 17:37:39 +02:00
ceed676b94
fix: add dns 2026-04-02 17:16:53 +02:00
49c7030e1d
fix: revert registry url 2026-04-02 16:56:08 +02:00
df445bab88
fix: correct registry path 2026-04-02 14:51:05 +02:00
a5fb82d006
fix: remove dind 2026-04-02 13:42:49 +02:00
82f2c034da
fix: wait for docker daemon for docker info to succeed 2026-04-02 13:39:25 +02:00
3d01ba7408
fix: bypass automatic repo base_url 2026-04-02 13:35:49 +02:00
929a972c16
fix: repo url 2026-04-02 13:30:01 +02:00
1e709814c7
feat: add forgejo workflows 2026-04-02 12:49:04 +02:00
b899ac8559
feat: add all models to TPS graph in dashboard 2026-04-01 18:10:48 +02:00
f0dd124118 doc: update repo base_url 2026-04-01 17:00:14 +02:00
031de165a1 feat: prettify dashboard
2026-03-27 16:24:57 +01:00
c796fd6a47 fix: add missing git to docker for semcache dependency install 2026-03-23 17:06:46 +01:00
c0dc0a10af fix: catch non-standard openai sdk error bodies for parsing 2026-03-12 19:08:01 +01:00
1e9996c393 fix: exclude embedding models from preemptive context shift caches 2026-03-12 18:56:51 +01:00
21d6835253
Merge pull request #37 from nomyo-ai/dev-v0.7.x-semcache
Dev v0.7.x semcache addtl. feature
2026-03-12 16:08:23 +01:00
e416542bf8 fix: model name normalization for context_cash preemptive context-shifting for smaller context-windows with previous failure 2026-03-12 16:08:01 +01:00
be60a348e1 fix: changing error_cache to stale-while-revalidate same as available_models_cache 2026-03-12 14:47:54 +01:00
9acc37951a feat: add reactive auto context-shift in openai endpoints to recover from out of context errors 2026-03-12 10:15:52 +01:00
95c643109a feat: add an openai retry if request with image is sent to a pure text model 2026-03-12 10:06:18 +01:00
1ae989788b fix(router): normalize multimodal input to extract text for embeddings
Extract text parts from multimodal payloads (lists/dicts).
Skip image_url and other non-text types to ensure embedding
models receive compatible text-only input.
2026-03-11 16:41:21 +01:00
7468bfffbb
Merge branch 'main' into dev-v0.7.x 2026-03-11 09:47:13 +01:00
ca773d6ddb
Merge pull request #35 from nomyo-ai/dev-v0.7.x-semcache
Dev v0.7.x semcache -> dev-v0.7.x
2026-03-11 09:40:55 +01:00
46da392a53 fix: semcache version pinned 2026-03-11 09:40:00 +01:00
95d03d828e
Merge pull request #34 from nomyo-ai/dev-v0.7.x
docs: adding ghcr docker pull instructions
2026-03-10 15:58:45 +01:00
fbdc73eebb fix: improvements, fixes and opt-in cache
doc: semantic-cache.md added with detailed write-up
2026-03-10 15:19:37 +01:00
a5108486e3 conf: clean default conf 2026-03-08 09:35:40 +01:00
e8b8981421 doc: updated usage.md 2026-03-08 09:26:53 +01:00
dd4b12da6a feat: adding a semantic cache layer 2026-03-08 09:12:09 +01:00
c3d47c7ffe docs: adding ghcr docker pull instructions 2026-03-05 11:54:42 +01:00