7a667c3a08
Merge pull request 'fix: added PAT REGISTRY_TOKEN' ( #12 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Successful in 43m35s
Build and Publish Docker Image / build-and-push (push) Successful in 27m49s
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/12
2026-04-02 19:27:38 +02:00
403abd5357
fix: added PAT REGISTRY_TOKEN
2026-04-02 19:26:43 +02:00
50c73c2f6e
Merge pull request 'fix: secrets' ( #11 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Failing after 43m26s
Build and Publish Docker Image / build-and-push (push) Has been cancelled
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/11
2026-04-02 18:43:24 +02:00
0a69e56e61
fix: secrets
2026-04-02 18:42:52 +02:00
0e6fc72324
Merge pull request 'fix: buildkit container network access' ( #10 ) from dev-v0.7.x into main
...
Build and Publish Docker Image / build-and-push (push) Has been cancelled
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Has been cancelled
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/10
2026-04-02 17:38:33 +02:00
9a1dff4649
fix: buildkit container network access
2026-04-02 17:37:39 +02:00
e66d5f5fb1
Merge pull request 'fix: add dns' ( #9 ) from dev-v0.7.x into main
...
Build and Publish Docker Image / build-and-push (push) Has been cancelled
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Has been cancelled
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/9
2026-04-02 17:18:02 +02:00
ceed676b94
fix: add dns
2026-04-02 17:16:53 +02:00
bd51490659
Merge pull request 'fix: revert registry url' ( #8 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Failing after 20m2s
Build and Publish Docker Image / build-and-push (push) Has been cancelled
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/8
2026-04-02 16:57:01 +02:00
49c7030e1d
fix: revert registry url
2026-04-02 16:56:08 +02:00
c15f7e0be6
Merge pull request 'fix: correct registry path' ( #7 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Failing after 19m18s
Build and Publish Docker Image / build-and-push (push) Failing after 19m17s
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/7
2026-04-02 14:51:38 +02:00
df445bab88
fix: correct registry path
2026-04-02 14:51:05 +02:00
62b2eed04d
Merge pull request 'fix: remove dind' ( #6 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Failing after 19m16s
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/6
2026-04-02 13:43:27 +02:00
a5fb82d006
fix: remove dind
2026-04-02 13:42:49 +02:00
c3b1b0f864
Merge pull request 'fix: wait for docker daemon for docker info to succeed' ( #5 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Failing after 1m21s
Build and Publish Docker Image / build-and-push (push) Failing after 1m20s
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/5
2026-04-02 13:40:02 +02:00
82f2c034da
fix: wait for docker daemon for docker info to succeed
2026-04-02 13:39:25 +02:00
ac6bcbf296
Merge pull request 'fix: bypass automatic repo base_url' ( #4 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Failing after 24s
Build and Publish Docker Image / build-and-push (push) Failing after 23s
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/4
2026-04-02 13:36:31 +02:00
3d01ba7408
fix: bypass automatic repo base_url
2026-04-02 13:35:49 +02:00
748bb1e932
Merge pull request 'fix: repo url' ( #3 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Failing after 53s
Build and Publish Docker Image / build-and-push (push) Failing after 51s
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/3
2026-04-02 13:31:08 +02:00
929a972c16
fix: repo url
2026-04-02 13:30:01 +02:00
18f99b402b
Merge pull request 'feat: add forgejo workflows' ( #2 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Failing after 2m30s
Build and Publish Docker Image / build-and-push (push) Failing after 49s
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/2
2026-04-02 13:13:58 +02:00
1e709814c7
feat: add forgejo workflows
2026-04-02 12:49:04 +02:00
ba1b2fb651
Merge pull request 'dev-v0.7.x to prod' ( #1 ) from dev-v0.7.x into main
...
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Has been cancelled
Build and Publish Docker Image / build-and-push (push) Has been cancelled
Reviewed-on: https://bitfreedom.net/code/code/nomyo-ai/nomyo-router/pulls/1
2026-04-02 09:17:59 +02:00
b899ac8559
feat: add all models to TPS graph in dashboard
2026-04-01 18:10:48 +02:00
f0dd124118
doc: update repo base_url
2026-04-01 17:00:14 +02:00
031de165a1
feat: prettyfy dashboard
Build and Publish Docker Image / build-and-push (push) Has been cancelled
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Has been cancelled
2026-03-27 16:24:57 +01:00
c796fd6a47
fix: add missing git to docker for semcache dependency install
2026-03-23 17:06:46 +01:00
c0dc0a10af
fix: catch non-standard openai sdk error bodies for parsing
2026-03-12 19:08:01 +01:00
1e9996c393
fix: exclude embedding models from preemptive context shift caches
2026-03-12 18:56:51 +01:00
21d6835253
Merge pull request #37 from nomyo-ai/dev-v0.7.x-semcache
...
Dev v0.7.x semcache addtl. feature
2026-03-12 16:08:23 +01:00
e416542bf8
fix: model name normalization for context_cash preemptive context-shifting for smaller context-windows with previous failure
2026-03-12 16:08:01 +01:00
be60a348e1
fix: changing error_cache to stale-while-revalidate same as available_models_cache
2026-03-12 14:47:54 +01:00
9acc37951a
feat: add reactive auto context-shift in openai endpoints to prevent recover from out of context errors
2026-03-12 10:15:52 +01:00
95c643109a
feat: add an openai retry if request with image is send to a pure text model
2026-03-12 10:06:18 +01:00
1ae989788b
fix(router): normalize multimodal input to extract text for embeddings
...
Extract text parts from multimodal payloads (lists/dicts).
Skip image_url and other non-text types to ensure embedding
models receive compatible text-only input.
2026-03-11 16:41:21 +01:00
7468bfffbb
Merge branch 'main' into dev-v0.7.x
2026-03-11 09:47:13 +01:00
ca773d6ddb
Merge pull request #35 from nomyo-ai/dev-v0.7.x-semcache
...
Dev v0.7.x semcache -> dev-v0.7.x
2026-03-11 09:40:55 +01:00
46da392a53
fix: semcache version pinned
2026-03-11 09:40:00 +01:00
95d03d828e
Merge pull request #34 from nomyo-ai/dev-v0.7.x
...
docs: adding ghcr docker pull instructions
2026-03-10 15:58:45 +01:00
fbdc73eebb
fix: improvements, fixes and opt-in cache
...
doc: semantic-cache.md added with detailed write-up
2026-03-10 15:19:37 +01:00
a5108486e3
conf: clean default conf
2026-03-08 09:35:40 +01:00
e8b8981421
doc: updated usage.md
2026-03-08 09:26:53 +01:00
dd4b12da6a
feat: adding a semantic cache layer
2026-03-08 09:12:09 +01:00
c3d47c7ffe
docs: adding ghcr docker pull instructions
2026-03-05 11:54:42 +01:00
cce8e66c3e
Merge pull request #32 from nomyo-ai/dev-v0.7.x
...
Dev v0.7.x -> main
2026-03-05 11:12:38 +01:00
b951cc82e3
bump version
2026-03-05 11:09:20 +01:00
00a06dca51
feat: add docker publish workflow
2026-03-05 11:09:16 +01:00
e51969a2bb
Merge pull request #30 from nomyo-ai/dev-v0.7.x
...
- improved performance
- added /v1/rerank endpoint
- refactor of choose_endpoints for atomic upgrade of usage counters
- fixes for security, type- and keyerrors
- improved database handling
2026-03-04 11:01:22 +01:00
8037706f0b
fix(db.py): remove full table scans with proper where clauses for dashboard statistics and calc in db rather than python
2026-03-03 17:20:33 +01:00
45315790d1
fix(router.py):
...
- added global for orphaned token_worker_task and flust_task
- fixed a regex to effectively _mask_secrets
- fixed several Type and KeyErrors
- fixed model deduplication for llama_server_endpoints
2026-03-03 16:34:16 +01:00