Commit graph

38 commits

Author SHA1 Message Date
263c66aedd
feat: add hostname to dashboard 2026-04-10 17:29:43 +02:00
b899ac8559
feat: add all models to TPS graph in dashboard 2026-04-01 18:10:48 +02:00
031de165a1 feat: prettyfy dashboard
Some checks failed
Build and Publish Docker Image / build-and-push (push) Has been cancelled
Build and Publish Docker Image (Semantic Cache) / build-and-push-semantic (push) Has been cancelled
2026-03-27 16:24:57 +01:00
836c5f41ea fix(router): normalize model names for usage tracking across endpoints 2026-02-17 11:35:53 +01:00
9875eb977a feat: Add tool call normalization and streaming delta accumulation
Adds support for correctly handling tool calls in chat requests. Normalizes tool call data (ensuring IDs, types, and JSON arguments) in non-streaming mode and accumulates OpenAI-style deltas during streaming to build the final Ollama response.
2026-02-10 20:21:46 +01:00
4892998abc feat(router): Add llama-server endpoints support and model parsing
Add `llama_server_endpoints` configuration field to support llama_server OpenAI-compatible endpoints for status checks. Implement helper functions to parse model names and quantization levels from llama-server responses (best effort). Update `is_ext_openai_endpoint` to properly distinguish these endpoints from external OpenAI services. Update sample configuration documentation.
2026-02-10 16:46:51 +01:00
bd0d210b2a feat: enforce api key authentication and update table header
- Added proper API key validation in router.py with 401 response when key is missing
- Implemented CORS headers for authentication requests
- Updated table header from "Until" to "Unload" in static/index.html
- Improved security by preventing API key leakage in access logs
2026-02-01 10:05:46 +01:00
efdf14a207 fix: optimize table column widths and improve time formatting for responsive layout
- Reduced min-width of model columns from 340px to 200px with max-width of 300px
- Added specific styling for narrow columns (3rd-5th) with fixed width and center alignment
- Removed "Instance count" as it has redundant information
- Enhanced time formatting logic to show relative time instead of absolute dates
- Simplified digest display to show last 6 characters instead of truncated format
- Added proper handling for various time value types (number, string, null)
2026-01-29 10:54:43 +01:00
YetheSamartaka
d3aa87ca15 Added endpoint differentiation for models ps board
Added endpoint differentiation for models PS board to see where which model is loaded and for how long to ease the viewing of multiple same models deployed for load balancing
2026-01-27 13:29:54 +01:00
YetheSamartaka
eca4a92a33 add: Optional router-level API key that gates router/API/web UI access
Optional router-level API key that gates router/API/web UI access (leave empty to disable)

## Supplying the router API key

If you set `nomyo-router-api-key` in `config.yaml` (or `NOMYO_ROUTER_API_KEY` env), every request to NOMYO Router must include the key:

- HTTP header (recommended): `Authorization: Bearer <router_key>`
- Query param (fallback): `?api_key=<router_key>`

Examples:
```bash
curl -H "Authorization: Bearer $NOMYO_ROUTER_API_KEY" http://localhost:12434/api/tags
curl "http://localhost:12434/api/tags?api_key=$NOMYO_ROUTER_API_KEY"
```
2026-01-14 09:28:02 +01:00
59a8ef3abb refactor: use a persistent WAL-enabled connection with async locks
- Introduce a lazily initialized, shared aiosqlite connection stored in self._db and two asyncio locks (_db_lock, _operation_lock) for safe concurrent access
- Ensure the database directory exists before connecting and enable WAL journaling and foreign keys on first connect
- Add close method to gracefully close the persistent connection
- Guard initialization and write operations with _operation_lock to ensure single-threaded schema setup
- Switch to ON CONFLICT UPSERT for token_counts updates and initialize token_time_series table
- Add typing for _db (Optional[aiosqlite.Connection]) and adjust imports accordingly

addition: Frontend button with total stats aggregation task and feedback span element to keep user informed and a small database footprint
2025-12-02 12:18:23 +01:00
0ffb321154 fixing total stats model, button, labels and code clean up 2025-11-28 14:59:29 +01:00
45d1d442ee sqlite: adding connection pooling and WAL 2025-11-20 15:37:04 +01:00
aa23a4dd81 fixing timezone issues 2025-11-20 12:53:18 +01:00
0d187e91b9 fixing chart timescales 2025-11-20 09:53:28 +01:00
e0c6861f2f aggregating token_counts for stats over all endpoints and adjusting the color mapping 2025-11-20 09:22:45 +01:00
3f77a8ec62 chart enhancements 2025-11-19 17:28:31 +01:00
79a7ca972b initial chart view 2025-11-19 17:05:25 +01:00
541f2826e0 fixing token_queue, prepping chart view 2025-11-18 19:02:36 +01:00
baf5d98318 adding token timeseries counting in db for future data viz 2025-11-18 11:16:21 +01:00
4c9ec5b1b2 record and display total token usage on ollama endpoints using ollama client 2025-11-04 17:55:19 +01:00
26dcbf9c02 fixing app logic and eventListeners in frontend 2025-10-30 09:06:21 +01:00
43d95fbf38 fixing headers, using ollama.Responses in rechunk class, fixing reseverd words var usage, fixing embedding output, fixing model naming in frontend 2025-09-21 16:20:36 +02:00
2813ecb044 using global aiohttp sessionpool for improved performance 2025-09-10 10:21:49 +02:00
b6a9aa82cb cosmetic fixes 2025-09-08 17:33:32 +02:00
82f4783d8a Replace json response on model pull with only the status message 2025-09-06 16:14:10 +02:00
5132adc8be Replace fontawesome with emoji for less dependencies 2025-09-06 16:07:45 +02:00
62a4c700c3 Improve design and add dark mode with toggle button 2025-09-06 15:37:36 +02:00
ee8788e482
Add files via upload
textformatting improved
2025-09-05 12:00:47 +02:00
a23ccafc5a
Add files via upload
dashboard polling removed in favour for SSE pub:sub
2025-09-05 09:44:35 +02:00
b3b67fdbf2
Add files via upload
BREAKING CHANGE:
- new config.yaml config block
- new dependency: httpx-aiohttp for faster endpoint queries in bigger installations
- new dynamic dashboard
2025-09-04 19:07:28 +02:00
fbce181a81
Add files via upload
herding ollamas
- added management functions to dashboard and updated routes in backend
2025-09-04 15:00:50 +02:00
2f09dbe22c
Add files via upload
adding dashboard copy link
adding copy get route for dashboard
2025-09-04 10:39:10 +02:00
190fa874c7
Add files via upload
cosmetics
2025-09-03 19:20:01 +02:00
f01843d12b
Add files via upload
refined available models view for copy and paste with open ai compatible endpoints
2025-09-02 12:48:19 +02:00
65ad74a452
Add files via upload
smaller adjustments and adds
2025-08-30 17:40:39 +02:00
cb7224c030
Add files via upload
Adding:
- health endpoint
- extended /api/config
- frontend extension of backend server viz
2025-08-30 12:43:35 +02:00
f40516195b
Create index.html 2025-08-30 00:13:35 +02:00