A transparent (O)llama proxy with model-deployment-aware routing that auto-manages multiple (O)llama instances in a given network.
Updated 2026-06-17 13:52:56 +02:00
OpenAI-compatible secure chat client with end-to-end encryption on NOMYO Inference Endpoints
Updated 2026-06-17 13:39:50 +02:00
An open-source, privacy-focused alternative to NotebookLM for teams, with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9
Updated 2026-06-16 20:16:27 +02:00
Open-source voice AI platform. Self-hosted alternative to Vapi and Retell. On-prem, BYOK across speech-to-speech or LLM/STT/TTS, with a visual workflow builder, native MCP, and telephony support.
Updated 2026-06-15 19:26:27 +02:00
Hypernetworks that update LLMs to remember factual information
Updated 2026-06-15 06:31:47 +02:00
A high-performance, real-time ASCII video rendering engine. Streams binary-encoded frames via WebSockets for ultra-low latency, 60 FPS playback using HTML5 Canvas and requestAnimationFrame.
Updated 2026-06-14 20:27:34 +02:00
IAI-MCP fork that works with the opencode plugin and Qdrant DB on non-AVX CPUs
Updated 2026-06-14 16:27:27 +02:00
Fork of claude-for-legal ported to opencode
Updated 2026-06-13 15:38:37 +02:00
The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.
Updated 2026-06-12 05:45:39 +02:00
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
Updated 2026-06-06 00:08:19 +02:00
Template with a pipeline that goes from a model and input data to a fully fine-tuned GGUF
Updated 2026-06-02 17:48:55 +02:00
The only tool you need to generate Shopify import-compatible CSVs from ANY Shopify store on the internet.
Updated 2026-05-31 08:03:15 +02:00
Updated version of the mirror of https://notabug.org/necklace/libray
Updated 2026-05-21 12:22:44 +02:00
Self-bootstrapping recipes for open base LLMs — 14B reaches 80% on HumanEval with no human-written training data. Code, mined pairs, and reproduction guide for the paper.
Updated 2026-05-13 18:09:54 +02:00
Flakestorm — Automated Robustness Testing for AI Agents. Stop guessing if your agent really works. FlakeStorm generates adversarial mutations and exposes failures your manual tests and evals miss.
Updated 2026-04-16 03:21:38 +02:00
Asynchronous semantic prompt/response cache for LLM APIs
Updated 2026-04-01 12:25:48 +02:00
I replicated Ng's RYS method and found that duplicating 3 specific layers in Qwen2.5-32B boosts reasoning by 17% and duplicating layers 12-14 in Devstral-24B improves logical deduction from 0.22→0.76 on BBH — no training, no weight changes, just routing hidden states through the same circuit twice. Tools included. Two AMD GPUs, one evening.
Updated 2026-03-20 02:51:23 +01:00
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Updated 2026-01-21 11:12:32 +01:00
A Python package for text sanitization with differential privacy
Updated 2025-12-25 06:23:10 +01:00