feat: Vestige v1.6.0 — 6x storage reduction, neural reranking, instant startup

Four internal optimizations for dramatically better performance:

1. F16 vector quantization (ScalarKind::F16 in USearch) — 2x storage savings
2. Matryoshka 256-dim truncation (768→256) — 3x embedding storage savings
3. Convex Combination fusion (0.3 keyword / 0.7 semantic) replacing RRF
4. Cross-encoder reranker (Jina Reranker v1 Turbo via fastembed TextRerank)

Combined: 6x vector storage reduction, ~20% better retrieval quality.
Cross-encoder loads in background — server starts instantly.
Old 768-dim embeddings auto-migrated on load.

614 tests pass, zero warnings.
This commit is contained in:
Sam Valladares 2026-02-19 01:09:39 -06:00
parent 5b7d22d427
commit 495a88331f
19 changed files with 195 additions and 98 deletions

View file

@ -1,6 +1,6 @@
{
"name": "@vestige/init",
"version": "1.5.0",
"version": "1.6.0",
"description": "Give your AI a brain in 10 seconds — zero-config Vestige installer",
"bin": {
"vestige-init": "bin/init.js"