Cognitive memory for AI agents — FSRS-6 spaced repetition, 29 brain modules, 3D dashboard, single 22MB Rust binary. MCP server for Claude, Cursor, VS Code, Xcode, JetBrains. https://github.com/samvallad33/vestige
Find a file
Sam Valladares 495a88331f feat: Vestige v1.6.0 — 6x storage reduction, neural reranking, instant startup
Four internal optimizations for dramatically better performance:

1. F16 vector quantization (ScalarKind::F16 in USearch) — 2x storage savings
2. Matryoshka 256-dim truncation (768→256) — 3x embedding storage savings
3. Convex Combination fusion (0.3 keyword / 0.7 semantic) replacing RRF
4. Cross-encoder reranker (Jina Reranker v1 Turbo via fastembed TextRerank)

Combined: 6x vector storage reduction, ~20% better retrieval quality.
Cross-encoder loads in background — server starts instantly.
Old 768-dim embeddings auto-migrated on load.

614 tests pass, zero warnings.
2026-02-19 01:09:39 -06:00
.github/workflows release: v1.1.3 — security hardening, edition 2024, dependency updates 2026-02-12 03:19:07 -06:00
crates feat: Vestige v1.6.0 — 6x storage reduction, neural reranking, instant startup 2026-02-19 01:09:39 -06:00
docs feat: v2.0 distribution — IDE integrations, zero-config installer, README overhaul 2026-02-12 17:18:15 -06:00
packages feat: Vestige v1.6.0 — 6x storage reduction, neural reranking, instant startup 2026-02-19 01:09:39 -06:00
tests/e2e release: v1.1.3 — security hardening, edition 2024, dependency updates 2026-02-12 03:19:07 -06:00
.gitignore Initial commit: Vestige v1.0.0 - Cognitive memory MCP server 2026-01-25 01:31:03 -06:00
Cargo.lock feat: Vestige v1.6.0 — 6x storage reduction, neural reranking, instant startup 2026-02-19 01:09:39 -06:00
Cargo.toml feat: Vestige v1.6.0 — 6x storage reduction, neural reranking, instant startup 2026-02-19 01:09:39 -06:00
CHANGELOG.md docs: add v1.1.2 to CHANGELOG 2026-01-27 02:34:10 -06:00
CLAUDE.md feat: Vestige v1.5.0 — Cognitive Engine, memory dreaming, graph exploration, predictive retrieval 2026-02-18 23:34:15 -06:00
CLAUDE.md.template chore: add CLAUDE.md template for Vestige integration 2026-02-12 04:33:53 -06:00
CODE_OF_CONDUCT.md Initial commit: Vestige v1.0.0 - Cognitive memory MCP server 2026-01-25 01:31:03 -06:00
CONTRIBUTING.md Initial commit: Vestige v1.0.0 - Cognitive memory MCP server 2026-01-25 01:31:03 -06:00
demo.sh fix: accurate science claims, security docs, remove hardcoded path 2026-01-25 20:29:37 -06:00
LICENSE Initial commit: Vestige v1.0.0 - Cognitive memory MCP server 2026-01-25 01:31:03 -06:00
LICENSE-APACHE Initial commit: Vestige v1.0.0 - Cognitive memory MCP server 2026-01-25 01:31:03 -06:00
LICENSE-MIT Initial commit: Vestige v1.0.0 - Cognitive memory MCP server 2026-01-25 01:31:03 -06:00
package.json feat: Vestige v1.6.0 — 6x storage reduction, neural reranking, instant startup 2026-02-19 01:09:39 -06:00
pnpm-lock.yaml Replace all engram references with vestige 2026-01-25 01:44:11 -06:00
pnpm-workspace.yaml Initial commit: Vestige v1.0.0 - Cognitive memory MCP server 2026-01-25 01:31:03 -06:00
README.md docs: update README for v1.5.0 — 23 tools, cognitive engine, dreaming 2026-02-18 23:39:40 -06:00
SECURITY.md release: v1.1.3 — security hardening, edition 2024, dependency updates 2026-02-12 03:19:07 -06:00

Vestige

The open-source cognitive engine for AI.

GitHub stars Release License MCP Compatible

Your AI forgets everything between sessions. Vestige fixes that. Built on 130 years of memory research — FSRS-6 spaced repetition, prediction error gating, synaptic tagging — all running in a single Rust binary, 100% local.


Give Your AI a Brain in 30 Seconds

# 1. Install
curl -L https://github.com/samvallad33/vestige/releases/latest/download/vestige-mcp-aarch64-apple-darwin.tar.gz | tar -xz
sudo mv vestige-mcp vestige vestige-restore /usr/local/bin/

# 2. Connect
claude mcp add vestige vestige-mcp -s user

# 3. Test
# "Remember that I prefer TypeScript over JavaScript"
# New session → "What are my coding preferences?"
# It remembers.
Other platforms & install methods

macOS (Intel):

curl -L https://github.com/samvallad33/vestige/releases/latest/download/vestige-mcp-x86_64-apple-darwin.tar.gz | tar -xz
sudo mv vestige-mcp vestige vestige-restore /usr/local/bin/

Linux:

curl -L https://github.com/samvallad33/vestige/releases/latest/download/vestige-mcp-x86_64-unknown-linux-gnu.tar.gz | tar -xz
sudo mv vestige-mcp vestige vestige-restore /usr/local/bin/

Windows: Download from Releases

Build from source:

git clone https://github.com/samvallad33/vestige && cd vestige
cargo build --release
sudo cp target/release/{vestige-mcp,vestige,vestige-restore} /usr/local/bin/

npm:

npm install -g vestige-mcp

Works Everywhere

Vestige speaks MCP — the universal protocol for AI tools. One brain, every IDE.

IDE Setup
Claude Code claude mcp add vestige vestige-mcp -s user
Claude Desktop 2-min setup
Xcode 26.3 Integration guide
Cursor Integration guide
VS Code (Copilot) Integration guide
JetBrains Integration guide
Windsurf Integration guide

Fix a bug in VS Code. Open Xcode. Your AI already knows about the fix.


Why Not Just Use RAG?

RAG is a dumb bucket. Vestige is an active organ.

RAG / Vector Store Vestige
Storage Store everything, retrieve everything Prediction Error Gating — only stores what's surprising or new
Retrieval Nearest-neighbor similarity Spreading activation — finds related memories through association chains
Decay Nothing ever expires FSRS-6 — memories fade like yours do, keeping context lean
Duplicates Manual dedup or none Self-healing — automatically merges "likes dark mode" + "prefers dark themes"
Importance All memories are equal Synaptic tagging — retroactively strengthens memories that turn out to matter
Privacy Usually cloud-dependent 100% local — your data never leaves your machine

The Cognitive Science Stack

This isn't a key-value store with an embedding model bolted on. Vestige implements real neuroscience:

Prediction Error Gating — The bouncer for your brain. When new information arrives, Vestige compares it against existing memories. Redundant? Merged. Contradictory? Superseded. Novel? Stored. Just like the hippocampus.

FSRS-6 Spaced Repetition — 21 parameters governing the mathematics of forgetting. Frequently-used memories stay strong. Unused memories naturally decay. Your context window stays clean.

Synaptic Tagging — A memory that seemed trivial this morning can be retroactively tagged as critical tonight. Based on Frey & Morris, 1997.

Spreading Activation — Search for "auth bug" and find the related memory about the JWT library update you saved last week. Memories form a graph, not a flat list. Based on Collins & Loftus, 1975.

Dual-Strength Model — Every memory has two values: storage strength (how well it's encoded) and retrieval strength (how easily it surfaces). A memory can be deeply stored but temporarily hard to retrieve — just like real forgetting. Based on Bjork & Bjork, 1992.

Memory States — Active, Dormant, Silent, Unavailable. Memories transition between states based on usage patterns, exactly like human cognitive architecture.

Memory Dreaming (v1.5.0) — Like sleep consolidation. Replays recent memories to discover hidden connections, strengthen important patterns, and synthesize insights. Based on the Active Dreaming Memory framework.

ACT-R Activation (v1.5.0) — Retrieval strength depends on BOTH recency AND frequency of access, computed from full access history. A memory accessed 50 times over 3 weeks is stronger than one accessed once yesterday. Based on Anderson, 1993.

Automatic Consolidation (v1.5.0) — FSRS-6 decay runs automatically every 6 hours + inline every 100 tool calls. Episodic memories auto-merge into semantic summaries. Cross-memory reinforcement strengthens neighbors on access. No manual maintenance needed.

Full science documentation →


Tools — 23 MCP Tools

Core Memory

Tool What It Does
search 7-stage cognitive search — keyword + semantic + RRF fusion + reranking + temporal boost + competition + spreading activation
smart_ingest Intelligent storage with automatic CREATE/UPDATE/SUPERSEDE via Prediction Error Gating
ingest Direct memory storage with cognitive post-processing
memory Get, delete, or check memory accessibility state
codebase Remember code patterns and architectural decisions per-project
intention Prospective memory — "remind me to X when Y happens"

Cognitive Engine (v1.5.0)

Tool What It Does
dream Memory consolidation via replay — discovers hidden connections, synthesizes insights
explore_connections Graph traversal — build reasoning chains, find associations via spreading activation, discover bridges between memories
predict Proactive retrieval — predicts what memories you'll need next based on context and activity patterns
restore Restore memories from JSON backup files

Feedback & Scoring

Tool What It Does
promote_memory / demote_memory Feedback loop with full cognitive pipeline — reward signals, reconsolidation, competition
importance_score 4-channel neuroscience scoring (novelty, arousal, reward, attention)

Auto-Save & Maintenance

Tool What It Does
session_checkpoint Batch-save up to 20 items in one call
find_duplicates Self-healing — detect and merge redundant memories via cosine similarity
consolidate Run FSRS-6 decay cycle (also runs automatically every 6 hours)
memory_timeline Browse memories chronologically, grouped by day
memory_changelog Audit trail of memory state transitions
health_check / stats System health, retention curves, cognitive state breakdown
backup / export / gc Database backup, JSON export, garbage collection

Make Your AI Use Vestige Automatically

Add this to your CLAUDE.md and your AI becomes proactive:

## Memory

At the start of every session:
1. Search Vestige for user preferences and project context
2. Save bug fixes, decisions, and patterns without being asked
3. Create reminders when the user mentions deadlines
You Say AI Does
"Remember this" Saves immediately
"I prefer..." / "I always..." Saves as preference
"Remind me..." Creates a future trigger
"This is important" Saves + strengthens

Full CLAUDE.md templates →


CLI

vestige stats              # Memory statistics
vestige stats --tagging    # Retention distribution
vestige stats --states     # Cognitive state breakdown
vestige health             # System health check
vestige consolidate        # Run memory maintenance
vestige restore <file>     # Restore from backup

Technical Details

  • Language: Rust (46,000+ lines)
  • Binary size: ~20MB
  • Embeddings: Nomic Embed Text v1.5 (768-dim, local ONNX inference via fastembed)
  • Vector search: USearch HNSW (20x faster than FAISS)
  • Storage: SQLite + FTS5 (optional SQLCipher encryption)
  • Transport: MCP stdio (JSON-RPC 2.0)
  • Dependencies: Zero runtime dependencies beyond the binary
  • First run: Downloads embedding model (~130MB), then fully offline
  • Platforms: macOS (ARM/Intel), Linux (x86_64), Windows

Documentation

Document Contents
FAQ 30+ answers to common questions
How It Works The neuroscience behind every feature
Storage Modes Global, per-project, multi-instance setup
CLAUDE.md Setup Templates for proactive memory
Configuration CLI commands, environment variables
Integrations Xcode, Cursor, VS Code, JetBrains, Windsurf
Changelog Version history

Troubleshooting

"Command not found" after installation

Ensure vestige-mcp is in your PATH:

which vestige-mcp

Or use the full path:

claude mcp add vestige /usr/local/bin/vestige-mcp -s user
Embedding model download fails

First run downloads ~130MB from Hugging Face. If behind a proxy:

export HTTPS_PROXY=your-proxy:port

Cache locations:

  • macOS: ~/Library/Caches/com.vestige.core/fastembed
  • Linux: ~/.cache/vestige/fastembed
  • Windows: %LOCALAPPDATA%\vestige\cache\fastembed

More troubleshooting →


Contributing

Issues and PRs welcome. See CONTRIBUTING.md.

License

AGPL-3.0 — free to use, modify, and self-host. If you offer Vestige as a network service, you must open-source your modifications.


Built by @samvallad33