vestige

mirror of https://github.com/samvallad33/vestige.git synced 2026-05-08 15:22:37 +02:00

Author	SHA1	Message	Date
NoahToKnow	9c022a0f54	fix(deep_reference): incorporate query relevance into recommended/confidence The Stage 8 `recommended` selector and the evidence sort both rank by FSRS-6 trust only, discarding the `combined_score` signal that the upstream hybrid_search + cross-encoder reranker just computed. Confidence is then derived from `recommended.trust + evidence_count`, neither of which moves with the query — so any query against the same corpus returns the same primary memory and the same confidence score. Empirical reproduction (15 deep_reference probes against an 11-memory corpus, 9 with a unique correct answer + 6 with no relevant memories): - Distinct primary memories returned : 1 / 15 - Confidence values returned : 1 distinct (0.82 for all) - Ground-truth accuracy on specific queries : 1 / 9 (11.1%) The single hit is coincidental: the always-returned memory happened to be the correct answer for one query. Random guessing across the 11-memory corpus would be ~9% baseline, so the tool is performing at random. Fix --- Replace trust-only ranking at three sites with a 50/50 composite of combined_score (query relevance) and FSRS-6 trust: let composite = \|s: &ScoredMemory\| s.combined_score as f64 * 0.5 + s.trust * 0.5; Used in: - cross_reference.rs:573 — `recommended` max_by - cross_reference.rs:589 — `non_superseded` evidence sort_by - cross_reference.rs:622 — `base_confidence` formula The 50/50 weighting is a design choice — see PR body for the knob to tweak if a different blend is preferred. The pre-existing updated_at tiebreaker is preserved. Tests ----- Two regression tests, both verified to FAIL on `main` and PASS with the fix via negative control (temporarily set the composite weights to 1.0 trust + 0.0 relevance and confirmed both tests fail again): - test_recommended_uses_query_relevance_not_just_trust Two-memory corpus, ingested in order so the off-topic memory wins the trust tiebreaker. Query targets the on-topic memory. The fix ensures `recommended` is the on-topic one. - test_confidence_varies_with_query_relevance Single-memory corpus. Identical execute() calls with a relevant query and an irrelevant query. The fix ensures the relevant query produces higher confidence. Full crate suite: 410 / 410 passing (was 408 + 2 new). Out of scope ------------ While running the live MCP probes I observed two further inconsistencies in `cross_reference.rs` that I cannot reproduce in cargo test (the synthetic test environment with mock embeddings does not trigger the required combined_score > 0.2 floor condition): - The `effective_sim` floor at line 551 fabricates contradictions between memories with no real topical overlap when one contains a CORRECTION_SIGNALS keyword. - The Stage 5 `contradictions` field (strict) and the Stage 7 `pair_relations` feeding the reasoning text (loose, post-floor) disagree, producing responses where `reasoning` claims N contradictions while `contradictions` is empty and `status` is "resolved". I have empirical data for both from live MCP usage but no reproducible cargo test, so they are intentionally not addressed in this PR. Happy to file them as a separate issue with the raw probe data if useful.	2026-04-09 20:09:56 -06:00
Sam Valladares	17038fccc4	fix(intention): accept snake_case in_minutes / file_pattern on TriggerSpec (#26 ) Some checks are pending CI / Test (macos-latest) (push) Waiting to run Details CI / Test (ubuntu-latest) (push) Waiting to run Details CI / Release Build (aarch64-apple-darwin) (push) Blocked by required conditions Details CI / Release Build (x86_64-unknown-linux-gnu) (push) Blocked by required conditions Details Test Suite / Unit Tests (push) Waiting to run Details Test Suite / MCP E2E Tests (push) Waiting to run Details Test Suite / User Journey Tests (push) Blocked by required conditions Details Test Suite / Dashboard Build (push) Waiting to run Details Test Suite / Code Coverage (push) Waiting to run Details fix(intention): accept snake_case in_minutes / file_pattern on TriggerSpec	2026-04-09 17:39:11 -05:00
Sam Valladares	3239295ab8	fix: resolve clippy collapsible-if errors in explore.rs Collapsed nested if statements into single conditions using let-chains (if a && let Ok(b) = ...). Fixes CI clippy failures on both macOS and Ubuntu. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 17:37:41 -05:00
NoahToKnow	f97dc7d084	fix(intention): accept snake_case in_minutes / file_pattern on TriggerSpec The public JSON schema in schema() declares `in_minutes` and `file_pattern` in snake_case, but TriggerSpec uses `#[serde(rename_all = "camelCase")]` which makes serde expect `inMinutes` / `filePattern`. Snake_case inputs are silently dropped to None, so time-based intentions with `in_minutes` never fire (triggerAt becomes null) and file_pattern-only context intentions never match. Added `#[serde(alias = ...)]` so both naming conventions deserialize correctly — purely additive, existing camelCase callers unaffected. Two regression tests added, verified to FAIL without the aliases (negative control confirmed the snake_case duration test sees `triggerAt: null` and the file_pattern test sees an empty `triggered` array). Both pass with the fix. Full crate suite: 408/408 passing. Related to #25 (Bug #8 was half-fixed — check-side re-derivation works, but the set-side was still dropping the value before it could be persisted).	2026-04-09 16:24:17 -06:00
Sam Valladares	5b1127d630	fix: remove vestige-agent from workspace (not shipped), improve reasoning chain output - Removed vestige-agent and vestige-agent-py from workspace members (ARC-AGI-3 code, not part of Vestige release — caused CI failure) - Improved deep_reference reasoning chain: fuller output with arrows on supersession reasoning, longer primary finding preview, fallback message when no relations found, boosted relation detection for search results with high combined_score Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 17:06:24 -05:00
Sam Valladares	04781a95e2	feat: v2.0.4 "Deep Reference" — cognitive reasoning engine + 10 bug fixes New features: - deep_reference tool (#22): 8-stage cognitive reasoning pipeline with FSRS-6 trust scoring, intent classification (FactCheck/Timeline/RootCause/Comparison/ Synthesis), spreading activation expansion, temporal supersession, trust-weighted contradiction analysis, relation assessment, dream insight integration, and algorithmic reasoning chain generation — all without calling an LLM - cross_reference (#23): backward-compatible alias for deep_reference - retrieval_mode parameter on search (precise/balanced/exhaustive) - get_batch action on memory tool (up to 20 IDs per call) - Token budget raised from 10K to 100K on search + session_context - Dates (createdAt/updatedAt) on all search results and session_context lines Bug fixes (GitHub Issue #25 — all 10 resolved): - state_transitions empty: wired record_memory_access into strengthen_batch - chain/bridges no storage fallback: added with edge deduplication - knowledge_edges dead schema: documented as deprecated - insights not persisted from dream: wired save_insight after generation - find_duplicates threshold dropped: serde alias fix - search min_retention ignored: serde aliases for snake_case params - intention time triggers null: removed dead trigger_at embedding - changelog missing dreams: added get_dream_history + event integration - phantom Related IDs: clarified message text - fsrs_cards empty: documented as harmless dead schema Security hardening: - HTTP transport CORS: permissive() → localhost-only - Auth token panic guard: &token[..8] → safe min(8) slice - UTF-8 boundary fix: floor_char_boundary on content truncation - All unwrap() removed from HTTP transport (unwrap_or_else fallback) - Dream memory_count capped at 500 (prevents O(N²) hang) - Dormant state threshold aligned (0.3 → 0.4) Stats: 23 tools, 758 tests, 0 failures, 0 warnings, 0 unwraps in production Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 16:15:26 -05:00
Sam Valladares	c6090dc2ba	fix: v2.0.1 release — fix broken installs, CI, security, and docs Critical fixes: - npm postinstall.js: BINARY_VERSION '1.1.3' → '2.0.1' (every install was 404ing) - npm package name: corrected error messages to 'vestige-mcp-server' - README: npm install command pointed to wrong package - MSRV: bumped from 1.85 to 1.91 (uses floor_char_boundary from 1.91) - CI: removed stale 'develop' branch from test.yml triggers Security hardening: - CSP: restricted connect-src from wildcard 'ws: wss:' to localhost-only - Added X-Frame-Options, X-Content-Type-Options, Referrer-Policy, Permissions-Policy headers - Added frame-ancestors 'none', base-uri 'self', form-action 'self' to CSP - Capped retention_distribution endpoint from 10k to 1k nodes - Added debug logging for WebSocket connections without Origin header Maintenance: - All clippy warnings fixed (58 total: redundant closures, collapsible ifs, no-op casts) - All versions harmonized to 2.0.1 across Cargo.toml and package.json - CLAUDE.md updated to match v2.0.1 (21 tools, 29 modules, 1238 tests) - docs/CLAUDE-SETUP.md updated deprecated function names - License corrected to AGPL-3.0-only in root package.json 1,238 tests passing, 0 clippy warnings. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 20:20:14 -06:00
Alec Marcus	5eccc728bd	fix: hydrate cognitive modules from persisted connections (#14 ) explore_connections and memory_graph returned empty results because in-memory cognitive modules were never loaded from the database. Connections were persisting to SQLite correctly (795 in production) but the query path only checked empty ActivationNetwork. - Add CognitiveEngine::hydrate() to load connections at startup - Add storage fallback in explore_connections associations - Hydrate live engine after dream persists new connections - Add error logging for save_connection failures - Add 7 integration tests for the full round-trip Closes #14 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 00:51:49 -05:00
Sam Valladares	c2d28f3433	feat: Vestige v2.0.0 "Cognitive Leap" — 3D dashboard, HyDE search, WebSocket events The biggest release in Vestige history. Complete visual and cognitive overhaul. Dashboard: - SvelteKit 2 + Three.js 3D neural visualization at localhost:3927/dashboard - 7 interactive pages: Graph, Memories, Timeline, Feed, Explore, Intentions, Stats - WebSocket event bus with 16 event types, real-time 3D animations - Bloom post-processing, GPU instanced rendering, force-directed layout - Dream visualization mode, FSRS retention curves, command palette (Cmd+K) - Keyboard shortcuts, responsive mobile layout, PWA installable - Single binary deployment via include_dir! (22MB) Engine: - HyDE query expansion (intent classification + 3-5 semantic variants + centroid) - fastembed 5.11 with optional Nomic v2 MoE + Qwen3 reranker + Metal GPU - Emotional memory module (#29) - Criterion benchmark suite Backend: - Axum WebSocket at /ws with heartbeat + event broadcast - 7 new REST endpoints for cognitive operations - Event emission from MCP tools via shared broadcast channel - CORS for SvelteKit dev mode Distribution: - GitHub issue templates (bug report, feature request) - CHANGELOG with comprehensive v2.0 release notes - README updated with dashboard docs, architecture diagram, comparison table 734 tests passing, zero warnings, 22MB release binary. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 03:07:25 -06:00
Sam Valladares	5b90a73055	feat: Vestige v1.9.1 AUTONOMIC — self-regulating memory with graph visualization Retention Target System: auto-GC low-retention memories during consolidation (VESTIGE_RETENTION_TARGET env var, default 0.8). Auto-Promote: memories accessed 3+ times in 24h get frequency-dependent potentiation. Waking SWR Tagging: promoted memories get preferential 70/30 dream replay. Improved Consolidation Scheduler: triggers on 6h staleness or 2h active use. New tools: memory_health (retention dashboard with distribution buckets, trend tracking, recommendations) and memory_graph (subgraph export with Fruchterman-Reingold force-directed layout, up to 200 nodes). Dream connections now persist to database via save_connection(), enabling memory_graph traversal. Schema Migration V8 adds waking_tag, utility_score, times_retrieved/useful columns and retention_snapshots table. 21 MCP tools. v1.9.1 fixes: ConnectionRecord export, UTF-8 safe truncation, link_type normalization, utility_score clamping, only-new-connections persistence, 70/30 split capacity fill, nonexistent center_id error handling. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 02:02:06 -06:00
Sam Valladares	c29023dd80	feat: Vestige v1.7.0 — 18 tools, automation triggers, SQLite perf Tool consolidation: 23 → 18 tools - ingest merged into smart_ingest (single + batch mode) - session_checkpoint merged into smart_ingest batch (items param) - promote_memory/demote_memory merged into memory(action=promote/demote) - health_check/stats merged into system_status Automation triggers in system_status: - lastDreamTimestamp, savesSinceLastDream, lastBackupTimestamp, lastConsolidationTimestamp — enables Claude to conditionally trigger dream/backup/gc/find_duplicates at session start - Migration v6: dream_history table (dreams were in-memory only) - DreamHistoryRecord struct + save/query methods - Dream persistence in dream.rs (non-fatal on failure) SQLite performance: - PRAGMA mmap_size = 256MB (2-5x read speedup) - PRAGMA journal_size_limit = 64MB (prevents WAL bloat) - PRAGMA optimize = 0x10002 (fresh query planner stats on connect) - FTS5 segment merge during consolidation (20-40% keyword boost) - PRAGMA optimize during consolidation cycle 1,152 tests passing, 0 failures, release build clean.	2026-02-20 21:59:52 -06:00
Sam Valladares	ce520bb246	chore: license AGPL-3.0, zero clippy warnings, CHANGELOG through v1.6.0 License: - Replace MIT/Apache-2.0 with AGPL-3.0-only across all crates and npm packages - Replace LICENSE file with official GNU AGPL-3.0 text - Remove LICENSE-MIT and LICENSE-APACHE Code quality: - Fix all 44 clippy warnings (zero remaining) - Collapsible if statements, redundant closures, manual Option::map - Remove duplicate #[allow(dead_code)] attributes in deprecated tool modules - Add Default impl for CognitiveEngine - Replace manual sort_by with sort_by_key Documentation: - Update CHANGELOG with v1.2.0, v1.3.0, v1.5.0, v1.6.0 entries - Update README with v1.6.0 highlights and accurate stats (52K lines, 1100+ tests) - Add fastembed-rs/ to .gitignore - Add fastembed-rs to workspace exclude 1115 tests passing, zero warnings, RUSTFLAGS="-Dwarnings" clean.	2026-02-19 03:00:39 -06:00
Sam Valladares	495a88331f	feat: Vestige v1.6.0 — 6x storage reduction, neural reranking, instant startup Four internal optimizations for dramatically better performance: 1. F16 vector quantization (ScalarKind::F16 in USearch) — 2x storage savings 2. Matryoshka 256-dim truncation (768→256) — 3x embedding storage savings 3. Convex Combination fusion (0.3 keyword / 0.7 semantic) replacing RRF 4. Cross-encoder reranker (Jina Reranker v1 Turbo via fastembed TextRerank) Combined: 6x vector storage reduction, ~20% better retrieval quality. Cross-encoder loads in background — server starts instantly. Old 768-dim embeddings auto-migrated on load. 614 tests pass, zero warnings.	2026-02-19 01:09:39 -06:00
Sam Valladares	927f41c3e4	feat: Vestige v1.5.0 — Cognitive Engine, memory dreaming, graph exploration, predictive retrieval 28-module CognitiveEngine with full neuroscience pipeline on every tool call. FSRS-6 now fully automatic: periodic consolidation (6h timer + inline every 100 tool calls), real retrievability formula, episodic-to-semantic auto-merge, cross-memory reinforcement, Park et al. triple retrieval scoring, ACT-R base-level activation, personalized w20 optimization. New tools (19 → 23): - dream: memory consolidation via replay, discovers hidden connections - explore_connections: graph traversal (chain, associations, bridges) - predict: proactive retrieval based on context and activity patterns - restore: memory restore from JSON backups All existing tools upgraded with cognitive pre/post processing pipelines. 33 files changed, ~4,100 lines added.	2026-02-18 23:34:15 -06:00
Sam Valladares	04a3062328	feat: Vestige v1.3.0 — importance scoring, session checkpoints, duplicate detection 3 new MCP tools (16 → 19 total): - importance_score: 4-channel neuroscience importance scoring (novelty/arousal/reward/attention) - session_checkpoint: batch smart_ingest up to 20 items with PE Gating - find_duplicates: cosine similarity clustering with union-find for dedup CLI: vestige ingest command for memory ingestion via command line Core: made get_node_embedding public, added get_all_embeddings for dedup scanning Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 05:02:09 -06:00
Sam Valladares	34f5e8d52a	feat: Vestige v1.2.0 — dashboard, temporal tools, maintenance tools, detail levels Add web dashboard (axum) on port 3927 with memory browser, search, and system stats. New MCP tools: memory_timeline, memory_changelog, health_check, consolidate, stats, backup, export, gc. Search now supports detail_level (brief/summary/full) to control token usage. Add backup_to() and get_recent_state_transitions() to storage layer. Bump to v1.2.0.	2026-02-12 04:33:05 -06:00
Sam Valladares	a92fb2b6ed	release: v1.1.3 — security hardening, edition 2024, dependency updates Security: - Fix RUSTSEC-2026-0007 (bytes integer overflow) - Restrict SQLite database file permissions to 0600 on Unix - Add 100KB size limit to intention descriptions (DoS prevention) - Redact JSON-RPC payloads from debug logs (data leakage prevention) - Update SECURITY.md with encryption docs and supported versions Modernization: - Upgrade Rust edition 2021 → 2024, MSRV 1.75 → 1.85 - Upgrade actions/checkout@v4 → v5, codecov/codecov-action@v3 → v5 - Update all dependencies to latest compatible versions - Fix edition 2024 match ergonomics in compression.rs Clippy fixes: - Rename from_str → parse_name to avoid shadowing FromStr trait - Replace .max().min() with .clamp() - Replace sort_by with sort_by_key Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 03:19:07 -06:00
Sam Valladares	a680fa7d2f	fix: dedup on ingest, Intel Mac CI, npm versions, remove dead TS package - Route ingest tool through smart_ingest (Prediction Error Gating) to prevent duplicate memories when content is similar to existing entries - Fix Intel Mac release build: use macos-13 runner for x86_64-apple-darwin (macos-latest is now ARM64, causing silent cross-compile failures) - Sync npm package version to 1.1.2 (was 1.0.0 in package.json, 1.1.0 in postinstall.js BINARY_VERSION) - Add vestige-restore to npm makeExecutable list - Remove abandoned packages/core/ TypeScript package (pre-Rust implementation referencing FSRS-5, chromadb, ollama — 32K lines of dead code) - Sync workspace Cargo.toml version to 1.1.2 Closes #5 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 02:57:03 -06:00
Sam Valladares	e06dd3d69a	chore: cleanup dead code warnings and apply clippy fixes for v1.1.1 - Add #![allow(dead_code)] to deprecated tool modules (kept for backwards compatibility but not exposed in MCP tool list) - Mark unused functions with #[allow(dead_code)] annotations - Fix unused variable warnings (prefix with _) - Apply clippy auto-fixes for redundant closures and derives - Fix test to account for protocol version negotiation - Reorganize tools/mod.rs to clarify active vs deprecated tools Security review: LOW RISK - no critical vulnerabilities found Dead code review: deprecated tools properly annotated Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-27 01:23:27 -06:00
Sam Valladares	8bb6500985	feat(v1.1): consolidate 29 tools → 8 unified tools + CLI Tool Consolidation: - search: merges recall, semantic_search, hybrid_search - memory: merges get_knowledge, delete_knowledge, get_memory_state - codebase: merges remember_pattern, remember_decision, get_codebase_context - intention: merges all 5 intention tools into action-based API New CLI Binary: - vestige stats [--tagging] [--states] - vestige health - vestige consolidate - vestige restore <file> Documentation: - Verify all neuroscience claims against codebase - Fix Memory States table: "Retention" → "Accessibility" with formula - Clarify Spreading Activation: embedding similarity vs full network module - Update Synaptic Tagging: clarify 9h/2h implementation vs biology - Add comprehensive FAQ with 30+ questions - Add storage modes: global, per-project, multi-Claude household - Add CLAUDE.md setup instructions Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-26 01:31:58 -06:00
Sam Valladares	bbd1c15b4a	Add Prediction Error Gating and smart_ingest tool (26 tools total) Implements neuroscience-inspired memory gating based on prediction error: - New smart_ingest MCP tool that auto-decides CREATE/UPDATE/SUPERSEDE - PredictionErrorGate evaluates semantic similarity vs existing memories - Automatically supersedes demoted memories with similar new content - Reinforces near-identical memories instead of creating duplicates - Adds promote_memory/demote_memory/request_feedback tools Thresholds: - >0.92 similarity = Reinforce existing - >0.75 similarity = Update/Merge - <0.75 similarity = Create new - Demoted + similar = Auto-supersede Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-25 13:30:03 -06:00
Sam Valladares	f9c60eb5a7	Initial commit: Vestige v1.0.0 - Cognitive memory MCP server FSRS-6 spaced repetition, spreading activation, synaptic tagging, hippocampal indexing, and 130 years of memory research. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-25 01:31:03 -06:00

22 commits