Mirror of https://github.com/syntrex-lab/gomcp.git, synced 2026-06-08 14:55:13 +02:00
- QJL (1-bit) approximate filter for 2.3x faster search
- PolarQuant (4-bit/8-bit) compressed storage with PQDropFloat64 memory reclamation (15x heap reduction)
- Two-Phase SearchQJL with fallback to CompressedSimilarity
- Edge Deployment prototype (pq_attention.cu) for LLaMA 1.5M-token context

| Directory |
|---|
| application |
| config |
| domain |
| infrastructure |
| transport |