refactor(embedding-cache): rename index cache to embedding cache

The cached payload is the indexing pipeline's embeddings (markdown is chunked then embedded), so "embedding cache" names the expensive output directly and removes the "index" ambiguity (DB index vs vector index vs indexing phase). Renames the service, settings, eligibility, eviction task, metrics, config flags (INDEX_CACHE_* -> EMBEDDING_CACHE_*), object prefix, and the table (index_cache_embedding_sets -> embedding_cache_sets) with its constraint and indexes. Migration 161 renamed accordingly.
2026-06-16 21:05:20 +02:00 · 2026-06-12 17:00:01 +02:00 · 2026-06-12 17:00:01 +02:00 · 91d947ff79
commit 91d947ff79
parent 8cf578d965
18 changed files with 93 additions and 89 deletions
--- a/surfsense_backend/.env.example
+++ b/surfsense_backend/.env.example
@ -328,19 +328,19 @@ ETL_CACHE_ENABLED=false
 # ETL_CACHE_STORAGE_CONTAINER=surfsense-etl-cache
 # ETL_CACHE_STORAGE_LOCAL_PATH=/var/lib/surfsense/etl-cache

-# Index Cache
+# Embedding Cache
 # Reuse chunk+embedding output for identical markdown across workspaces (skips
 # re-chunking and re-embedding). Blobs share the ETL_CACHE_STORAGE_* backend.
 # Off by default.
-INDEX_CACHE_ENABLED=false
+EMBEDDING_CACHE_ENABLED=false
 # Bump to invalidate all cached embedding sets after a chunker change.
-# INDEX_CACHE_CHUNKER_VERSION=1
+# EMBEDDING_CACHE_CHUNKER_VERSION=1
 # Prune entries unused for this many days.
-# INDEX_CACHE_TTL_DAYS=90
+# EMBEDDING_CACHE_TTL_DAYS=90
 # Soft cap on total cached embeddings; coldest entries are evicted past it.
-# INDEX_CACHE_MAX_TOTAL_MB=5120
+# EMBEDDING_CACHE_MAX_TOTAL_MB=5120
 # Rows deleted per eviction pass.
-# INDEX_CACHE_EVICTION_BATCH=500
+# EMBEDDING_CACHE_EVICTION_BATCH=500

 # Daytona Sandbox (isolated code execution)
 # DAYTONA_SANDBOX_ENABLED=FALSE