refactor(embedding-cache): rename index cache to embedding cache

The cached payload is the indexing pipeline's embeddings (markdown is
chunked then embedded), so "embedding cache" names the expensive output
directly and removes the "index" ambiguity (DB index vs vector index vs
indexing phase). Renames the service, settings, eligibility, eviction
task, metrics, config flags (INDEX_CACHE_* -> EMBEDDING_CACHE_*), object
prefix, and the table (index_cache_embedding_sets -> embedding_cache_sets)
with its constraint and indexes. Migration 161 renamed accordingly.
This commit is contained in:
CREDO23 2026-06-12 17:00:01 +02:00
parent 8cf578d965
commit 91d947ff79
18 changed files with 93 additions and 89 deletions

View file

@ -328,19 +328,19 @@ ETL_CACHE_ENABLED=false
# ETL_CACHE_STORAGE_CONTAINER=surfsense-etl-cache
# ETL_CACHE_STORAGE_LOCAL_PATH=/var/lib/surfsense/etl-cache
# Index Cache
# Embedding Cache
# Reuse chunk+embedding output for identical markdown across workspaces (skips
# re-chunking and re-embedding). Blobs share the ETL_CACHE_STORAGE_* backend.
# Off by default.
INDEX_CACHE_ENABLED=false
EMBEDDING_CACHE_ENABLED=false
# Bump to invalidate all cached embedding sets after a chunker change.
# INDEX_CACHE_CHUNKER_VERSION=1
# EMBEDDING_CACHE_CHUNKER_VERSION=1
# Prune entries unused for this many days.
# INDEX_CACHE_TTL_DAYS=90
# EMBEDDING_CACHE_TTL_DAYS=90
# Soft cap on total cached embeddings; coldest entries are evicted past it.
# INDEX_CACHE_MAX_TOTAL_MB=5120
# EMBEDDING_CACHE_MAX_TOTAL_MB=5120
# Rows deleted per eviction pass.
# INDEX_CACHE_EVICTION_BATCH=500
# EMBEDDING_CACHE_EVICTION_BATCH=500
# Daytona Sandbox (isolated code execution)
# DAYTONA_SANDBOX_ENABLED=FALSE