SurfSense/surfsense_backend/app
CREDO23 0fb1d3d37b feat(etl-cache): route all file-based sources through the parse cache
Every file ingestion path (Dropbox, Google Drive / Composio Drive, OneDrive,
local folder, Obsidian, and the legacy upload handlers) now parses via the
extract_with_cache facade instead of calling EtlPipelineService.extract
directly, so identical bytes are deduplicated globally regardless of source.
vision_llm is passed through, keeping the existing cacheability gate intact.
2026-06-12 14:47:25 +02:00
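The commit message above describes deduplicating parses by file content rather than by source. A minimal sketch of that idea follows, under stated assumptions: `extract_with_cache_sketch`, the dict-based `cache`, the SHA-256 key, and the `parse` callable are all hypothetical stand-ins, not the actual `extract_with_cache` or `EtlPipelineService.extract` signatures. The cacheability gate shown here (bypass the cache whenever a `vision_llm` is supplied) is likewise an assumption made for illustration; the real gate may behave differently.

```python
import hashlib
from typing import Any, Callable, MutableMapping, Optional


def content_key(data: bytes) -> str:
    """Derive a cache key from the file bytes alone (assumed: SHA-256)."""
    return hashlib.sha256(data).hexdigest()


def extract_with_cache_sketch(
    data: bytes,
    parse: Callable[..., str],
    cache: MutableMapping[str, str],
    vision_llm: Optional[Any] = None,
) -> str:
    """Hypothetical facade: parse once per distinct byte content.

    `parse` stands in for the real extraction call, and `cache` stands in
    for whatever persistent store backs the parse cache.
    """
    # Assumed gate: a parse that involves a vision LLM is treated as
    # non-cacheable, so it neither reads from nor writes to the cache.
    if vision_llm is not None:
        return parse(data, vision_llm=vision_llm)

    key = content_key(data)
    if key in cache:
        return cache[key]  # hit: same bytes, regardless of source

    result = parse(data, vision_llm=None)
    cache[key] = result  # miss: store for every later source
    return result
```

Because the key depends only on the bytes, a file fetched from Dropbox and the same file uploaded by hand would share one cache entry in this sketch.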
agents refactor(podcasts): regenerate via brief gate, render brief inline in chat 2026-06-11 11:45:17 +02:00
automations refactor(agents): consolidate chat runtime infra under chat/runtime 2026-06-05 13:19:24 +02:00
config feat(config): add ETL_CACHE_* settings 2026-06-12 11:23:50 +02:00
connectors feat(etl-cache): route all file-based sources through the parse cache 2026-06-12 14:47:25 +02:00
etl_pipeline feat(etl-cache): emit hit/miss and eviction metrics 2026-06-12 11:57:03 +02:00
event_bus refactor(event_bus): wire catalog and events into package, rename builtin to events 2026-05-29 22:15:18 +02:00
file_storage chore: linting 2026-06-09 00:42:26 -07:00
gateway chore: linting 2026-06-09 00:42:26 -07:00
indexing_pipeline fix(indexing): log and recover session in rollback_and_persist_failure 2026-06-10 00:10:25 +02:00
notifications feat(refactor): refactor payment system to implement unified credit wallet. 2026-06-10 16:49:03 -07:00
observability feat(observability): add etl cache lookup and eviction metrics 2026-06-12 11:57:03 +02:00
podcasts chore: linting 2026-06-11 15:31:43 -07:00
prompts refactor(agents): relocate boundary-only infra out of shared/ 2026-06-05 12:36:44 +02:00
retriever feat(retriever): instrument knowledge base search 2026-05-21 23:03:31 +05:30
routes Merge remote-tracking branch 'upstream/dev' into improvement-podcast-graph 2026-06-11 23:14:49 +02:00
schemas Merge remote-tracking branch 'upstream/dev' into improvement-podcast-graph 2026-06-11 23:14:49 +02:00
services feat(etl-cache): route all file-based sources through the parse cache 2026-06-12 14:47:25 +02:00
tasks feat(etl-cache): route all file-based sources through the parse cache 2026-06-12 14:47:25 +02:00
templates feat: update report generation and export capabilities to support multiple formats (PDF, DOCX, HTML, LaTeX, EPUB, ODT, plain text) across documentation and backend 2026-03-09 18:41:21 -07:00
utils chore: linting 2026-06-09 00:42:26 -07:00
__init__.py feat: SurfSense v0.0.6 init 2025-03-14 18:53:14 -07:00
app.py feat: made chat fast 2026-06-09 04:45:17 -07:00
celery_app.py feat(celery): schedule etl cache eviction 2026-06-12 11:23:50 +02:00
db.py feat(db): register CachedParse model 2026-06-12 11:23:50 +02:00
exceptions.py feat: add processing mode support for document uploads and ETL pipeline, improved error handling ux 2026-04-14 21:26:00 -07:00
rate_limiter.py try: ip fix for cloudflare 2026-04-16 02:13:52 -07:00
session_events.py refactor: anonymous/free chat experience 2026-05-31 15:58:21 -07:00
users.py Seed default prompts on registration and for existing users 2026-03-31 18:12:09 +02:00
zero_publication.py feat(migration): evolve podcast lifecycle by detaching from zero_publication and updating column handling 2026-06-11 16:17:14 -07:00