SurfSense/surfsense_backend/app/indexing_pipeline
2026-06-17 14:59:24 +02:00
..
adapters feat(backend): Remove LLM summaries from document indexing 2026-06-04 00:50:19 +05:30
cache refactor(indexing): expose chunk_markdown and embed_batch helpers 2026-06-12 18:52:57 +02:00
__init__.py test: add ConnectorDocument unit tests and factory fixture 2026-02-24 22:20:08 +02:00
chunk_reconciler.py feat(indexing): add pure chunk reconciler for content-addressed diffs 2026-06-12 18:52:46 +02:00
connector_document.py feat(backend): Remove LLM summaries from document indexing 2026-06-04 00:50:19 +05:30
document_chunker.py feat(chunker): add table-aware chunk_text_hybrid to prevent mid-row table splits 2026-05-05 12:48:04 +08:00
document_embedder.py feat: re-export embed_texts from document_embedder 2026-03-09 15:54:02 +02:00
document_hashing.py feat: made agent file sytem optimized 2026-03-28 16:39:46 -07:00
document_persistence.py batch chunk inserts in persist_scratch_index 2026-06-17 14:59:24 +02:00
exceptions.py feat(error-handling): implement LLM error adaptation and classification for chat streaming 2026-06-12 05:03:14 +05:30
indexing_pipeline_service.py wire persist_scratch_index into scratch reindex 2026-06-17 14:59:24 +02:00
pipeline_logger.py feat: enhance performance logging and caching in various components 2026-02-26 13:00:31 -08:00