SurfSense/surfsense_backend/tests/unit/indexing_pipeline
Anish Sarkar f7b52470eb feat: enhance Google connectors indexing with content extraction and document migration
- Added `download_and_extract_content` function to extract content from Google Drive files as markdown.
- Updated Google Drive indexer to utilize the new content extraction method.
- Implemented document migration logic to update legacy Composio document types to their native Google types.
- Introduced identifier hashing for stable document identification.
- Improved file pre-filtering to handle unchanged and rename-only files efficiently.
2026-03-25 18:33:44 +05:30
..
__init__.py test: bootstrap pytest environment for backend 2026-02-24 18:19:56 +02:00
conftest.py feat: enhance performance logging and caching in various components 2026-02-26 13:00:31 -08:00
test_connector_document.py test: mark test_connector_document.py with unit pytest marker 2026-03-08 02:53:47 +05:30
test_document_chunker.py add docstrings to all indexing pipeline tests 2026-02-25 20:30:31 +02:00
test_document_hashing.py feat: enhance Google connectors indexing with content extraction and document migration 2026-03-25 18:33:44 +05:30
test_document_summarizer.py feat: enhance performance logging and caching in various components 2026-02-26 13:00:31 -08:00