mirror of
https://github.com/MODSetter/SurfSense.git
synced 2026-04-28 02:23:53 +02:00
- Added `download_and_extract_content` function to extract content from Google Drive files as markdown. - Updated Google Drive indexer to utilize the new content extraction method. - Implemented document migration logic to update legacy Composio document types to their native Google types. - Introduced identifier hashing for stable document identification. - Improved file pre-filtering to handle unchanged and rename-only files efficiently. |
||
|---|---|---|
| .. | ||
| adapters | ||
| __init__.py | ||
| connector_document.py | ||
| document_chunker.py | ||
| document_embedder.py | ||
| document_hashing.py | ||
| document_persistence.py | ||
| document_summarizer.py | ||
| exceptions.py | ||
| indexing_pipeline_service.py | ||
| pipeline_logger.py | ||