SurfSense/surfsense_backend/app/tasks/connector_indexers
Anish Sarkar f7b52470eb feat: enhance Google connectors indexing with content extraction and document migration
- Added `download_and_extract_content` function to extract content from Google Drive files as markdown.
- Updated Google Drive indexer to utilize the new content extraction method.
- Implemented document migration logic to update legacy Composio document types to their native Google types.
- Introduced identifier hashing for stable document identification.
- Improved file pre-filtering to handle unchanged and rename-only files efficiently.
2026-03-25 18:33:44 +05:30
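
The commit message above mentions identifier hashing for stable document identification and pre-filtering of unchanged and rename-only files. The following is a minimal sketch of how those two ideas could fit together, not the code from this commit. The names `StoredDoc`, `identifier_hash`, and `classify_file`, the `content_fingerprint` field, and the literal document type and IDs are all hypothetical and chosen for illustration only.

```python
import hashlib
from dataclasses import dataclass
from typing import Optional


@dataclass
class StoredDoc:
    """Hypothetical record of a file that was indexed earlier."""
    name: str
    content_fingerprint: str  # assumed: a checksum or modified-time value from the source


def identifier_hash(document_type: str, file_id: str, search_space_id: int) -> str:
    """Hash only fields that stay fixed for a file, so a rename keeps the same key."""
    key = f"{document_type}:{file_id}:{search_space_id}"
    return hashlib.sha256(key.encode("utf-8")).hexdigest()


def classify_file(existing: Optional[StoredDoc], name: str, content_fingerprint: str) -> str:
    """Decide how much work a file needs before any content is downloaded."""
    if existing is None:
        return "new"            # never indexed: download and extract
    if existing.content_fingerprint != content_fingerprint:
        return "changed"        # content differs: re-extract
    if existing.name != name:
        return "rename_only"    # same content, new title: update metadata only
    return "unchanged"          # nothing to do


# Illustrative usage with placeholder values.
key = identifier_hash("GOOGLE_DRIVE_FILE", "file-123", 1)
store = {key: StoredDoc(name="notes.md", content_fingerprint="v1")}
print(classify_file(store.get(key), "notes-renamed.md", "v1"))  # rename_only
```

Because the key is derived from the source's file ID instead of the file name, a renamed file resolves to the same stored record, which is what lets the rename-only case skip content extraction in this sketch.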
__init__.py refactor: remove unused COMPOSIO_CONNECTOR migration and linting 2026-01-22 16:43:08 -08:00
airtable_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
base.py chore: ran linting 2026-03-17 04:40:46 +05:30
bookstack_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
clickup_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
confluence_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
discord_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
elasticsearch_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
github_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
google_calendar_indexer.py feat: enhance Google connectors indexing with content extraction and document migration 2026-03-25 18:33:44 +05:30
google_drive_indexer.py feat: enhance Google connectors indexing with content extraction and document migration 2026-03-25 18:33:44 +05:30
google_gmail_indexer.py feat: enhance Google connectors indexing with content extraction and document migration 2026-03-25 18:33:44 +05:30
jira_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
linear_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
luma_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
notion_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
obsidian_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
slack_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
teams_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00
webcrawler_indexer.py fix: critical timestamp parsing and audit fixes 2026-03-23 19:49:28 +02:00