Commit graph

2249 commits

Author SHA1 Message Date
CREDO23
506a9297a9 fix(connectors): track delta sync tokens per folder for Google Drive
- Store tokens in folder_tokens dict instead of single global token
- Each folder now tracks its own sync state independently
- Fixes issue where indexing folder 2 incorrectly used delta sync after folder 1 was indexed
- First-time indexing now correctly uses full scan for each new folder
2025-12-28 18:32:59 +02:00
CREDO23
a5935bc677 feat(connectors): add connector parameter to file processor for source tracking
- Add optional 'connector' parameter with 'type' and 'metadata' fields
- Create helper function _update_document_from_connector
- Use document_metadata column (not metadata) for JSON field
- Merge metadata with existing using dict spread operator
- Google Drive documents now marked as GOOGLE_DRIVE_CONNECTOR
- Backward compatible - no changes to existing logic
- Simple and clean implementation
2025-12-28 18:01:39 +02:00
Anish Sarkar
a10bfe32cd feat: Adjust document mention picker max height and refine thread header shimmer display logic. 2025-12-28 21:29:28 +05:30
Anish Sarkar
446abc239b fix: now document processing UI banner does not show for periodic reindexing documents 2025-12-28 21:19:20 +05:30
CREDO23
8da58be9e0 fix(connectors): refresh document from DB before updating type
- Query document from database to ensure it's attached to session
- Prevents detached instance errors after process_file_in_background commits
- Properly updates document_type and metadata with session management
2025-12-28 17:21:44 +02:00
CREDO23
b2b891e4d7 fix(connectors): properly commit Google Drive document type changes
- Return file metadata from content_extractor for indexer to use
- Update document type and metadata in indexer after processing
- Explicitly commit changes to database
- Ensures documents are properly marked as GOOGLE_DRIVE_CONNECTOR type
2025-12-28 17:15:29 +02:00
CREDO23
9f1fd20944 feat(connectors): mark Google Drive documents with GOOGLE_DRIVE_CONNECTOR type
- Change document_type from file type (PDF, DOCX) to GOOGLE_DRIVE_CONNECTOR
- Store original file type in metadata for reference
- Add Google Drive specific metadata (file_id, mime_type, source)
- Include export format info for Google Workspace files
- Enables proper source tracking and bulk management
2025-12-28 16:55:14 +02:00
CREDO23
c9815fd6fb feat(celery): update Google Drive task for multiple folders
- Accept comma-separated folder_ids and folder_names parameters
- Pass through to indexing function for batch processing
2025-12-28 16:49:47 +02:00
CREDO23
634eeb887e feat(routes): support multiple Google Drive folder indexing
- Accept comma-separated folder_ids and folder_names
- Loop through each folder and index sequentially
- Collect total indexed count and errors
- Update timestamp only on full success
2025-12-28 16:49:20 +02:00
CREDO23
27a4bcdfc2 feat(ui): support multiple folder selection in Google Drive indexing
- Update manage page to handle array of selected folders
- Add info icon with clear description about folder-level indexing
- Display list of all selected folders before indexing
- Remove unnecessary file type details section
- Pass comma-separated folder IDs and names to backend
2025-12-28 16:48:56 +02:00
CREDO23
e0edfef5fc feat(ui): add multiple folder selection with checkboxes to Google Drive tree
- Replace single folder selection with multi-select checkboxes
- Remove cascading auto-select for clearer UX
- Each folder must be selected individually
- Visual indicators for selected folders
2025-12-28 16:48:34 +02:00
CREDO23
c4a95ecc02 feat(ui): integrate Google Drive folder selection into manage connectors page
- Add folder selection dialog for Google Drive indexing
- Hide date picker and quick index for Google Drive
- Show folder tree browser in modal
- Pass selected folder to indexing API
- Adjust modal size to prevent overflow
2025-12-28 16:04:58 +02:00
CREDO23
5df04c3caa feat(ui): add hierarchical Google Drive folder tree browser
- Display folders and files with lazy loading
- Show different icons for file types (docs, sheets, slides, etc)
- Expandable folder tree with proper indentation
- Selectable folders for indexing
- Handle overflow with proper truncation
- Full pagination support for large folder structures
2025-12-28 16:04:58 +02:00
CREDO23
ad4d424d38 feat(ui): add Google Drive OAuth connection page
- Handle OAuth flow similar to Gmail/Calendar
- Show connection status and redirect to manage page
- Display connector features and file type support
- No folder selection at connection time (done at index time)
2025-12-28 16:04:58 +02:00
CREDO23
90b3474b47 feat(hooks): add folder parameters to indexConnector function
- Accept folderId and folderName for Google Drive indexing
- Pass folder parameters to backend API
2025-12-28 16:04:58 +02:00
CREDO23
48112f66df feat(ui): add Google Drive connector card to Productivity category 2025-12-28 16:04:58 +02:00
CREDO23
bfbd813f42 feat(i18n): add Google Drive connector translation keys 2025-12-28 16:04:58 +02:00
CREDO23
11d94e0ea6 feat(ui): add Google Drive icon to connector icons mapping 2025-12-28 16:04:58 +02:00
CREDO23
2d24f9ac79 feat(types): add GOOGLE_DRIVE_CONNECTOR to frontend enum 2025-12-28 16:04:58 +02:00
CREDO23
1c83327fc7 feat(celery): add Google Drive indexing Celery task
- Create async task for Google Drive folder indexing
- Accept folder_id and folder_name parameters
- Call indexing wrapper to avoid circular imports
2025-12-28 15:56:11 +02:00
CREDO23
358abdf02f feat(routes): add Google Drive indexing support with folder selection
- Accept folder_id and folder_name as indexing parameters
- Hide date range for Google Drive connectors
- Create wrapper function to avoid circular imports
- Trigger Google Drive indexing Celery task
2025-12-28 15:55:57 +02:00
CREDO23
7b8900d51f feat(indexer): export Google Drive indexer function 2025-12-28 15:55:46 +02:00
CREDO23
501d08f2f4 feat(routes): register Google Drive OAuth router 2025-12-28 15:55:38 +02:00
CREDO23
1696c7056a feat(indexer): add Google Drive folder indexing with delta sync
- Full folder scan on first index
- Delta sync using change tracking for subsequent indexes
- Process files in parallel batches
- Handle file additions, modifications, and deletions
- Store change tracking token for efficient re-indexing
2025-12-28 15:55:25 +02:00
CREDO23
bf02005d82 feat(routes): add Google Drive OAuth and folder listing endpoints
- OAuth initialization and callback handling
- Folder and file browsing with parent_id support
- Validate credentials and handle token refresh
- Return folder contents with metadata for UI tree view
2025-12-28 15:55:13 +02:00
CREDO23
3e67d5f31e feat(connectors): add Google Drive delta sync with change tracking
- Get start page token for change tracking baseline
- Fetch incremental changes using Google Drive Changes API
- Categorize changes into added, modified, and removed files
- Enable efficient re-indexing of only changed content
2025-12-28 15:55:06 +02:00
CREDO23
84bde67979 feat(connectors): add Google Drive folder browsing and file listing
- List folder contents with full pagination support
- Query root folder or specific parent folder
- Return both folders and files with metadata (size, icons, links)
- Filter out shortcuts and trashed items
2025-12-28 15:54:58 +02:00
CREDO23
40304c6795 feat(connectors): add Google Drive content extraction using existing ETL
- Download files from Google Drive to temporary location
- Export Google Workspace files as PDF
- Delegate content extraction to existing process_file_in_background
- Reuse Surfsense's ETL services (Unstructured, LlamaCloud, Docling)
2025-12-28 15:54:50 +02:00
CREDO23
701c3409b3 feat(connectors): add Google Drive file type detection and mapping
- Detect Google Workspace files (Docs, Sheets, Slides)
- Map to PDF export format to preserve rich content (images, formatting)
- Identify files to skip (shortcuts, unsupported types)
2025-12-28 15:54:42 +02:00
CREDO23
74386affdc feat(connectors): add Google Drive API client wrapper
- Build and manage Google Drive service with credentials
- List files with query support and pagination
- Download binary files and export Google Workspace files as PDF
- Handle HTTP errors gracefully
2025-12-28 15:54:32 +02:00
CREDO23
2c8717b14b feat(connectors): add Google Drive credentials module for OAuth management
- Handle Google OAuth credential initialization and validation
- Automatic token refresh with database persistence
- Reuse existing tokens when valid
2025-12-28 15:54:26 +02:00
CREDO23
2897985127 feat(config): add GOOGLE_DRIVE_REDIRECT_URI environment variable 2025-12-28 15:53:51 +02:00
CREDO23
5dd8838638 feat(db): add idempotent Alembic migration for GOOGLE_DRIVE_CONNECTOR enums 2025-12-28 15:53:44 +02:00
CREDO23
f54079643f feat(db): add GOOGLE_DRIVE_CONNECTOR to DocumentType and SearchSourceConnectorType enums 2025-12-28 15:53:35 +02:00
Rohan Verma
0e1ea9c30f
Merge pull request #639 from MODSetter/dev
fix: docker version mismatch
2025-12-27 15:14:27 -08:00
DESKTOP-RTLN3BA\$punk
dbd9c837e6 fix: docker version mismatch 2025-12-27 15:13:52 -08:00
Rohan Verma
96f82d73ad
Merge pull request #638 from MODSetter/dev
feat: remove google analytics
2025-12-27 14:31:23 -08:00
DESKTOP-RTLN3BA\$punk
0bcccc9fd3 Merge commit 'ed6c83615e' into dev 2025-12-27 14:30:03 -08:00
DESKTOP-RTLN3BA\$punk
f2df655469 chore: Remove unused dependency '@next/third-parties' from package.json and pnpm-lock.yaml, and clean up related code in layout.tsx 2025-12-27 14:29:52 -08:00
Rohan Verma
ed6c83615e
Merge pull request #636 from MODSetter/dev
feat: Replace AsyncChromiumLoader with Playwright for web crawling an…
2025-12-27 13:58:20 -08:00
DESKTOP-RTLN3BA\$punk
9d0721de43 feat: Replace AsyncChromiumLoader with Playwright for web crawling and content extraction in link preview and web crawler connector modules. 2025-12-27 13:58:00 -08:00
Rohan Verma
c2e6bf2018
Merge pull request #635 from MODSetter/dev
feat: sweet UX updates
2025-12-27 13:45:44 -08:00
DESKTOP-RTLN3BA\$punk
528c59ce58 chore: linting 2025-12-27 13:44:23 -08:00
DESKTOP-RTLN3BA\$punk
d154f510a3 fix: Adjust viewport overflow behavior in Thread component for improved scrolling experience 2025-12-27 13:42:37 -08:00
Rohan Verma
4a011af204
Merge pull request #628 from AnishSarkar22/fix/chatpage-ux
more improvements
2025-12-27 12:42:41 -08:00
Anish Sarkar
74f17f61ec feat: Enhance agent system prompt with write_todos examples, and apply minor code formatting across connector pages and hooks. 2025-12-28 01:41:27 +05:30
Anish Sarkar
0164659f7a refactor: comment out ls tool handling, update assistant UI colors and spacing. 2025-12-28 01:30:53 +05:30
Anish Sarkar
2c64fcc38e feat: Enhance sidebar navigation active state logic, introduce static display items, refine active item styling 2025-12-28 01:20:39 +05:30
Anish Sarkar
8b10b0cd24 feat: Display document reindexing status in the sidebar by adding document ID to logs 2025-12-28 01:07:42 +05:30
Anish Sarkar
2570360079 fix: logs now showing in docker builds 2025-12-28 00:37:26 +05:30