SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-04-26 09:16:22 +02:00

Author	SHA1	Message	Date
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	62e698d8aa	refactor: streamline document upload limits and enhance handling of mentioned documents - Updated maximum file size limit to 500 MB per file. - Removed restrictions on the number of files per upload and total upload size. - Enhanced handling of user-mentioning documents in the knowledge base search middleware. - Improved document reading and processing logic to accommodate new features and optimizations.	2026-04-02 19:39:10 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	a9fd45844d	feat: integrate Stripe for page purchases and reconciliation tasks	2026-03-31 18:39:45 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	17642493eb	chore: linting	2026-03-31 14:45:46 -07:00
Anish Sarkar	526940e9fe	fix: improve error handling and path retrieval in Dropbox indexing for better reliability	2026-03-30 23:51:21 +05:30
Anish Sarkar	d8d5102416	feat: introduce incremental sync option for Dropbox indexing, enhancing performance and user control	2026-03-30 23:27:48 +05:30
Anish Sarkar	0d5b902c26	feat: extend Dropbox support in chat event streaming and connector naming for enhanced integration	2026-03-30 23:07:25 +05:30
Anish Sarkar	1f12151e03	feat: implement Dropbox API client and folder management for enhanced file indexing	2026-03-30 22:17:50 +05:30
Anish Sarkar	04691d572b	chore: ran linting	2026-03-30 01:50:41 +05:30
Anish Sarkar	74826b3714	feat: enhance web search tool integration with citation management and UI enhancements	2026-03-30 01:38:36 +05:30
Anish Sarkar	5a3eece397	Merge remote-tracking branch 'upstream/dev' into feat/onedrive-connector	2026-03-29 11:55:06 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	2cc2d339e6	feat: made agent file sytem optimized	2026-03-28 16:39:46 -07:00
Anish Sarkar	8035eb9749	feat: enhance OneDrive file creation by converting markdown to DOCX format and updating client to handle byte content	2026-03-29 05:02:08 +05:30
Anish Sarkar	5bddde60cb	feat: implement Microsoft OneDrive connector with OAuth support and indexing capabilities	2026-03-28 14:31:25 +05:30
Anish Sarkar	17091edb77	Merge remote-tracking branch 'upstream/dev' into refactor/indexing-pipelines	2026-03-27 22:36:34 +05:30
Anish Sarkar	489e48644f	fix: revert native excel parsing	2026-03-27 22:15:24 +05:30
Anish Sarkar	3da0ffd683	feat: add native Excel parsing and improve Google Drive content extraction - Introduced a new utility for parsing .xlsx files into markdown format, enhancing the ability to process Excel documents natively. - Updated the Google Drive content extractor to utilize the new Excel parsing functionality, allowing for better handling of spreadsheet files. - Enhanced file type detection and export logic to support various document formats, improving overall content extraction accuracy. - Added unit tests to ensure the correctness of the new Excel parsing feature and its integration with existing content extraction workflows.	2026-03-27 21:47:14 +05:30
Anish Sarkar	4e0749f907	fix: update file skipping logic for failed documents in Google Drive indexer - Modified the `_should_skip_file` function to skip previously failed documents during processing, improving error handling. - Updated the corresponding test to reflect the new behavior, ensuring that failed documents are correctly identified and skipped during automatic sync.	2026-03-27 20:01:08 +05:30
Anish Sarkar	00934ff462	feat: enhance Google Drive client with improved logging and thread-safe operations - Added logging to track the start and end of file download and export processes, improving visibility into execution time. - Implemented per-thread HTTP transport for concurrent downloads and exports, ensuring thread safety. - Refactored download and export methods to utilize resolved credentials, enhancing functionality. - Updated unit tests to validate the new threading and logging features, ensuring robust parallel execution.	2026-03-27 19:25:45 +05:30
Anish Sarkar	0bc1c766ff	feat: migrate Confluence and Jira indexers to unified parallel pipeline - Refactored Confluence and Jira indexers to utilize the shared IndexingPipelineService for improved document processing. - Updated the `_build_connector_doc` function in both indexers to create ConnectorDocument instances with enhanced metadata and fallback summaries. - Modified the `index_confluence_pages` and `index_jira_issues` functions to return a tuple of (indexed_count, skipped_count, warning_or_error_message) for better error handling and reporting. - Added unit tests for both indexers to validate the new parallel processing logic and ensure correct document creation and indexing behavior.	2026-03-27 16:02:09 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	685ad0c02d	feat: add folder management features including creation, deletion, and organization of documents within folders	2026-03-27 01:39:15 -07:00
Anish Sarkar	db6dd058dd	feat: migrate Linear and Notion indexers to unified parallel pipeline - Refactored Linear and Notion indexers to utilize the shared IndexingPipelineService for improved document deduplication, summarization, chunking, and embedding with bounded parallel indexing. - Updated the `_build_connector_doc` function in both indexers to create ConnectorDocument instances with enhanced metadata and fallback summaries. - Modified the `index_linear_issues` and `index_notion_pages` functions to return a tuple of (indexed_count, skipped_count, warning_or_error_message) for better error handling and reporting. - Added unit tests for both indexers to validate the new parallel processing logic and ensure correct document creation and indexing behavior.	2026-03-27 11:19:32 +05:30
Anish Sarkar	7c7f8b216c	feat: implement batch indexing for selected Google Drive files - Introduced `index_google_drive_selected_files` function to enable indexing of multiple user-selected files in parallel, improving efficiency. - Refactored existing indexing logic to handle batch processing, including error handling for individual file failures. - Added unit tests for the new batch indexing functionality, ensuring robustness and proper error collection during the indexing process.	2026-03-27 00:17:07 +05:30
Anish Sarkar	c016962064	feat: implement parallel file downloading and indexing in Google Drive indexer - Added `_download_files_parallel` function to enable concurrent downloading of files from Google Drive, improving efficiency in document processing. - Introduced `_download_and_index` function to handle the parallel downloading and indexing phases, streamlining the overall workflow. - Updated `_index_full_scan` and `_index_with_delta_sync` methods to utilize the new parallel downloading functionality, enhancing performance. - Added unit tests to validate the new parallel downloading and indexing logic, ensuring robustness and error handling during document processing.	2026-03-26 23:53:26 +05:30
Anish Sarkar	4fd776e7ef	feat: implement parallel indexing for Google Calendar and Gmail connectors - Refactored Google Calendar and Gmail indexers to utilize the new `index_batch_parallel` method for concurrent document indexing, enhancing performance. - Updated the indexing logic to replace serial processing with parallel execution, allowing for improved efficiency in handling multiple documents. - Adjusted logging and error handling to accommodate the new parallel processing approach, ensuring robust operation during indexing. - Enhanced unit tests to validate the functionality of the parallel indexing method and its integration with existing workflows.	2026-03-26 19:34:04 +05:30
Anish Sarkar	c3d5c865fd	fix: update file skipping logic in Google Drive indexer - Modified the `_should_skip_file` function to prevent skipping of documents with a FAILED status, ensuring they are reprocessed even if their content remains unchanged. - Added a new integration test to verify that FAILED documents are not skipped during the indexing process.	2026-03-25 18:51:40 +05:30
Anish Sarkar	f7b52470eb	feat: enhance Google connectors indexing with content extraction and document migration - Added `download_and_extract_content` function to extract content from Google Drive files as markdown. - Updated Google Drive indexer to utilize the new content extraction method. - Implemented document migration logic to update legacy Composio document types to their native Google types. - Introduced identifier hashing for stable document identification. - Improved file pre-filtering to handle unchanged and rename-only files efficiently.	2026-03-25 18:33:44 +05:30
Anish Sarkar	778cfac6fa	Merge remote-tracking branch 'upstream/dev' into impr/thinking-steps	2026-03-25 01:50:10 +05:30
CREDO23	5d8a62a4a6	merge upstream/dev into feat/migrate-electric-to-zero Resolve 8 conflicts: - Accept upstream deletion of 3 composio_*_connector.py (unified Google connectors) - Accept our deletion of ElectricProvider.tsx, use-connectors-electric.ts, use-messages-electric.ts (replaced by Zero equivalents) - Keep both new deps in package.json (@rocicorp/zero + @slate-serializers/html) - Regenerate pnpm-lock.yaml	2026-03-24 17:40:34 +02:00
Anish Sarkar	a009cae62a	refactor: remove link_preview tool and associated components to streamline agent functionality	2026-03-24 17:15:29 +05:30
Anish Sarkar	6c507989d2	refactor: remove display_image tool and update related components to streamline image handling	2026-03-24 16:28:11 +05:30
CREDO23	cf21eaacfc	fix: critical timestamp parsing and audit fixes - Fix timestamp conversion: String(epochMs) → new Date(epochMs).toISOString() in use-messages-sync, use-comments-sync, use-documents, use-inbox. Without this, date comparisons (isEdited, cutoff filters) would fail. - Fix updated_at: undefined → null in use-inbox to match InboxItem type - Fix ZeroProvider: skip Zero connection for unauthenticated users - Clean 30+ stale "Electric SQL" comments in backend Python code	2026-03-23 19:49:28 +02:00
Anish Sarkar	5c598e8588	Merge remote-tracking branch 'upstream/dev' into feat/human-in-the-loop	2026-03-22 15:45:45 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	d90b6d35ce	feat: enhance video presentation agent with parallel theme assignment and watermarking	2026-03-21 23:02:09 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	b28f135a96	feat: init video presentation agent	2026-03-21 22:13:41 -07:00
Anish Sarkar	2bc6a0c3bc	chore: ran linting	2026-03-22 00:43:53 +05:30
Anish Sarkar	de8841fb86	chore: ran linting	2026-03-21 13:20:13 +05:30
Anish Sarkar	79bc123439	feat: implement lazy imports for token refresh in Confluence and Jira connectors - Refactored token refresh logic in ConfluenceHistoryConnector and JiraHistoryConnector to use lazy imports, avoiding circular dependencies. - Enhanced the ComposerAction component to manage tool availability based on connected types, adding support for Jira and Confluence tools. - Updated tool icon management to include Jira and Confluence, improving the user interface for tool interactions.	2026-03-21 12:41:06 +05:30
Anish Sarkar	9a750fba74	feat: update Google Calendar tool actions in chat streaming	2026-03-21 01:44:54 +05:30
Anish Sarkar	bd2d633546	feat: enhance Gmail account handling and expand chat tool capabilities - Updated the GmailAccount class to extract email from the connector name when formatted with " - ". - Added new tool actions for Gmail and Google Calendar, including creating drafts, sending emails, and managing calendar events, improving integration and user functionality.	2026-03-20 20:18:03 +05:30
Anish Sarkar	d21593ee71	feat: unify handling of native and legacy document types for Google connectors - Introduced a mapping of native Google document types to their legacy Composio equivalents, ensuring seamless search and indexing for both types. - Updated relevant components to utilize the new mapping, enhancing the consistency of document type handling across the application. - Improved search functionality to transparently include legacy types, maintaining accessibility for older documents until re-indexed.	2026-03-20 03:41:32 +05:30
Anish Sarkar	aaf34800e6	feat: enhance legacy document migration for Google connectors - Implemented fallback logic in Google Calendar, Drive, and Gmail indexers to handle legacy Composio document types, ensuring smooth migration to native types. - Updated document indexing functions to check for existing documents using both primary and legacy hashes, improving data integrity during indexing.	2026-03-20 03:39:05 +05:30
Anish Sarkar	8e7cda31c5	feat: update Google indexing functions to track skipped messages - Modified the indexing functions for Google Calendar and Gmail to return the count of skipped messages alongside indexed messages, enhancing performance tracking. - Updated related tests to accommodate the new return values, ensuring comprehensive coverage of the indexing process. - Improved error handling to maintain consistency in returned values across different indexing functions.	2026-03-19 20:56:40 +05:30
Anish Sarkar	e9485ab2df	feat: update Google Drive indexing to include skipped file tracking	2026-03-19 20:27:50 +05:30
Anish Sarkar	eac4cb6075	feat: enhance Google Drive indexing to track skipped files - Updated the indexing function to return the count of skipped files alongside indexed files, improving tracking of indexing performance. - Added logic to accumulate skipped file counts during the indexing process, providing better insights into potential issues. - Enhanced notification updates to include skipped file counts, ensuring comprehensive progress reporting for users.	2026-03-19 20:27:36 +05:30
Anish Sarkar	2390bd7d26	feat: enhance Google Drive authentication error handling - Improved error handling for Google Drive indexing and listing operations to manage authentication failures more effectively. - Added logic to mark connectors as 'auth_expired' when a 401 error or invalid credentials are detected, prompting users to re-authenticate. - Updated error messages to provide clearer guidance on authentication issues, ensuring a better user experience.	2026-03-19 18:24:41 +05:30
Anish Sarkar	83152e8e7e	refactor: unify all 3 google Composio and non-Composio connector types and pipelines keeping same credential adapters	2026-03-19 05:08:21 +05:30
Anish Sarkar	ac0f2fa2eb	chore: ran linting	2026-03-17 04:40:46 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	2b33dfe728	refactor: update safe_set_chunks function to be asynchronous and modify all connector and document processor files to use the new async implementation	2026-03-15 00:44:27 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	d8a05ae4d5	feat: refactor agent tools management and add UI integration - Added endpoint to list agent tools with metadata, excluding hidden tools. - Updated NewChatRequest and RegenerateRequest schemas to include disabled tools. - Integrated disabled tools management in the NewChatPage and Composer components. - Improved tool instructions and visibility in the system prompt. - Refactored tool registration to support hidden tools and default enabled states. - Enhanced document chunk creation to handle strict zip behavior. - Cleaned up imports and formatting across various files for consistency.	2026-03-10 17:36:26 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	403097646d	feat: implement batch unread counts for notifications to reduce API calls and improve performance	2026-03-10 01:26:37 -07:00

1 2 3 4 5 ...

467 commits