SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-04-26 17:26:23 +02:00

Author	SHA1	Message	Date
Anish Sarkar	97d7207bd4	fix: update Google Drive indexer to use SQLAlchemy casting for metadata queries - Modified the Google Drive indexer to use SQLAlchemy's cast function for querying document metadata, ensuring proper type handling for file IDs. - Improved the consistency of metadata queries across the indexing functions, enhancing reliability in document retrieval and processing.	2026-01-24 04:33:10 +05:30
Anish Sarkar	5cf6fb15ed	fix: improve error logging for indexing tasks across multiple connectors - Updated error handling in the indexing functions for BookStack, Confluence, Google Calendar, Jira, Linear, and Luma connectors to log specific error messages when failures occur. - Enhanced logging for cases where no pages or events are found, providing clearer informational messages instead of treating them as critical errors. - Ensured consistent error reporting across all connector indexers, improving debugging and user feedback during indexing operations.	2026-01-24 03:59:17 +05:30
Anish Sarkar	c48ba36fa4	feat: improve indexing logic and duplicate handling in connectors - Enhanced Google Calendar and Composio connector indexing to track and log duplicate content, preventing re-indexing of already processed events. - Implemented robust error handling during final commits to manage integrity errors gracefully, ensuring successful indexing despite potential duplicates. - Updated notification service to differentiate between actual errors and warnings for duplicate content, improving user feedback. - Refactored date handling to ensure valid date ranges and adjusted end dates when necessary for better indexing accuracy.	2026-01-23 23:36:14 +05:30
Anish Sarkar	d20bb385b5	feat: enhance date handling and indexing logic across connectors - Added normalization for "undefined" strings to None in date parameters to prevent parsing errors. - Improved date range validation to ensure start_date is strictly before end_date, adjusting end_date if necessary. - Updated Google Calendar and Composio connector indexing logic to handle duplicate content more effectively, logging warnings for skipped events. - Enhanced error handling during final commits to manage integrity errors gracefully. - Refactored date handling in various connector indexers for consistency and reliability.	2026-01-23 23:03:29 +05:30
Anish Sarkar	1343fabeee	feat: refactor composio connectors for modularity	2026-01-23 19:56:19 +05:30
Anish Sarkar	8d8f69545e	feat: improve Google Calendar and Gmail connectors with enhanced error handling - Added user-friendly re-authentication messages for expired or revoked tokens in both Google Calendar and Gmail connectors. - Updated error handling in indexing tasks to log specific authentication errors and provide clearer feedback to users. - Enhanced the connector UI to handle indexing failures more effectively, improving overall user experience.	2026-01-23 18:57:10 +05:30
Anish Sarkar	29382070aa	feat: enhance Composio connector functionality with Google Drive delta sync support - Added methods to retrieve the starting page token and list changes in Google Drive, enabling delta sync capabilities. - Updated Composio service to handle file download directory configuration. - Modified indexing tasks to support delta sync, improving efficiency by processing only changed files. - Adjusted date handling in connector tasks to allow optional start and end dates. - Improved error handling and logging throughout the Composio indexing process.	2026-01-23 18:37:09 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	ad475397c4	feat(chat): add regenerate endpoint for chat threads to support editing and reloading responses	2026-01-23 01:42:10 -08:00
Anish Sarkar	fae52345f8	Merge remote-tracking branch 'upstream/dev' into feat/composio	2026-01-23 14:35:17 +05:30
Manoj Aggarwal	49d51ba569	merge	2026-01-22 20:57:48 -08:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	8b81507739	refactor: remove unused COMPOSIO_CONNECTOR migration and linting	2026-01-22 16:43:08 -08:00
Anish Sarkar	8a0b8346a5	chore: ran linting	2026-01-23 05:28:18 +05:30
Anish Sarkar	42752bbeab	feat: improve Composio file processing and error handling - Enhanced the handling of file content from Composio, supporting both binary and text files with appropriate processing methods. - Introduced robust error logging and handling for file content extraction, ensuring better visibility into issues during processing. - Updated the indexing logic to accommodate new content processing methods, improving overall reliability and user feedback on errors. - Added temporary file handling for binary files to facilitate text extraction using the ETL service.	2026-01-23 05:28:03 +05:30
Anish Sarkar	7ec7ed5c3b	feat: enhance Composio Google Drive integration with folder and file selection - Added a new endpoint to list folders and files in a user's Composio Google Drive, supporting hierarchical structure. - Implemented UI components for selecting specific folders and files to index, improving user control over indexing options. - Introduced indexing options for maximum files per folder and inclusion of subfolders, allowing for customizable indexing behavior. - Enhanced error handling and logging for Composio Drive operations, ensuring better visibility into issues during file retrieval and indexing. - Updated the Composio configuration component to reflect new selection capabilities and indexing options.	2026-01-23 05:17:28 +05:30
Anish Sarkar	4cbf80d73a	feat: enhance Composio integration with pagination and improved error handling - Updated the list_gmail_messages method to support pagination with page tokens, allowing for more efficient message retrieval. - Modified the return structure to include next_page_token and result_size_estimate for better client-side handling. - Improved error handling and logging throughout the Gmail indexing process, ensuring better visibility into failures. - Implemented batch processing for Gmail messages, committing changes incrementally to prevent data loss. - Ensured consistent timestamp updates for connectors, even when no documents are indexed, to maintain accurate UI states. - Refactored the indexing logic to streamline message processing and enhance overall performance.	2026-01-23 04:44:37 +05:30
Manoj Aggarwal	4b60a2b805	nit	2026-01-22 13:01:10 -08:00
CREDO23	1a2fa23916	Merge upstream/dev with live collaboration features	2026-01-22 23:00:42 +02:00
Manoj Aggarwal	f0760c14e9	Merge dev into feature/obsidian - resolved conflicts keeping both Obsidian and Composio connectors	2026-01-22 11:43:18 -08:00
Anish Sarkar	be5715cfeb	feat: add Composio connector types and enhance integration - Introduced new enum values for Composio connectors: COMPOSIO_GOOGLE_DRIVE_CONNECTOR, COMPOSIO_GMAIL_CONNECTOR, and COMPOSIO_GOOGLE_CALENDAR_CONNECTOR. - Updated database migration to add these new enum values to the relevant types. - Refactored Composio integration logic to handle specific connector types, improving the management of connected accounts and indexing processes. - Enhanced frontend components to support the new Composio connector types, including updated UI elements and connector configuration handling. - Improved backend services to manage Composio connected accounts more effectively, including deletion and indexing tasks.	2026-01-22 22:33:28 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	12b825bff0	Merge branch 'dev' of https://github.com/MODSetter/SurfSense into dev	2026-01-21 22:58:48 -08:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	8c625d4237	feat: composio connector	2026-01-21 22:57:58 -08:00
Manoj Aggarwal	88a9a5bef2	format	2026-01-21 22:54:09 -08:00
Manoj Aggarwal	8a83424be5	Add support for obsidian to surfsense	2026-01-21 15:21:06 -08:00
Rohan Verma	cc658789e4	Merge pull request #722 from manojag115/feature/user-memory Add user memory feature to SurfSense	2026-01-21 14:54:06 -08:00
Manoj Aggarwal	48fb38bafc	Add ability to store and retreive user memory through mcp tool	2026-01-20 15:04:07 -08:00
Anish Sarkar	35888144eb	refactor: Update GitHub connector to use gitingest CLI - Refactored GitHubConnector to utilize gitingest CLI via subprocess, improving performance and avoiding async issues with Celery. - Updated ingestion method to handle repository digests more efficiently, including error handling for subprocess execution. - Adjusted GitHub indexer to call the new synchronous ingestion method. - Clarified documentation regarding the optional nature of the Personal Access Token for public repositories.	2026-01-20 23:24:33 +05:30
Anish Sarkar	49b8a46d10	feat: Integrate gitingest for GitHub repository ingestion - Added gitingest as a dependency to streamline the ingestion of GitHub repositories. - Refactored GitHubConnector to utilize gitingest for efficient repository digest generation, reducing API calls. - Updated GitHub indexer to process entire repository digests, enhancing performance and simplifying the indexing process. - Modified GitHub connect form to indicate that the Personal Access Token is optional for public repositories.	2026-01-20 21:52:32 +05:30
CREDO23	dc628198ce	Integrate session state into chat streaming	2026-01-20 16:40:38 +02:00
Anish Sarkar	e0be1b9133	chore: ran backend and frontend linting	2026-01-17 16:30:07 +05:30
Anish Sarkar	f538d59ca3	feat: enhance Google Drive file metadata handling - Updated Google Drive API calls to include md5Checksum in file metadata retrieval for improved content tracking. - Added logic to check for rename-only updates based on md5Checksum, optimizing document processing by preventing unnecessary ETL operations for unchanged content. - Enhanced existing document update logic to handle renaming and metadata updates more effectively, particularly for Google Drive files.	2026-01-17 16:24:53 +05:30
Anish Sarkar	49efc50767	feat: enhance document processing with content hash deduplication - Added support for content hash fallback in document migration to prevent duplicate entries from different sources. - Improved existing document update logic to handle renaming and metadata updates more effectively, particularly for Google Drive files. - Updated functions to check for existing documents with enhanced logging for better traceability of duplicate content detection.	2026-01-17 15:39:36 +05:30
Anish Sarkar	6550c378b2	feat: enhance Google Drive document handling and UI integration - Implemented support for both new file_id-based and legacy filename-based hash schemes in document processing. - Added functions to generate unique identifier hashes and find existing documents with migration support. - Improved existing document update logic to handle content changes and metadata updates, particularly for Google Drive files. - Enhanced UI components to display appropriate file icons based on file types in the Google Drive connector. - Updated document processing functions to accommodate the new connector structure and ensure seamless integration.	2026-01-17 14:57:31 +05:30
Anish Sarkar	7af3d1bc1a	feat: improve Google Drive connector handling and UI feedback - Added logic to refresh connector and notification attributes after indexing to ensure up-to-date information. - Enhanced periodic sync configuration to disable the option when no folders or files are selected for Google Drive, providing user feedback through a message. - Updated the connector edit view to reflect the new disabled state for periodic sync based on selected items. - Implemented validation in the connector dialog to prevent enabling periodic sync without selected items, improving user experience.	2026-01-17 12:59:18 +05:30
Anish Sarkar	a3112a24fe	feat: enhance Google Drive indexing with new options - Updated the Google Drive indexing functionality to include indexing options such as max files per folder, incremental sync, and inclusion of subfolders. - Modified the API to accept a new 'indexing_options' parameter in the request body. - Enhanced the UI to allow users to configure these options when selecting folders and files for indexing. - Updated related components and tasks to support the new indexing options, ensuring a more flexible and efficient indexing process.	2026-01-17 12:33:57 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	8aad15d392	Reapply "Merge pull request #686 from AnishSarkar22/feat/replace-logs" This reverts commit `3418c0e026`.	2026-01-16 11:32:06 -08:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	3418c0e026	Revert "Merge pull request #686 from AnishSarkar22/feat/replace-logs" This reverts commit `5963a1125e`, reversing changes made to `0d2a2f8ea1`.	2026-01-16 00:49:33 -08:00
Anish Sarkar	28aa4814bd	refactor: improve chat UI and greeting logic - Simplified chat document processing display by removing the book emoji for a cleaner look. - Enhanced the greeting function to prioritize user display names over email for a more personalized experience. - Adjusted the ChatShareButton component by removing unused imports and unnecessary elements for better clarity and performance. - Updated the title in the Electric SQL documentation for conciseness.	2026-01-15 18:29:30 +05:30
Anish Sarkar	ab63b23f0a	Merge remote-tracking branch 'upstream/dev' into feat/replace-logs	2026-01-15 15:52:47 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	7ae68455b3	chore: linting	2026-01-15 00:05:53 -08:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	bab89274e0	feat: implement LlamaCloud parsing with retry logic for transient errors	2026-01-14 23:53:17 -08:00
Anish Sarkar	2e0f742000	Merge remote-tracking branch 'upstream/dev' into feat/replace-logs	2026-01-15 03:07:20 +05:30
Anish Sarkar	7023223213	chore: ran backend linting	2026-01-14 13:28:50 +05:30
Anish Sarkar	1ea0475f73	refactor: Improve indexing notification handling and return values - Enhanced error handling in the indexing process to differentiate between actual failures and cases where no new documents are processed. - Updated notification messages to reflect the status accurately, including a message for when no new items are synced. - Standardized return values across various indexer tasks to return `None` on success, simplifying logging and error management.	2026-01-14 13:16:11 +05:30
Anish Sarkar	9d0f5b4249	fix: Ensure notification updates are reliable during error handling - Added session refresh for notifications to prevent stale data after rollbacks in multiple document processing tasks. - Wrapped notification update logic in try-except blocks to handle potential failures gracefully and log errors without crashing the process. - Improved error handling for notification updates in various document processing functions, enhancing overall robustness.	2026-01-14 04:01:20 +05:30
Manoj Aggarwal	305a981d14	feat: add MCP connector backend support	2026-01-13 13:46:01 -08:00
Anish Sarkar	5bd6bd3d67	chore: ran both frontend and backend linting	2026-01-14 02:05:40 +05:30
Anish Sarkar	99bd2df463	Merge remote-tracking branch 'upstream/dev' into feat/replace-logs	2026-01-14 02:04:54 +05:30
Anish Sarkar	48b67d9bc1	fix: remove the document processing UI which used polling	2026-01-13 19:31:31 +05:30
Anish Sarkar	12671ede0e	feat: Enhance document processing notifications and refactor related services - Introduced a new DocumentProcessingNotificationHandler to manage notifications for document processing stages. - Updated existing notification methods to include detailed progress updates for various stages (queued, parsing, chunking, embedding, storing, completed, failed). - Refactored NotificationService to support the new document processing notification type and metadata schema. - Updated multiple document processing tasks to create and manage notifications throughout the processing lifecycle. - Adjusted UI components to reflect changes in notification types and improve user experience during document uploads and processing.	2026-01-13 19:09:12 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	25b9118306	feat: implement search space deletion and fixed rback issues with shared chats	2026-01-13 01:45:58 -08:00

... 2 3 4 5 6 ...

438 commits