SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-05-15 18:25:18 +02:00

Author	SHA1	Message	Date
CREDO23	506a9297a9	fix(connectors): track delta sync tokens per folder for Google Drive - Store tokens in folder_tokens dict instead of single global token - Each folder now tracks its own sync state independently - Fixes issue where indexing folder 2 incorrectly used delta sync after folder 1 was indexed - First-time indexing now correctly uses full scan for each new folder	2025-12-28 18:32:59 +02:00
CREDO23	a5935bc677	feat(connectors): add connector parameter to file processor for source tracking - Add optional 'connector' parameter with 'type' and 'metadata' fields - Create helper function _update_document_from_connector - Use document_metadata column (not metadata) for JSON field - Merge metadata with existing using dict spread operator - Google Drive documents now marked as GOOGLE_DRIVE_CONNECTOR - Backward compatible - no changes to existing logic - Simple and clean implementation	2025-12-28 18:01:39 +02:00
CREDO23	8da58be9e0	fix(connectors): refresh document from DB before updating type - Query document from database to ensure it's attached to session - Prevents detached instance errors after process_file_in_background commits - Properly updates document_type and metadata with session management	2025-12-28 17:21:44 +02:00
CREDO23	b2b891e4d7	fix(connectors): properly commit Google Drive document type changes - Return file metadata from content_extractor for indexer to use - Update document type and metadata in indexer after processing - Explicitly commit changes to database - Ensures documents are properly marked as GOOGLE_DRIVE_CONNECTOR type	2025-12-28 17:15:29 +02:00
CREDO23	9f1fd20944	feat(connectors): mark Google Drive documents with GOOGLE_DRIVE_CONNECTOR type - Change document_type from file type (PDF, DOCX) to GOOGLE_DRIVE_CONNECTOR - Store original file type in metadata for reference - Add Google Drive specific metadata (file_id, mime_type, source) - Include export format info for Google Workspace files - Enables proper source tracking and bulk management	2025-12-28 16:55:14 +02:00
CREDO23	c9815fd6fb	feat(celery): update Google Drive task for multiple folders - Accept comma-separated folder_ids and folder_names parameters - Pass through to indexing function for batch processing	2025-12-28 16:49:47 +02:00
CREDO23	634eeb887e	feat(routes): support multiple Google Drive folder indexing - Accept comma-separated folder_ids and folder_names - Loop through each folder and index sequentially - Collect total indexed count and errors - Update timestamp only on full success	2025-12-28 16:49:20 +02:00
CREDO23	1c83327fc7	feat(celery): add Google Drive indexing Celery task - Create async task for Google Drive folder indexing - Accept folder_id and folder_name parameters - Call indexing wrapper to avoid circular imports	2025-12-28 15:56:11 +02:00
CREDO23	358abdf02f	feat(routes): add Google Drive indexing support with folder selection - Accept folder_id and folder_name as indexing parameters - Hide date range for Google Drive connectors - Create wrapper function to avoid circular imports - Trigger Google Drive indexing Celery task	2025-12-28 15:55:57 +02:00
CREDO23	7b8900d51f	feat(indexer): export Google Drive indexer function	2025-12-28 15:55:46 +02:00
CREDO23	501d08f2f4	feat(routes): register Google Drive OAuth router	2025-12-28 15:55:38 +02:00
CREDO23	1696c7056a	feat(indexer): add Google Drive folder indexing with delta sync - Full folder scan on first index - Delta sync using change tracking for subsequent indexes - Process files in parallel batches - Handle file additions, modifications, and deletions - Store change tracking token for efficient re-indexing	2025-12-28 15:55:25 +02:00
CREDO23	bf02005d82	feat(routes): add Google Drive OAuth and folder listing endpoints - OAuth initialization and callback handling - Folder and file browsing with parent_id support - Validate credentials and handle token refresh - Return folder contents with metadata for UI tree view	2025-12-28 15:55:13 +02:00
CREDO23	3e67d5f31e	feat(connectors): add Google Drive delta sync with change tracking - Get start page token for change tracking baseline - Fetch incremental changes using Google Drive Changes API - Categorize changes into added, modified, and removed files - Enable efficient re-indexing of only changed content	2025-12-28 15:55:06 +02:00
CREDO23	84bde67979	feat(connectors): add Google Drive folder browsing and file listing - List folder contents with full pagination support - Query root folder or specific parent folder - Return both folders and files with metadata (size, icons, links) - Filter out shortcuts and trashed items	2025-12-28 15:54:58 +02:00
CREDO23	40304c6795	feat(connectors): add Google Drive content extraction using existing ETL - Download files from Google Drive to temporary location - Export Google Workspace files as PDF - Delegate content extraction to existing process_file_in_background - Reuse Surfsense's ETL services (Unstructured, LlamaCloud, Docling)	2025-12-28 15:54:50 +02:00
CREDO23	701c3409b3	feat(connectors): add Google Drive file type detection and mapping - Detect Google Workspace files (Docs, Sheets, Slides) - Map to PDF export format to preserve rich content (images, formatting) - Identify files to skip (shortcuts, unsupported types)	2025-12-28 15:54:42 +02:00
CREDO23	74386affdc	feat(connectors): add Google Drive API client wrapper - Build and manage Google Drive service with credentials - List files with query support and pagination - Download binary files and export Google Workspace files as PDF - Handle HTTP errors gracefully	2025-12-28 15:54:32 +02:00
CREDO23	2c8717b14b	feat(connectors): add Google Drive credentials module for OAuth management - Handle Google OAuth credential initialization and validation - Automatic token refresh with database persistence - Reuse existing tokens when valid	2025-12-28 15:54:26 +02:00
CREDO23	2897985127	feat(config): add GOOGLE_DRIVE_REDIRECT_URI environment variable	2025-12-28 15:53:51 +02:00
CREDO23	5dd8838638	feat(db): add idempotent Alembic migration for GOOGLE_DRIVE_CONNECTOR enums	2025-12-28 15:53:44 +02:00
CREDO23	f54079643f	feat(db): add GOOGLE_DRIVE_CONNECTOR to DocumentType and SearchSourceConnectorType enums	2025-12-28 15:53:35 +02:00
Anish Sarkar	2fdf567b71	refactor: replace DocumentsDataTable with DocumentMentionPicker for improved document selection - Introduced DocumentMentionPicker component to enhance document selection experience in the chat interface. - Updated InlineMentionEditor and Composer components to utilize the new DocumentMentionPicker. - Removed the deprecated DocumentsDataTable component to streamline the codebase and improve maintainability. - Enhanced type safety and validation in document handling logic.	2025-12-26 00:41:14 +05:30
Anish Sarkar	bea18960a4	refactor: enhance link preview functionality with Chromium fallback - Added a fallback mechanism using headless Chromium to fetch page content when standard HTTP requests fail. - Introduced utility functions for unescaping HTML entities and converting relative URLs to absolute. - Updated HTTP request headers to mimic a browser for better compatibility with web servers. - Improved error handling and logging for better debugging and user feedback. - Made various properties in Zod schemas nullable for better type safety and flexibility in handling optional data.	2025-12-26 00:07:45 +05:30
Anish Sarkar	9e7f8d7fe3	feat: enhance chat functionality with improved attachment handling and user experience - Updated system prompt to clarify usage of the display_image tool, emphasizing URL requirements and restrictions on user-uploaded images. - Enhanced the streaming chat process to provide more context about user attachments and documents during analysis. - Implemented state resets when switching between chats to prevent stale data and race conditions. - Added new components for displaying image previews and document attachments in the chat interface. - Improved attachment processing to support image data URLs for persistent display after uploads.	2025-12-25 17:52:48 +05:30
DESKTOP-RTLN3BA\$punk	b4b7059035	Revert "Merge pull request #622 from CREDO23/documents-mentions" This reverts commit `fb719faa0d`, reversing changes made to `efd20ea208`.	2025-12-24 18:00:03 -08:00
Rohan Verma	fb719faa0d	Merge pull request #622 from CREDO23/documents-mentions [Fix] Documents mentions | Use the same structure of document returned from retriever	2025-12-24 13:46:31 -08:00
CREDO23	ef9e9b65df	fix: mentioned documents xml structure	2025-12-24 23:35:20 +02:00
CREDO23	3660b91e63	refact: follow the structure of document returned from retriever	2025-12-24 20:12:40 +02:00
DESKTOP-RTLN3BA\$punk	dfed7187bc	feat: updated version to 0.0.9	2025-12-23 22:12:53 -08:00
CREDO23	deec8c5c6c	fix: formatting	2025-12-24 07:06:35 +02:00
Thierry CH.	c4400a0ec2	Merge branch 'dev' into documents-mentions	2025-12-24 06:31:49 +02:00
DESKTOP-RTLN3BA\$punk	0b86756082	chore: ruff format	2025-12-23 02:56:13 -08:00
DESKTOP-RTLN3BA\$punk	acf0396aa5	feat: fixed migrations - Migrated data from the old 'chats' table to 'new_chat_threads' and 'new_chat_messages'. - Dropped the 'chats' table and removed the 'chattype' enum as part of the migration process. - Updated the migration script to truncate thread titles to 500 characters to comply with database constraints. - Adjusted the downgrade function to reflect changes in the migration process.	2025-12-23 02:55:37 -08:00
Anish Sarkar	6f330e7b8d	Merge remote-tracking branch 'upstream/dev' into pr-611	2025-12-23 15:45:28 +05:30
DESKTOP-RTLN3BA\$punk	4a0c3e368a	feat: migrated to surfsense deep agent	2025-12-23 01:16:25 -08:00
Anish Sarkar	ceb01dc544	feat: enhance new chat functionality with document mentions support - Updated the new chat routes to include handling for mentioned document IDs, allowing users to reference specific documents in their chat. - Modified the NewChatRequest schema to accommodate optional document IDs. - Implemented document mention formatting in the chat streaming service for improved context. - Enhanced the frontend to manage document mentions, including a new atom for state management and UI updates for document selection. - Refactored the DocumentsDataTable component for better integration with the new mention functionality.	2025-12-23 14:24:36 +05:30
DESKTOP-RTLN3BA\$punk	b14283e300	feat: refactor new chat agent to support configurable tools and remove deprecated components - Enhanced the new chat agent module to allow for configurable tools, enabling users to customize their experience with various functionalities. - Removed outdated tools including display image, knowledge base search, link preview, podcast generation, and web scraping, streamlining the codebase. - Updated the system prompt and agent factory to reflect these changes, ensuring a more cohesive and efficient architecture.	2025-12-22 20:17:08 -08:00
Anish Sarkar	7ca490c740	feat: enhance chain-of-thought display with smart expand/collapse behavior and state management for improved user interaction	2025-12-23 02:21:41 +05:30
Anish Sarkar	24dd52ed99	feat: add web scraping tool to chat agent for extracting and summarizing webpage content	2025-12-23 01:49:29 +05:30
Anish Sarkar	da7cb81252	feat: introduce display image tool for enhanced image rendering in chat with metadata support	2025-12-23 01:11:56 +05:30
Anish Sarkar	4b69fdf214	feat: add link preview tool for enhanced URL metadata display in chat	2025-12-23 00:58:27 +05:30
Anish Sarkar	8a99752f2f	feat: enhance chat functionality with chain-of-thought display and thinking steps management	2025-12-22 22:54:22 +05:30
DESKTOP-RTLN3BA\$punk	1cbb1b5d66	refactor: remove chat-related fields and legacy podcast generation function	2025-12-21 23:31:11 -08:00
DESKTOP-RTLN3BA\$punk	c2dcb2045d	feat: added attachment support	2025-12-21 22:26:33 -08:00
DESKTOP-RTLN3BA\$punk	bb971460fc	feat: migrated old chat to new chat	2025-12-21 19:33:52 -08:00
DESKTOP-RTLN3BA\$punk	b5e20e7515	feat: old chat to new-chat with persistance	2025-12-21 16:32:55 -08:00
DESKTOP-RTLN3BA\$punk	0c3574d049	feat: implement new chat feature with message persistence and UI integration	2025-12-21 16:16:50 -08:00
Anish Sarkar	35463eeab4	Merge remote-tracking branch 'upstream/dev' into feature/podcast-agent	2025-12-21 20:39:21 +05:30
Anish Sarkar	783ee9c154	feat: enhance podcast generation with duplicate request prevention and improved UI feedback	2025-12-21 20:07:04 +05:30

494 commits
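
Commit 506a9297a9 describes storing Google Drive delta sync tokens in a `folder_tokens` dict so that each folder tracks its own sync state, with a full scan the first time a folder is indexed. A minimal sketch of that idea follows. Only the `folder_tokens` key comes from the commit message; the function names, the shape of the connector config, and the token values are assumptions made for illustration and are not taken from the SurfSense code.

```python
from typing import Any, Dict, Optional, Tuple


def plan_folder_sync(config: Dict[str, Any], folder_id: str) -> Tuple[str, Optional[str]]:
    """Decide how to index one folder (hypothetical helper).

    Returns ("full_scan", None) when the folder has no stored token,
    otherwise ("delta_sync", token) using that folder's own token.
    """
    folder_tokens = config.get("folder_tokens", {})
    token = folder_tokens.get(folder_id)
    if token is None:
        # First-time indexing of this folder: no baseline to diff against.
        return ("full_scan", None)
    return ("delta_sync", token)


def record_folder_token(config: Dict[str, Any], folder_id: str, new_token: str) -> Dict[str, Any]:
    """Return a copy of the config with the token for one folder updated.

    Tokens for other folders are left untouched, so indexing one folder
    never changes the sync state of another.
    """
    folder_tokens = dict(config.get("folder_tokens", {}))
    folder_tokens[folder_id] = new_token
    return {**config, "folder_tokens": folder_tokens}


if __name__ == "__main__":
    config: Dict[str, Any] = {}
    print(plan_folder_sync(config, "folder-1"))  # no token stored yet
    config = record_folder_token(config, "folder-1", "token-a")
    print(plan_folder_sync(config, "folder-1"))  # folder-1 now has a token
    print(plan_folder_sync(config, "folder-2"))  # folder-2 is still new
```

With a single global token, the second call for a new folder would find a token left behind by the first folder and skip the full scan, which is the behaviour the commit message reports as fixed.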