SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-04-26 01:06:23 +02:00

Author	SHA1	Message	Date
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	656e061f84	feat: add processing mode support for document uploads and ETL pipeline, improded error handling ux Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Details Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Details Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Details Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Details Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Details Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Details Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions Details - Introduced a `ProcessingMode` enum to differentiate between basic and premium processing modes. - Updated `EtlRequest` to include a `processing_mode` field, defaulting to basic. - Enhanced ETL pipeline services to utilize the selected processing mode for Azure Document Intelligence and LlamaCloud parsing. - Modified various routes and services to handle processing mode, affecting document upload and indexing tasks. - Improved error handling and logging to include processing mode details. - Added tests to validate processing mode functionality and its impact on ETL operations.	2026-04-14 21:26:00 -07:00
CREDO23	a95bf58c8f	Make Vision LLM opt-in for uploads and connectors	2026-04-10 16:45:51 +02:00
Anish Sarkar	f03bf05aaa	refactor: enhance Google Drive indexer to support file extension filtering, improving file handling and error reporting	2026-04-06 22:34:49 +05:30
Anish Sarkar	a2b3541046	chore: ran linting	2026-04-04 03:11:56 +05:30
Anish Sarkar	ce40da80ea	feat: implement page limit estimation and enforcement in file based connector indexers - Added a static method `estimate_pages_from_metadata` to `PageLimitService` for estimating page counts based on file metadata. - Integrated page limit checks in Google Drive, Dropbox, and OneDrive indexers to prevent exceeding user quotas during file indexing. - Updated relevant indexing methods to utilize the new page estimation logic and enforce limits accordingly. - Enhanced tests for page limit functionality, ensuring accurate estimation and enforcement across different file types.	2026-04-04 02:51:28 +05:30
Anish Sarkar	851856a54b	fix: update document cleanup logic and mock Celery task in tests	2026-03-11 12:27:32 +05:30
Rohan Verma	547077e5b9	Merge pull request #865 from CREDO23/sur-182-fix-ux-experience-for-composio-google-drive-connector [Perf] Batch embedding, non-blocking search, chunks index & Google Drive UX fix	2026-03-10 12:52:16 -07:00
CREDO23	e951fbb991	fix: update stale embed_text mock in document_upload tests	2026-03-09 21:47:27 +02:00
Anish Sarkar	ca3710a239	fix: remove slowapi limiter for testing	2026-03-08 02:41:05 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	a4dc84d1ab	feat: add should_summarize parameter to task dispatchers - Introduced should_summarize parameter in TaskDispatcher and CeleryTaskDispatcher to control summary generation. - Updated InlineTaskDispatcher to support the new parameter for document processing.	2026-02-26 19:12:37 -08:00
Anish Sarkar	223c2de0d2	refactor: update database connection handling in test configurations	2026-02-27 00:05:21 +05:30
Anish Sarkar	3393e435f9	feat: implement task dispatcher for document processing - Introduced a TaskDispatcher abstraction to decouple the upload endpoint from Celery, allowing for easier testing with synchronous implementations. - Updated the create_documents_file_upload function to utilize the new dispatcher for task management. - Removed direct Celery task imports from the upload function, enhancing modularity. - Added integration tests for document upload, including page limit enforcement and file size restrictions.	2026-02-26 23:55:47 +05:30

12 commits