Commit graph

1531 commits

Author SHA1 Message Date
Anish Sarkar
dc7047f64d refactor: implement file type classification for supported extensions across Dropbox, Google Drive, and OneDrive connectors, enhancing file handling and error management 2026-04-06 22:03:47 +05:30
Anish Sarkar
47f4be08d9 refactor: remove allowed_formats from DocumentConverter initialization in DoclingService to allow acceptance of all supported formats 2026-04-06 19:31:42 +05:30
Anish Sarkar
caca491774 test: add unit tests for Dropbox integration, covering delta sync methods, file type filtering, and re-authentication behavior 2026-04-06 18:36:48 +05:30
Anish Sarkar
b5a15b7681 feat: implement cursor-based delta sync for Dropbox integration, enhancing file indexing efficiency and preserving folder cursors during re-authentication 2026-04-06 18:36:29 +05:30
Anish Sarkar
f8913adaa3 test: add unit tests for content extraction from cloud connectors and ETL pipeline functionality 2026-04-05 17:46:04 +05:30
Anish Sarkar
87af012a60 refactor: streamline file processing by integrating ETL pipeline for all file types and removing redundant functions 2026-04-05 17:45:18 +05:30
Anish Sarkar
8224360afa refactor: unify file parsing logic across Dropbox, Google Drive, and OneDrive using the ETL pipeline 2026-04-05 17:30:29 +05:30
Anish Sarkar
1248363ca9 refactor: consolidate document processing logic and remove unused files and ETL strategies 2026-04-05 17:29:24 +05:30
Anish Sarkar
f40de6b695 feat: add parsers for Docling, LlamaCloud, and Unstructured to ETL pipeline 2026-04-05 17:27:24 +05:30
Anish Sarkar
2824410be2 feat: add plaintext parser to ETL pipeline for reading text files 2026-04-05 17:26:42 +05:30
Anish Sarkar
35582c9389 feat: add direct_convert module to ETL pipeline for file conversion 2026-04-05 17:26:29 +05:30
Anish Sarkar
02fc6f1d16 feat: add audio transcription functionality to ETL pipeline 2026-04-05 17:26:03 +05:30
Anish Sarkar
5d22349dc1 feat: implement ETL pipeline with file classification and extraction services 2026-04-05 17:25:25 +05:30
Anish Sarkar
9c0af6569d feat: implement page limit checks in local folder indexing to manage user page usage 2026-04-03 19:13:25 +05:30
Anish Sarkar
edda5b98cb chore: ran linting 2026-04-03 17:38:29 +05:30
Anish Sarkar
b759bb36a9 feat: add direct conversion support for CSV, TSV, and HTML files in local folder indexing 2026-04-03 17:36:48 +05:30
Anish Sarkar
746c730b2e chore: ran linting 2026-04-03 13:14:40 +05:30
Anish Sarkar
3833084dad feat: changed the revision number of folder alembic migration 2026-04-03 11:45:53 +05:30
Anish Sarkar
62b44889d1 Merge remote-tracking branch 'upstream/dev' into feat/local-folder-sync 2026-04-03 11:42:43 +05:30
Anish Sarkar
525fae5abf feat: trigger document reindexing after restoring a document version to ensure content is up-to-date 2026-04-03 11:19:54 +05:30
Anish Sarkar
4ca0787ab7 feat: add version history functionality to document management, enabling users to view and manage document versions through UI updates and backend integration 2026-04-03 10:42:21 +05:30
Anish Sarkar
2b9d79d44c feat: add integration tests for batch processing of local folder indexing, covering multiple file scenarios and error handling 2026-04-03 10:04:14 +05:30
Anish Sarkar
1fa8e1cc83 feat: refactor folder indexing to support batch processing of multiple files, enhancing performance and error handling 2026-04-03 10:02:36 +05:30
Anish Sarkar
e2ba509314 feat: enhance error handling in local folder indexing by adding rollback and refresh on IntegrityError 2026-04-03 09:29:59 +05:30
DESKTOP-RTLN3BA\$punk
6b06d3abb1 chore: optimize zero publication column migration process
- Updated migration instructions to emphasize the importance of stopping zero-cache before and after running the migration.
- Added a function to terminate blocked PIDs that could interfere with the migration.
- Set a lock timeout to prevent deadlocks during the migration process.
2026-04-02 20:38:10 -07:00
DESKTOP-RTLN3BA\$punk
62e698d8aa refactor: streamline document upload limits and enhance handling of mentioned documents
- Updated maximum file size limit to 500 MB per file.
- Removed restrictions on the number of files per upload and total upload size.
- Enhanced handling of user-mentioning documents in the knowledge base search middleware.
- Improved document reading and processing logic to accommodate new features and optimizations.
2026-04-02 19:39:10 -07:00
Anish Sarkar
fe7fcaae5d feat: update folder deletion process to queue document deletions first and handle folder cleanup in Celery task 2026-04-03 04:16:19 +05:30
Anish Sarkar
44e39792da feat: assign folder_id to documents before indexing to ensure correct folder visibility during processing 2026-04-03 04:14:28 +05:30
Anish Sarkar
bd21c2842d feat: enhance document upload and folder synchronization UI with improved processing state indicators and responsive design adjustments 2026-04-03 04:14:09 +05:30
Anish Sarkar
f0a7c7134a feat: move endpoint above to retrieve watched folders based on search space ID 2026-04-03 00:09:17 +05:30
Anish Sarkar
25358fddcf feat: implement local folder synchronization and versioning with new metadata handling and document_versions table 2026-04-02 23:46:21 +05:30
Anish Sarkar
53df393cf7 refactor: streamline local folder indexing logic by removing unused imports, enhancing content hashing, and improving document creation process 2026-04-02 23:28:23 +05:30
Anish Sarkar
c27d24a117 feat: enhance folder indexing by adding root folder ID support and implement folder creation and cleanup logic 2026-04-02 22:41:45 +05:30
Anish Sarkar
caf2525ab5 fix: update folder ID collection logic to include deleted directories and adjust test cases for document titles 2026-04-02 22:29:07 +05:30
Anish Sarkar
22ee5c99cc refactor: remove Local Folder connector and related tasks, implement new folder indexing endpoints 2026-04-02 22:21:31 +05:30
Anish Sarkar
40ade4889e feat: add LOCAL_FOLDER_FILE document type and update document_versions table management 2026-04-02 21:01:31 +05:30
Anish Sarkar
8e58094a86 refactor: update permission checks in document and connector routes to use specific permission values 2026-04-02 12:13:14 +05:30
Anish Sarkar
775dea7894 feat: add integration and unit tests for local folder indexing and document versioning 2026-04-02 11:12:16 +05:30
Anish Sarkar
96a58d0d30 feat: implement local folder indexing and document versioning capabilities 2026-04-02 11:11:57 +05:30
Anish Sarkar
3b92e99d28 feat: add local folder connector and document versioning functionality 2026-04-02 10:35:32 +05:30
Anish Sarkar
000c2d9b5b style: simplify LLM model terminology in UI 2026-04-02 10:11:35 +05:30
Anish Sarkar
d7dd6db1b9 fix: improve document loading error handling and UI feedback for processing state 2026-04-01 20:31:45 +05:30
CREDO23
2a809d0418 fix: make migration 109 idempotent 2026-04-01 11:12:21 +02:00
CREDO23
cd1bb79db8 feat: add migration to create zero_publication for zero-cache 2026-04-01 09:56:33 +02:00
DESKTOP-RTLN3BA\$punk
529af2be39 feat: bumped version to 0.0.14 2026-03-31 21:45:18 -07:00
DESKTOP-RTLN3BA\$punk
ad0e77c3d6 feat: enhance knowledge base search with date filtering 2026-03-31 20:13:46 -07:00
DESKTOP-RTLN3BA\$punk
a9fd45844d feat: integrate Stripe for page purchases and reconciliation tasks 2026-03-31 18:39:45 -07:00
DESKTOP-RTLN3BA\$punk
17642493eb chore: linting 2026-03-31 14:45:46 -07:00
CREDO23
0f0bb5509c Merge upstream/dev, renumber migrations after dropbox 112 2026-03-31 22:43:14 +02:00
CREDO23
1aeb5ba645 Merge remote-tracking branch 'upstream/dev' into feature/prompt-library 2026-03-31 22:41:53 +02:00