SurfSense/surfsense_backend/app/etl_pipeline/parsers
DESKTOP-RTLN3BA\$punk 656e061f84
Some checks are pending
Build and Push Docker Images / tag_release (push) Waiting to run
Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions
Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions
Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions
Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions
Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions
Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions
feat: add processing mode support for document uploads and ETL pipeline, improded error handling ux
- Introduced a `ProcessingMode` enum to differentiate between basic and premium processing modes.
- Updated `EtlRequest` to include a `processing_mode` field, defaulting to basic.
- Enhanced ETL pipeline services to utilize the selected processing mode for Azure Document Intelligence and LlamaCloud parsing.
- Modified various routes and services to handle processing mode, affecting document upload and indexing tasks.
- Improved error handling and logging to include processing mode details.
- Added tests to validate processing mode functionality and its impact on ETL operations.
2026-04-14 21:26:00 -07:00
..
__init__.py feat: add audio transcription functionality to ETL pipeline 2026-04-05 17:26:03 +05:30
audio.py feat: add audio transcription functionality to ETL pipeline 2026-04-05 17:26:03 +05:30
azure_doc_intelligence.py feat: add processing mode support for document uploads and ETL pipeline, improded error handling ux 2026-04-14 21:26:00 -07:00
direct_convert.py feat: add direct_convert module to ETL pipeline for file conversion 2026-04-05 17:26:29 +05:30
docling.py feat: add parsers for Docling, LlamaCloud, and Unstructured to ETL pipeline 2026-04-05 17:27:24 +05:30
llamacloud.py feat: add processing mode support for document uploads and ETL pipeline, improded error handling ux 2026-04-14 21:26:00 -07:00
plaintext.py feat: add plaintext parser to ETL pipeline for reading text files 2026-04-05 17:26:42 +05:30
unstructured.py feat: add parsers for Docling, LlamaCloud, and Unstructured to ETL pipeline 2026-04-05 17:27:24 +05:30
vision_llm.py Harden vision LLM fallback, folder upload validation, and export memory 2026-04-09 16:14:53 +02:00