- Removed outdated dependencies: langchain-community, langchain-unstructured, langgraph, langchain-litellm, and litellm.
- Added updated versions of langchain-community (0.3.31), langchain-unstructured (0.1.6), langgraph (1.0.1), langchain-litellm (0.3.5), and litellm (1.77.5).
- Updated langchain (0.3.27 to 0.3.80) and langchain-community (0.3.25 to 0.3.31) in uv.lock.
- Updated langgraph (0.4.8 to 1.0.1) and langchain-litellm (0.2.3 to 0.3.5) in uv.lock.
- Updated xxhash version from 3.5.0 to 3.6.0 in uv.lock.
- Added DOCLING as a third ETL_SERVICE option (alongside UNSTRUCTURED and LLAMACLOUD)
- Implemented the add_received_file_document_using_docling function
- Added Docling processing logic in documents_routes.py
- Enhanced chunking with configurable overlap support
- Added a document processing service that supports both CPU and GPU processing, selectable by the user
Addresses #161 - Add Docling Support as an ETL_SERVICE
Follows the same pattern as the LlamaCloud integration (PR #123)
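
For reviewers unfamiliar with overlapping chunks, the idea behind "configurable overlap" can be sketched as below. This is only an illustration under stated assumptions: the function name `chunk_text` and its parameters `chunk_size` and `overlap` are hypothetical, the sketch splits on characters, and the actual chunker in this change may split on tokens or sentences instead.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character chunks that share `overlap` characters.

    Hypothetical helper for illustration; not the function added in this commit.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window already reaches the end of the text,
        # so no trailing chunk is fully contained in the previous one.
        if start + chunk_size >= len(text):
            break
    return chunks
```

A larger overlap keeps more context shared between neighbouring chunks at the cost of producing more chunks for the same input.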
- Added support for processing YouTube videos, including transcript extraction and document creation.
- Implemented a new background task for adding YouTube video documents.
- Enhanced the connector service to search for YouTube videos and return relevant results.
- Updated frontend components to include YouTube video options in the dashboard and connector sources.
- Added the dependencies required for the YouTube transcript API.
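
Transcript lookups are keyed by video ID rather than by full URL, so a step like the following is typically needed before a transcript can be requested. This is a minimal sketch using only the standard library; the helper name `extract_video_id` and the set of URL shapes it accepts are assumptions for illustration, not the code added in this change.

```python
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Hostnames treated as YouTube in this sketch (an assumption, not exhaustive).
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com"}


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID from a common YouTube URL shape, or None if not recognised."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()

    # Short links: https://youtu.be/<id>
    if host == "youtu.be":
        video_id = parsed.path.lstrip("/").split("/")[0]
        return video_id or None

    if host in YOUTUBE_HOSTS:
        # Standard links: https://www.youtube.com/watch?v=<id>
        if parsed.path == "/watch":
            values = parse_qs(parsed.query).get("v")
            return values[0] if values else None
        # Embed and Shorts links: /embed/<id>, /shorts/<id>
        for prefix in ("/embed/", "/shorts/"):
            if parsed.path.startswith(prefix):
                video_id = parsed.path[len(prefix):].split("/")[0]
                return video_id or None

    return None
```

Returning `None` for unrecognised URLs lets the caller reject the request before the background task is queued.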