mirror of
https://github.com/MODSetter/SurfSense.git
synced 2026-06-26 21:39:43 +02:00
When the Unstructured API returns a Table element without text_as_html in its metadata (e.g. local install or free-tier API), the lambda was raising KeyError: 'text_as_html', crashing the entire document indexing pipeline for any file containing tables. Guard the key access with .get() and fall back to the plain extracted text content (x) so the pipeline continues and the table content is still indexed, just without HTML formatting. |
||
|---|---|---|
| .. | ||
| proxy | ||
| async_retry.py | ||
| blocknote_to_markdown.py | ||
| chat_comments.py | ||
| connector_naming.py | ||
| content_utils.py | ||
| document_converters.py | ||
| document_versioning.py | ||
| file_extensions.py | ||
| google_credentials.py | ||
| indexing_locks.py | ||
| notion_utils.py | ||
| oauth_security.py | ||
| perf.py | ||
| periodic_scheduler.py | ||
| proxy_config.py | ||
| rbac.py | ||
| refresh_tokens.py | ||
| signed_image_urls.py | ||
| user_message_multimodal.py | ||
| validators.py | ||
| webcrawler_utils.py | ||