trustgraph/trustgraph-unstructured/trustgraph/decoding/universal
cybermaggedon 6b1dd16f9f
fix: large document handling and Cassandra query pagination (#969)
- Paginate heavy Cassandra reads (triples, graph/document embeddings)
  using synchronous session.execute() in run_in_executor with fetch_size
  paging, preventing materialization hang on large result sets
- Fix document stream endpoint to use workspace-scoped librarian queues
- Add decoder error handling for PDF/OCR/unstructured processors
- Add WebSocket mux guards for missing auth fields
- Add null check in librarian document streaming
- Rewrite get_document_content CLI to stream via librarian
- Add Poppler dependency to unstructured container
2026-06-01 22:39:30 +01:00
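The first bullet of the commit message describes running a synchronous, paged `session.execute()` inside `run_in_executor`. The sketch below illustrates that general pattern only; it is not the code from `processor.py` or any other file in this repository. The names `fetch_all_paged` and `FakeSession` are hypothetical. With the real DataStax Python driver, the statement would typically be a `cassandra.query.SimpleStatement(query, fetch_size=N)`, which is assumed here and not imported so the example runs without a cluster.

```python
import asyncio
from typing import Any, List


class FakeSession:
    """Hypothetical stand-in for a driver session, for demonstration only."""

    def __init__(self, rows: List[Any]) -> None:
        self._rows = rows

    def execute(self, statement: Any):
        # A real driver session returns a result set that fetches further
        # pages on demand while it is iterated; a plain iterator mimics that.
        return iter(self._rows)


async def fetch_all_paged(session: Any, statement: Any) -> List[Any]:
    """Drain a blocking, paged query in a worker thread.

    Assumption: `statement` carries its own page size (for example a
    SimpleStatement built with fetch_size=N when using the real driver).
    """
    loop = asyncio.get_running_loop()

    def _drain() -> List[Any]:
        # Iteration blocks on each page fetch, so it runs off the event loop.
        return [row for row in session.execute(statement)]

    # None selects the event loop's default thread pool executor.
    return await loop.run_in_executor(None, _drain)
```

A call such as `asyncio.run(fetch_all_paged(session, statement))` keeps the event loop free while the worker thread walks the pages one at a time.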
__init__.py Add universal document decoder with multi-format support (#705) 2026-03-23 12:56:35 +00:00
__main__.py Add universal document decoder with multi-format support (#705) 2026-03-23 12:56:35 +00:00
processor.py fix: large document handling and Cassandra query pagination (#969) 2026-06-01 22:39:30 +01:00
strategies.py Add universal document decoder with multi-format support (#705) 2026-03-23 12:56:35 +00:00