feat: add document pipeline — PDF decoder, Ollama LLM, storage services

Add end-to-end document processing pipeline: - PDF decoder service (pdfjs-dist) extracts text per page from librarian docs - Ollama native LLM service for local model inference - FalkorDB triples store FlowProcessor consumer - Qdrant graph embeddings store FlowProcessor consumer - Fix spec name collisions in chunker/extractor (input→chunk-input, etc.) - Gateway /load endpoint to trigger document processing - Align flow manager blueprint and seed config with full pipeline topics - Add runner scripts and test coverage for document load Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-07-03 23:11:00 +02:00 · 2026-04-06 23:47:43 -05:00 · 2026-04-06 23:47:43 -05:00 · 8f7008822a
commit 8f7008822a
parent 8f9de7604e
20 changed files with 894 additions and 37 deletions
--- a/ts/packages/flow/src/gateway/dispatch/manager.ts
+++ b/ts/packages/flow/src/gateway/dispatch/manager.ts
@ -235,6 +235,18 @@ export class DispatcherManager {
    });
  }

+  // ---------- Fire-and-forget publish ----------
+
+  /**
+   * Publish a single message to an arbitrary topic (no request/response).
+   * Used for injecting documents into the processing pipeline.
+   */
+  async publishToTopic(topic: string, message: unknown): Promise<void> {
+    const producer = await this.pubsub.createProducer<unknown>({ topic });
+    await producer.send(message);
+    await producer.close();
+  }
+
  // ---------- Static introspection ----------

  static get flowServiceNames(): readonly string[] {