trustgraph/trustgraph-flow/trustgraph
cybermaggedon 7f5f2f955d
Fix Metadata/EntityEmbeddings schema migration tail and add regression tests (#776)
The Metadata dataclass dropped its `metadata: list[Triple]` field
and EntityEmbeddings/ChunkEmbeddings settled on a singular
`vector: list[float]` field, but several call sites kept passing
`Metadata(metadata=...)` and `EntityEmbeddings(vectors=...)`. The
bugs were latent until a websocket client first hit
`/api/v1/flow/default/import/entity-contexts`, at which point the
dispatcher raised a TypeError on construction.
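
The failure mode described above can be reproduced with a minimal sketch. The two dataclasses here are hypothetical stand-ins, not the real trustgraph schema: the field names other than `vector` and the removed `metadata` are assumptions chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Metadata:
    # Stand-in schema (assumed fields): the old `metadata` triples list
    # has been removed, so it is no longer a valid constructor kwarg.
    id: str = ""
    user: str = ""
    collection: str = ""


@dataclass
class EntityEmbeddings:
    # Stand-in schema (assumed fields): singular `vector`, not `vectors`.
    entity: str = ""
    vector: list[float] = field(default_factory=list)


def construct_stale():
    # A call site left behind on the migration tail: still passes the
    # removed keyword, so construction fails at call time, not import time.
    return Metadata(id="doc-1", metadata=[])


try:
    construct_stale()
    stale_error = None
except TypeError as e:
    stale_error = e

# The migrated call shape constructs cleanly.
ok = EntityEmbeddings(entity="http://example.org/e1", vector=[0.1, 0.2])
```

Because the error is only raised when the constructor actually runs, a code path that no test or client exercises can carry it unnoticed, which is consistent with the bug staying latent until the first websocket import.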

Production fixes (5 call sites on the same migration tail):

  * trustgraph-flow gateway dispatchers entity_contexts_import.py
    and graph_embeddings_import.py — drop the stale
    Metadata(metadata=...) kwarg; switch graph_embeddings_import
    to the singular `vector` wire key.
  * trustgraph-base messaging translators knowledge.py and
    document_loading.py — fix decode side to read the singular
    `"vector"` key, matching what their own encode sides have
    always written.
  * trustgraph-flow tables/knowledge.py — fix Cassandra row
    deserialiser to construct EntityEmbeddings(vector=...)
    instead of vectors=.
  * trustgraph-flow gateway core_import/core_export — switch the
    kg-core msgpack wire format to the singular `"v"`/`"vector"`
    key and drop the dead `m["m"]` envelope field that referenced
    the removed Metadata.metadata triples list (it was a
    guaranteed KeyError on the export side).
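
As a rough illustration of the decode-side fixes listed above, the sketch below shows an encoder that writes the singular `"vector"` key and two decoders, one reading the matching key and one reading a stale plural key. The function names, the `"entity"` key, and the dataclass are assumptions for the example; the real translators and the msgpack envelope are not reproduced here.

```python
from dataclasses import dataclass, field


@dataclass
class EntityEmbeddings:
    # Hypothetical stand-in for the real schema class.
    entity: str = ""
    vector: list[float] = field(default_factory=list)


def encode(e: EntityEmbeddings) -> dict:
    # Encode side: writes the singular key.
    return {"entity": e.entity, "vector": e.vector}


def decode(d: dict) -> EntityEmbeddings:
    # Fixed decode side: reads the same singular key the encoder writes.
    return EntityEmbeddings(entity=d["entity"], vector=d["vector"])


def decode_stale(d: dict) -> EntityEmbeddings:
    # Pre-fix shape: reads a plural key that the encoder never emits.
    return EntityEmbeddings(entity=d["entity"], vector=d["vectors"])


original = EntityEmbeddings("http://example.org/e1", [0.1, 0.2, 0.3])
wire = encode(original)
restored = decode(wire)

try:
    decode_stale(wire)
    stale_failed = False
except KeyError:
    stale_failed = True
```

An encode side and decode side that disagree on a key name only fail when a message makes the full trip, which is why a round-trip check is a natural guard for this class of bug.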

Defense-in-depth regression coverage (32 new tests across 7 files):

  * tests/contract/test_schema_field_contracts.py — pin the field
    set of Metadata, EntityEmbeddings, ChunkEmbeddings,
    EntityContext so any future schema rename fails CI loudly
    with a clear diff.
  * tests/unit/test_translators/test_knowledge_translator_roundtrip.py
    and test_document_embeddings_translator_roundtrip.py -
    encode→decode round-trip the affected translators end to end,
    locking in the singular `"vector"` wire key.
  * tests/unit/test_gateway/test_entity_contexts_import_dispatcher.py
    and test_graph_embeddings_import_dispatcher.py — exercise the
    websocket dispatchers' receive() path with realistic
    payloads, the direct regression test for the original
    production crash.
  * tests/unit/test_gateway/test_core_import_export_roundtrip.py
    — pack/unpack the kg-core msgpack format through the real
    dispatcher classes (with KnowledgeRequestor mocked),
    including a full export→import round-trip.
  * tests/unit/test_tables/test_knowledge_table_store.py —
    exercise the Cassandra row → schema conversion via __new__ to
    bypass the live cluster connection.
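
Two of the techniques in the list above, pinning a schema's field set and using `__new__` to skip a constructor that would open a connection, might look roughly like this. `KnowledgeTableStore`, `row_to_embeddings`, and the row layout are hypothetical names invented for the sketch, not the actual trustgraph classes.

```python
from dataclasses import dataclass, field, fields


@dataclass
class EntityEmbeddings:
    # Hypothetical stand-in for the real schema class.
    entity: str = ""
    vector: list[float] = field(default_factory=list)


def field_names(cls) -> set:
    # Collect the declared dataclass field names for comparison.
    return {f.name for f in fields(cls)}


def test_entity_embeddings_fields():
    # Contract test: any rename (e.g. vector -> vectors) fails here with
    # a set diff, instead of surfacing later at some distant call site.
    assert field_names(EntityEmbeddings) == {"entity", "vector"}


class KnowledgeTableStore:
    # Hypothetical store whose constructor needs a live cluster.
    def __init__(self, host: str):
        raise ConnectionError("would connect to a live cluster")

    def row_to_embeddings(self, row) -> EntityEmbeddings:
        # Pure row -> schema conversion; needs no connection state.
        return EntityEmbeddings(entity=row[0], vector=list(row[1]))


# __new__ allocates the instance without running __init__, so the
# conversion logic can be exercised with no cluster available.
store = KnowledgeTableStore.__new__(KnowledgeTableStore)
converted = store.row_to_embeddings(("http://example.org/e1", (0.5, 0.25)))
```

The `__new__` approach only works for methods that do not touch attributes set in `__init__`; a method that does would need those attributes assigned by hand on the bare instance.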

Also fixes an unrelated leaked-coroutine RuntimeWarning in
test_gateway/test_service.py::test_run_method_calls_web_run_app: the
mocked aiohttp.web.run_app now closes the coroutine that Api.run() hands
it, mirroring what the real run_app would do, instead of leaving it for
the GC to complain about.
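
A minimal sketch of the coroutine-closing mock described above, assuming a simplified `Api` that hands a coroutine to an injected `run_app` callable. The `Api` class, `app_factory`, and the port value are stand-ins for illustration; aiohttp itself is not imported.

```python
import inspect
from unittest.mock import MagicMock


async def app_factory():
    # Stand-in for the coroutine that builds the web application.
    return "app"


class Api:
    # Hypothetical, simplified version of the class under test.
    def __init__(self, run_app):
        self.run_app = run_app

    def run(self):
        # Hands an un-awaited coroutine object to run_app.
        self.run_app(app_factory(), port=8088)


def close_coroutine(coro, **kwargs):
    # A plain mock would drop the coroutine un-awaited, and Python would
    # emit "coroutine ... was never awaited" when it is collected.
    # Closing it explicitly marks it finished and avoids that warning.
    coro.close()


run_app = MagicMock(side_effect=close_coroutine)
Api(run_app).run()

# Inspect what the mock received and confirm it was closed.
passed_coro = run_app.call_args.args[0]
state = inspect.getcoroutinestate(passed_coro)
```

Using `side_effect` keeps the usual mock assertions (`assert_called_once`, `call_args`) available while still disposing of the coroutine.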
2026-04-10 20:42:59 +01:00
agent Forward missing explain_triples through RAG clients and agent tool callback (#768) 2026-04-08 11:41:17 +01:00
chunking RabbitMQ pub/sub backend with topic exchange architecture (#752) 2026-04-02 12:47:16 +01:00
config/service Config push notify pattern: replace stateful pub/sub with signal+ fetch (#760) 2026-04-06 16:57:27 +01:00
cores Subscriber resilience and RabbitMQ fixes (#765) 2026-04-07 14:51:14 +01:00
decoding RabbitMQ pub/sub backend with topic exchange architecture (#752) 2026-04-02 12:47:16 +01:00
direct Fix Cassandra schema and graph filter semantics (#680) 2026-03-10 12:52:51 +00:00
embeddings Config push notify pattern: replace stateful pub/sub with signal+ fetch (#760) 2026-04-06 16:57:27 +01:00
external Implement logging strategy (#444) 2025-07-30 23:18:38 +01:00
extract Config push notify pattern: replace stateful pub/sub with signal+ fetch (#760) 2026-04-06 16:57:27 +01:00
gateway Fix Metadata/EntityEmbeddings schema migration tail and add regression tests (#776) 2026-04-10 20:42:59 +01:00
librarian Subscriber resilience and RabbitMQ fixes (#765) 2026-04-07 14:51:14 +01:00
metering Subscriber resilience and RabbitMQ fixes (#765) 2026-04-07 14:51:14 +01:00
model Feat: Auto-pull missing Ollama models (#757) 2026-04-06 11:10:53 +01:00
processing Fix/startup failure (#445) 2025-07-30 23:42:11 +01:00
prompt Config push notify pattern: replace stateful pub/sub with signal+ fetch (#760) 2026-04-06 16:57:27 +01:00
query Config push notify pattern: replace stateful pub/sub with signal+ fetch (#760) 2026-04-06 16:57:27 +01:00
retrieval fix: preserve literal types in focus quoted triples and document tracing (#769) 2026-04-08 13:37:02 +01:00
rev_gateway Messaging fabric plugins (#592) 2025-12-17 21:40:43 +00:00
storage Config push notify pattern: replace stateful pub/sub with signal+ fetch (#760) 2026-04-06 16:57:27 +01:00
tables Fix Metadata/EntityEmbeddings schema migration tail and add regression tests (#776) 2026-04-10 20:42:59 +01:00
template Feature/prompts jsonl (#619) 2026-01-26 17:38:00 +00:00
tool_service Tool services - dynamically pluggable tool implementations for agent frameworks (#658) 2026-03-04 14:51:32 +00:00
__init__.py Feature/pkgsplit (#83) 2024-09-30 19:36:09 +01:00