trustgraph/tests/unit/test_knowledge_graph
cybermaggedon 64e3f6bd0d
Subgraph provenance (#694)
Replace per-triple provenance reification with subgraph model

Extraction provenance previously created a full reification (statement
URI, activity, agent) for every single extracted triple, producing ~13
provenance triples per knowledge triple.  Since each chunk is processed
by a single LLM call, this was both redundant and semantically
inaccurate.

Now one subgraph object is created per chunk extraction, with
tg:contains linking to each extracted triple.  For 20 extractions from
a chunk this reduces provenance from ~260 triples to ~33.
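
The arithmetic above can be sketched in a few lines of Python. This is an illustration only, not the project's subgraph_provenance_triples() implementation. The function names, the tuple representation of triples, and the fixed overhead of 13 triples per subgraph are assumptions, chosen because they reproduce the ~260 and ~33 figures quoted in this message. Only the tg:contains predicate is taken from the commit itself.

```python
from typing import List, Tuple, Union

# A knowledge triple as plain strings (assumed representation).
Triple = Tuple[str, str, str]
# A provenance statement whose object may be a whole triple.
Statement = Tuple[str, str, Union[str, Triple]]


def per_triple_count(n_triples: int, overhead_per_triple: int = 13) -> int:
    """Old model: a full reification (~13 triples) for every extracted triple."""
    return n_triples * overhead_per_triple


def subgraph_count(n_triples: int, fixed_overhead: int = 13) -> int:
    """New model: one subgraph description per chunk (fixed_overhead is an
    assumed figure) plus one tg:contains link per extracted triple."""
    return fixed_overhead + n_triples


def sketch_subgraph_links(subgraph_uri: str,
                          extracted: List[Triple]) -> List[Statement]:
    """Hypothetical helper: emit one tg:contains statement per extracted triple."""
    return [(subgraph_uri, "tg:contains", t) for t in extracted]


if __name__ == "__main__":
    # 20 made-up triples standing in for one chunk's extraction output.
    triples = [(f"urn:e:{i}", "urn:p:rel", f"urn:e:{i + 1}") for i in range(20)]
    links = sketch_subgraph_links("urn:subgraph:chunk-1", triples)
    print(len(links))                                   # 20
    print(per_triple_count(20), subgraph_count(20))     # 260 33
```

Under these assumptions the per-chunk description is paid once, so provenance grows by one triple per extraction rather than by about thirteen.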

- Rename tg:reifies -> tg:contains, stmt_uri -> subgraph_uri
- Replace triple_provenance_triples() with subgraph_provenance_triples()
- Refactor kg-extract-definitions and kg-extract-relationships to
  generate provenance once per chunk instead of per triple
- Add subgraph provenance to kg-extract-ontology and kg-extract-agent
  (previously had none)
- Update CLI tools and tech specs to match

Also rename tg-show-document-hierarchy to tg-show-extraction-provenance.

Add extra typing for extraction provenance and fix the extraction
provenance CLI.
2026-03-13 11:37:59 +00:00
__init__.py Extending test coverage (#434) 2025-07-14 17:54:04 +01:00
conftest.py The id field in pipeline Metadata was being overwritten at each processing (#686) 2026-03-11 12:16:39 +00:00
test_agent_extraction.py Subgraph provenance (#694) 2026-03-13 11:37:59 +00:00
test_agent_extraction_edge_cases.py Subgraph provenance (#694) 2026-03-13 11:37:59 +00:00
test_entity_extraction.py Extending test coverage (#434) 2025-07-14 17:54:04 +01:00
test_graph_validation.py Changed schema for Value -> Term, majorly breaking change (#622) 2026-01-27 13:48:08 +00:00
test_object_extraction_logic.py Remove redundant metadata (#685) 2026-03-11 10:51:39 +00:00
test_object_validation.py Structured data 2 (#645) 2026-02-23 15:56:29 +00:00
test_relationship_extraction.py Extending test coverage (#434) 2025-07-14 17:54:04 +01:00
test_triple_construction.py Remove redundant metadata (#685) 2026-03-11 10:51:39 +00:00