trustgraph

mirror of https://github.com/trustgraph-ai/trustgraph.git synced 2026-07-25 21:21:02 +02:00

cybermaggedon 64e3f6bd0d Subgraph provenance (#694 ) Replace per-triple provenance reification with subgraph model Extraction provenance previously created a full reification (statement URI, activity, agent) for every single extracted triple, producing ~13 provenance triples per knowledge triple. Since each chunk is processed by a single LLM call, this was both redundant and semantically inaccurate. Now one subgraph object is created per chunk extraction, with tg:contains linking to each extracted triple. For 20 extractions from a chunk this reduces provenance from ~260 triples to ~33. - Rename tg:reifies -> tg:contains, stmt_uri -> subgraph_uri - Replace triple_provenance_triples() with subgraph_provenance_triples() - Refactor kg-extract-definitions and kg-extract-relationships to generate provenance once per chunk instead of per triple - Add subgraph provenance to kg-extract-ontology and kg-extract-agent (previously had none) - Update CLI tools and tech specs to match Also rename tg-show-document-hierarchy to tg-show-extraction-provenance. Added extra typing for extraction provenance, fixed extraction prov CLI		2026-03-13 11:37:59 +00:00
..
__init__.py	Extending test coverage (#434 )	2025-07-14 17:54:04 +01:00
conftest.py	The id field in pipeline Metadata was being overwritten at each processing (#686 )	2026-03-11 12:16:39 +00:00
test_agent_extraction.py	Subgraph provenance (#694 )	2026-03-13 11:37:59 +00:00
test_agent_extraction_edge_cases.py	Subgraph provenance (#694 )	2026-03-13 11:37:59 +00:00
test_entity_extraction.py	Extending test coverage (#434 )	2025-07-14 17:54:04 +01:00
test_graph_validation.py	Changed schema for Value -> Term, majorly breaking change (#622 )	2026-01-27 13:48:08 +00:00
test_object_extraction_logic.py	Remove redundant metadata (#685 )	2026-03-11 10:51:39 +00:00
test_object_validation.py	Structured data 2 (#645 )	2026-02-23 15:56:29 +00:00
test_relationship_extraction.py	Extending test coverage (#434 )	2025-07-14 17:54:04 +01:00
test_triple_construction.py	Remove redundant metadata (#685 )	2026-03-11 10:51:39 +00:00