Enhance retrieval pipelines: 4-stage GraphRAG, DocRAG grounding (#697)

mirror of https://github.com/trustgraph-ai/trustgraph.git synced 2026-05-17 19:35:13 +02:00

Enhance retrieval pipelines: 4-stage GraphRAG, DocRAG grounding,
consistent PROV-O

GraphRAG:
- Split retrieval into 4 prompt stages: extract-concepts,
  kg-edge-scoring,
  kg-edge-reasoning, kg-synthesis (was single-stage)
- Add concept extraction (grounding) for per-concept embedding
- Filter main query to default graph, ignoring
  provenance/explainability edges
- Add source document edges to knowledge graph

DocumentRAG:
- Add grounding step with concept extraction, matching GraphRAG's
  pattern:
  Question → Grounding → Exploration → Synthesis
- Per-concept embedding and chunk retrieval with deduplication

Cross-pipeline:
- Make PROV-O derivation links consistent: wasGeneratedBy for first
  entity from Activity, wasDerivedFrom for entity-to-entity chains
- Update CLIs (tg-invoke-agent, tg-invoke-graph-rag,
  tg-invoke-document-rag) for new explainability structure
- Fix all affected unit and integration tests

This commit is contained in:

cybermaggedon

2026-03-16 12:12:13 +00:00

• committed by

GitHub

parent 29b4300808

commit a115ec06ab

No known key found for this signature in database

GPG key ID: B5690EEEBB952194

25 changed files with 1537 additions and 1008 deletions

									
										1

trustgraph-base/trustgraph/schema/services/retrieval.py
									
										View file
										
				@ -15,6 +15,7 @@ class GraphRagQuery:

				    triple_limit: int = 0

				    max_subgraph_size: int = 0

				    max_path_length: int = 0

				    edge_limit: int = 0

				    streaming: bool = False

				@dataclass

Rows
Columns

Enhance retrieval pipelines: 4-stage GraphRAG, DocRAG grounding (#697)

1 trustgraph-base/trustgraph/schema/services/retrieval.py Unescape Escape View file

1

trustgraph-base/trustgraph/schema/services/retrieval.py

View file