Adding explainability to the ReACT agent (#689)

* Added tech spec * Add provenance recording to React agent loop Enables agent sessions to be traced and debugged using the same explainability infrastructure as GraphRAG. Agent traces record: - Session start with query and timestamp - Each iteration's thought, action, arguments, and observation - Final answer with derivation chain Changes: - Add session_id and collection fields to AgentRequest schema - Add agent predicates (TG_THOUGHT, TG_ACTION, etc.) to namespaces - Create agent provenance triple generators in provenance/agent.py - Register explainability producer in agent service - Emit provenance triples during agent execution - Update CLI tools to detect and render agent traces alongside GraphRAG * Updated explainability taxonomy: GraphRAG: tg:Question → tg:Exploration → tg:Focus → tg:Synthesis Agent: tg:Question → tg:Analysis(s) → tg:Conclusion All entities also have their PROV-O type (prov:Activity or prov:Entity). Updated commit message: Add provenance recording to React agent loop Enables agent sessions to be traced and debugged using the same explainability infrastructure as GraphRAG. Entity types follow human reasoning patterns: - tg:Question - the user's query (shared with GraphRAG) - tg:Analysis - each think/act/observe cycle - tg:Conclusion - the final answer Also adds explicit TG types to GraphRAG entities: - tg:Question, tg:Exploration, tg:Focus, tg:Synthesis All types retain their PROV-O base types (prov:Activity, prov:Entity). Changes: - Add session_id and collection fields to AgentRequest schema - Add explainability entity types to namespaces.py - Create agent provenance triple generators - Register explainability producer in agent service - Emit provenance triples during agent execution - Update CLI tools to detect and render both trace types * Document RAG explainability is now complete. Here's a summary of the changes made: Schema Changes: - trustgraph-base/trustgraph/schema/services/retrieval.py: Added explain_id and explain_graph fields to DocumentRagResponse - trustgraph-base/trustgraph/messaging/translators/retrieval.py: Updated translator to handle explainability fields Provenance Changes: - trustgraph-base/trustgraph/provenance/namespaces.py: Added TG_CHUNK_COUNT and TG_SELECTED_CHUNK predicates - trustgraph-base/trustgraph/provenance/uris.py: Added docrag_question_uri, docrag_exploration_uri, docrag_synthesis_uri generators - trustgraph-base/trustgraph/provenance/triples.py: Added docrag_question_triples, docrag_exploration_triples, docrag_synthesis_triples builders - trustgraph-base/trustgraph/provenance/__init__.py: Exported all new Document RAG functions and predicates Service Changes: - trustgraph-flow/trustgraph/retrieval/document_rag/document_rag.py: Added explainability callback support and triple emission at each phase (Question → Exploration → Synthesis) - trustgraph-flow/trustgraph/retrieval/document_rag/rag.py: Registered explainability producer and wired up the callback Documentation: - docs/tech-specs/agent-explainability.md: Added Document RAG entity types and provenance model documentation Document RAG Provenance Model: Question (urn:trustgraph:docrag:{uuid}) │ │ tg:query, prov:startedAtTime │ rdf:type = prov:Activity, tg:Question │ ↓ prov:wasGeneratedBy │ Exploration (urn:trustgraph:docrag:{uuid}/exploration) │ │ tg:chunkCount, tg:selectedChunk (multiple) │ rdf:type = prov:Entity, tg:Exploration │ ↓ prov:wasDerivedFrom │ Synthesis (urn:trustgraph:docrag:{uuid}/synthesis) │ │ tg:content = "The answer..." │ rdf:type = prov:Entity, tg:Synthesis * Specific subtype that makes the retrieval mechanism immediately obvious: System: GraphRAG TG Types on Question: tg:Question, tg:GraphRagQuestion URI Pattern: urn:trustgraph:question:{uuid} ──────────────────────────────────────── System: Document RAG TG Types on Question: tg:Question, tg:DocRagQuestion URI Pattern: urn:trustgraph:docrag:{uuid} ──────────────────────────────────────── System: Agent TG Types on Question: tg:Question, tg:AgentQuestion URI Pattern: urn:trustgraph:agent:{uuid} Files modified: - trustgraph-base/trustgraph/provenance/namespaces.py - Added TG_GRAPH_RAG_QUESTION, TG_DOC_RAG_QUESTION, TG_AGENT_QUESTION - trustgraph-base/trustgraph/provenance/triples.py - Added subtype to question_triples and docrag_question_triples - trustgraph-base/trustgraph/provenance/agent.py - Added subtype to agent_session_triples - trustgraph-base/trustgraph/provenance/__init__.py - Exported new types - docs/tech-specs/agent-explainability.md - Documented the subtypes This allows: - Query all questions: ?q rdf:type tg:Question - Query only GraphRAG: ?q rdf:type tg:GraphRagQuestion - Query only Document RAG: ?q rdf:type tg:DocRagQuestion - Query only Agent: ?q rdf:type tg:AgentQuestion * Fixed tests
2026-04-25 00:16:23 +02:00 · 2026-03-11 15:28:15 +00:00 · 2026-03-11 15:28:15 +00:00 · 312174eb88
commit 312174eb88
parent a53ed41da2
17 changed files with 1269 additions and 44 deletions
--- a/docs/tech-specs/agent-explainability.md
+++ b/docs/tech-specs/agent-explainability.md
@ -0,0 +1,272 @@
 # Agent Explainability: Provenance Recording
 ## Overview
 Add provenance recording to the React agent loop so agent sessions can be traced and debugged using the same explainability infrastructure as GraphRAG.
 **Design Decisions:**
 - Write to `urn:graph:retrieval` (generic explainability graph)
 - Linear dependency chain for now (analysis N → wasDerivedFrom → analysis N-1)
 - Tools are opaque black boxes (record input/output only)
 - DAG support deferred to future iteration
 ## Entity Types
 Both GraphRAG and Agent use PROV-O as the base ontology with TrustGraph-specific subtypes:
 ### GraphRAG Types
 | Entity | PROV-O Type | TG Types | Description |
 |--------|-------------|----------|-------------|
 | Question | `prov:Activity` | `tg:Question`, `tg:GraphRagQuestion` | The user's query |
 | Exploration | `prov:Entity` | `tg:Exploration` | Edges retrieved from knowledge graph |
 | Focus | `prov:Entity` | `tg:Focus` | Selected edges with reasoning |
 | Synthesis | `prov:Entity` | `tg:Synthesis` | Final answer |
 ### Agent Types
 | Entity | PROV-O Type | TG Types | Description |
 |--------|-------------|----------|-------------|
 | Question | `prov:Activity` | `tg:Question`, `tg:AgentQuestion` | The user's query |
 | Analysis | `prov:Entity` | `tg:Analysis` | Each think/act/observe cycle |
 | Conclusion | `prov:Entity` | `tg:Conclusion` | Final answer |
 ### Document RAG Types
 | Entity | PROV-O Type | TG Types | Description |
 |--------|-------------|----------|-------------|
 | Question | `prov:Activity` | `tg:Question`, `tg:DocRagQuestion` | The user's query |
 | Exploration | `prov:Entity` | `tg:Exploration` | Chunks retrieved from document store |
 | Synthesis | `prov:Entity` | `tg:Synthesis` | Final answer |
 **Note:** Document RAG uses a subset of GraphRAG's types (no Focus step since there's no edge selection/reasoning phase).
 ### Question Subtypes
 All Question entities share `tg:Question` as a base type but have a specific subtype to identify the retrieval mechanism:
 | Subtype | URI Pattern | Mechanism |
 |---------|-------------|-----------|
 | `tg:GraphRagQuestion` | `urn:trustgraph:question:{uuid}` | Knowledge graph RAG |
 | `tg:DocRagQuestion` | `urn:trustgraph:docrag:{uuid}` | Document/chunk RAG |
 | `tg:AgentQuestion` | `urn:trustgraph:agent:{uuid}` | ReAct agent |
 This allows querying all questions via `tg:Question` while filtering by specific mechanism via the subtype.
 ## Provenance Model
 ```
 Question (urn:trustgraph:agent:{uuid})
    │
    │  tg:query = "User's question"
    │  prov:startedAtTime = timestamp
    │  rdf:type = prov:Activity, tg:Question
    │
    ↓ prov:wasDerivedFrom
    │
 Analysis1 (urn:trustgraph:agent:{uuid}/i1)
    │
    │  tg:thought = "I need to query the knowledge base..."
    │  tg:action = "knowledge-query"
    │  tg:arguments = {"question": "..."}
    │  tg:observation = "Result from tool..."
    │  rdf:type = prov:Entity, tg:Analysis
    │
    ↓ prov:wasDerivedFrom
    │
 Analysis2 (urn:trustgraph:agent:{uuid}/i2)
    │  ...
    ↓ prov:wasDerivedFrom
    │
 Conclusion (urn:trustgraph:agent:{uuid}/final)
    │
    │  tg:answer = "The final response..."
    │  rdf:type = prov:Entity, tg:Conclusion
 ```
 ### Document RAG Provenance Model
 ```
 Question (urn:trustgraph:docrag:{uuid})
    │
    │  tg:query = "User's question"
    │  prov:startedAtTime = timestamp
    │  rdf:type = prov:Activity, tg:Question
    │
    ↓ prov:wasGeneratedBy
    │
 Exploration (urn:trustgraph:docrag:{uuid}/exploration)
    │
    │  tg:chunkCount = 5
    │  tg:selectedChunk = "chunk-id-1"
    │  tg:selectedChunk = "chunk-id-2"
    │  ...
    │  rdf:type = prov:Entity, tg:Exploration
    │
    ↓ prov:wasDerivedFrom
    │
 Synthesis (urn:trustgraph:docrag:{uuid}/synthesis)
    │
    │  tg:content = "The synthesized answer..."
    │  rdf:type = prov:Entity, tg:Synthesis
 ```
 ## Changes Required
 ### 1. Schema Changes
 **File:** `trustgraph-base/trustgraph/schema/services/agent.py`
 Add `session_id` and `collection` fields to `AgentRequest`:
 ```python
@dataclass
 class AgentRequest:
    question: str = ""
    state: str = ""
    group: list[str] | None = None
    history: list[AgentStep] = field(default_factory=list)
    user: str = ""
    collection: str = "default"  # NEW: Collection for provenance traces
    streaming: bool = False
    session_id: str = ""         # NEW: For provenance tracking across iterations
 ```
 **File:** `trustgraph-base/trustgraph/messaging/translators/agent.py`
 Update translator to handle `session_id` and `collection` in both `to_pulsar()` and `from_pulsar()`.
 ### 2. Add Explainability Producer to Agent Service
 **File:** `trustgraph-flow/trustgraph/agent/react/service.py`
 Register an "explainability" producer (same pattern as GraphRAG):
 ```python
 from ... base import ProducerSpec
 from ... schema import Triples
 # In __init__:
 self.register_specification(
    ProducerSpec(
        name = "explainability",
        schema = Triples,
    )
 )
 ```
 ### 3. Provenance Triple Generation
 **File:** `trustgraph-base/trustgraph/provenance/agent.py`
 Create helper functions (similar to GraphRAG's `question_triples`, `exploration_triples`, etc.):
 ```python
 def agent_session_triples(session_uri, query, timestamp):
    """Generate triples for agent Question."""
    return [
        Triple(s=session_uri, p=RDF_TYPE, o=PROV_ACTIVITY),
        Triple(s=session_uri, p=RDF_TYPE, o=TG_QUESTION),
        Triple(s=session_uri, p=TG_QUERY, o=query),
        Triple(s=session_uri, p=PROV_STARTED_AT_TIME, o=timestamp),
    ]
 def agent_iteration_triples(iteration_uri, parent_uri, thought, action, arguments, observation):
    """Generate triples for one Analysis step."""
    return [
        Triple(s=iteration_uri, p=RDF_TYPE, o=PROV_ENTITY),
        Triple(s=iteration_uri, p=RDF_TYPE, o=TG_ANALYSIS),
        Triple(s=iteration_uri, p=TG_THOUGHT, o=thought),
        Triple(s=iteration_uri, p=TG_ACTION, o=action),
        Triple(s=iteration_uri, p=TG_ARGUMENTS, o=json.dumps(arguments)),
        Triple(s=iteration_uri, p=TG_OBSERVATION, o=observation),
        Triple(s=iteration_uri, p=PROV_WAS_DERIVED_FROM, o=parent_uri),
    ]
 def agent_final_triples(final_uri, parent_uri, answer):
    """Generate triples for Conclusion."""
    return [
        Triple(s=final_uri, p=RDF_TYPE, o=PROV_ENTITY),
        Triple(s=final_uri, p=RDF_TYPE, o=TG_CONCLUSION),
        Triple(s=final_uri, p=TG_ANSWER, o=answer),
        Triple(s=final_uri, p=PROV_WAS_DERIVED_FROM, o=parent_uri),
    ]
 ```
 ### 4. Type Definitions
 **File:** `trustgraph-base/trustgraph/provenance/namespaces.py`
 Add explainability entity types and agent predicates:
 ```python
 # Explainability entity types (used by both GraphRAG and Agent)
 TG_QUESTION = TG + "Question"
 TG_EXPLORATION = TG + "Exploration"
 TG_FOCUS = TG + "Focus"
 TG_SYNTHESIS = TG + "Synthesis"
 TG_ANALYSIS = TG + "Analysis"
 TG_CONCLUSION = TG + "Conclusion"
 # Agent predicates
 TG_THOUGHT = TG + "thought"
 TG_ACTION = TG + "action"
 TG_ARGUMENTS = TG + "arguments"
 TG_OBSERVATION = TG + "observation"
 TG_ANSWER = TG + "answer"
 ```
 ## Files Modified
 | File | Change |
 |------|--------|
 | `trustgraph-base/trustgraph/schema/services/agent.py` | Add session_id and collection to AgentRequest |
 | `trustgraph-base/trustgraph/messaging/translators/agent.py` | Update translator for new fields |
 | `trustgraph-base/trustgraph/provenance/namespaces.py` | Add entity types, agent predicates, and Document RAG predicates |
 | `trustgraph-base/trustgraph/provenance/triples.py` | Add TG types to GraphRAG triple builders, add Document RAG triple builders |
 | `trustgraph-base/trustgraph/provenance/uris.py` | Add Document RAG URI generators |
 | `trustgraph-base/trustgraph/provenance/__init__.py` | Export new types, predicates, and Document RAG functions |
 | `trustgraph-base/trustgraph/schema/services/retrieval.py` | Add explain_id and explain_graph to DocumentRagResponse |
 | `trustgraph-base/trustgraph/messaging/translators/retrieval.py` | Update DocumentRagResponseTranslator for explainability fields |
 | `trustgraph-flow/trustgraph/agent/react/service.py` | Add explainability producer + recording logic |
 | `trustgraph-flow/trustgraph/retrieval/document_rag/document_rag.py` | Add explainability callback and emit provenance triples |
 | `trustgraph-flow/trustgraph/retrieval/document_rag/rag.py` | Add explainability producer and wire up callback |
 | `trustgraph-cli/trustgraph/cli/show_explain_trace.py` | Handle agent trace types |
 | `trustgraph-cli/trustgraph/cli/list_explain_traces.py` | List agent sessions alongside GraphRAG |
 ## Files Created
 | File | Purpose |
 |------|---------|
 | `trustgraph-base/trustgraph/provenance/agent.py` | Agent-specific triple generators |
 ## CLI Updates
 **Detection:** Both GraphRAG and Agent Questions have `tg:Question` type. Distinguished by:
 1. URI pattern: `urn:trustgraph:agent:` vs `urn:trustgraph:question:`
 2. Derived entities: `tg:Analysis` (agent) vs `tg:Exploration` (GraphRAG)
 **`list_explain_traces.py`:**
 - Shows Type column (Agent vs GraphRAG)
 **`show_explain_trace.py`:**
 - Auto-detects trace type
 - Agent rendering shows: Question → Analysis step(s) → Conclusion
 ## Backwards Compatibility
 - `session_id` defaults to `""` - old requests work, just won't have provenance
 - `collection` defaults to `"default"` - reasonable fallback
 - CLI gracefully handles both trace types
 ## Verification
 ```bash
 # Run an agent query
 tg-invoke-agent -q "What is the capital of France?"
 # List traces (should show agent sessions with Type column)
 tg-list-explain-traces -U trustgraph -C default
 # Show agent trace
 tg-show-explain-trace "urn:trustgraph:agent:xxx"
 ```
 ## Future Work (Not This PR)
 - DAG dependencies (when analysis N uses results from multiple prior analyses)
 - Tool-specific provenance linking (KnowledgeQuery → its GraphRAG trace)
 - Streaming provenance emission (emit as we go, not batch at end)
--- a/tests/unit/test_retrieval/test_document_rag.py
+++ b/tests/unit/test_retrieval/test_document_rag.py
@ -208,9 +208,12 @@ class TestQuery:
            collection="test_collection"
        )
-        # Verify result is list of fetched document content
+        # Verify result is tuple of (docs, chunk_ids)
-        assert "Document 1 content" in result
+        docs, chunk_ids = result
-        assert "Document 2 content" in result
+        assert "Document 1 content" in docs
        assert "Document 2 content" in docs
        assert "doc/c1" in chunk_ids
        assert "doc/c2" in chunk_ids
    @pytest.mark.asyncio
    async def test_document_rag_query_method(self, mock_fetch_chunk):
@ -350,8 +353,10 @@ class TestQuery:
        mock_embeddings_client.embed.assert_called_once_with(["verbose test"])
        mock_doc_embeddings_client.query.assert_called_once()
-        # Verify result contains fetched content
+        # Verify result is tuple of (docs, chunk_ids) with fetched content
-        assert "Verbose test doc" in result
+        docs, chunk_ids = result
        assert "Verbose test doc" in docs
        assert "doc/c6" in chunk_ids
    @pytest.mark.asyncio
    async def test_document_rag_query_with_verbose(self, mock_fetch_chunk):
@ -426,8 +431,8 @@ class TestQuery:
        mock_embeddings_client.embed.assert_called_once_with(["query with no results"])
        mock_doc_embeddings_client.query.assert_called_once()
-        # Verify empty result is returned
+        # Verify empty result is returned (tuple of empty lists)
-        assert result == []
+        assert result == ([], [])
    @pytest.mark.asyncio
    async def test_document_rag_query_with_empty_documents(self, mock_fetch_chunk):
--- a/tests/unit/test_retrieval/test_document_rag_service.py
+++ b/tests/unit/test_retrieval/test_document_rag_service.py
@ -5,7 +5,7 @@ passed to the DocumentRag.query() method.
 """
 import pytest
-from unittest.mock import MagicMock, AsyncMock, patch
+from unittest.mock import MagicMock, AsyncMock, patch, ANY
 from trustgraph.retrieval.document_rag.rag import Processor
 from trustgraph.schema import DocumentRagQuery, DocumentRagResponse
@ -65,8 +65,9 @@ class TestDocumentRagService:
        mock_rag_instance.query.assert_called_once_with(
            "test query",
            user="my_user",           # Must be from message, not hardcoded default
-            collection="test_coll_1", # Must be from message, not hardcoded default  
+            collection="test_coll_1", # Must be from message, not hardcoded default
-            doc_limit=5
+            doc_limit=5,
            explain_callback=ANY,     # Explainability callback is always passed
        )
        # Verify response was sent
--- a/trustgraph-base/trustgraph/messaging/translators/agent.py
+++ b/trustgraph-base/trustgraph/messaging/translators/agent.py
@ -13,7 +13,9 @@ class AgentRequestTranslator(MessageTranslator):
            group=data.get("group", None),
            history=data.get("history", []),
            user=data.get("user", "trustgraph"),
-            streaming=data.get("streaming", False)
+            collection=data.get("collection", "default"),
            streaming=data.get("streaming", False),
            session_id=data.get("session_id", ""),
        )
    def from_pulsar(self, obj: AgentRequest) -> Dict[str, Any]:
@ -23,7 +25,9 @@ class AgentRequestTranslator(MessageTranslator):
            "group": obj.group,
            "history": obj.history,
            "user": obj.user,
-            "streaming": getattr(obj, "streaming", False)
+            "collection": getattr(obj, "collection", "default"),
            "streaming": getattr(obj, "streaming", False),
            "session_id": getattr(obj, "session_id", ""),
        }
--- a/trustgraph-base/trustgraph/messaging/translators/retrieval.py
+++ b/trustgraph-base/trustgraph/messaging/translators/retrieval.py
@ -38,6 +38,16 @@ class DocumentRagResponseTranslator(MessageTranslator):
        if obj.response is not None:
            result["response"] = obj.response
        # Include explain_id for explain messages
        explain_id = getattr(obj, "explain_id", None)
        if explain_id:
            result["explain_id"] = explain_id
        # Include explain_graph for explain messages (named graph filter)
        explain_graph = getattr(obj, "explain_graph", None)
        if explain_graph is not None:
            result["explain_graph"] = explain_graph
        # Include end_of_stream flag
        result["end_of_stream"] = getattr(obj, "end_of_stream", False)
--- a/trustgraph-base/trustgraph/provenance/init.py
+++ b/trustgraph-base/trustgraph/provenance/init.py
@ -40,11 +40,19 @@ from . uris import (
    activity_uri,
    statement_uri,
    agent_uri,
-    # Query-time provenance URIs
+    # Query-time provenance URIs (GraphRAG)
    question_uri,
    exploration_uri,
    focus_uri,
    synthesis_uri,
    # Agent provenance URIs
    agent_session_uri,
    agent_iteration_uri,
    agent_final_uri,
    # Document RAG provenance URIs
    docrag_question_uri,
    docrag_exploration_uri,
    docrag_synthesis_uri,
 )
 # Namespace constants
@ -63,8 +71,17 @@ from . namespaces import (
    TG_CHUNK_SIZE, TG_CHUNK_OVERLAP, TG_COMPONENT_VERSION,
    TG_LLM_MODEL, TG_ONTOLOGY, TG_EMBEDDING_MODEL,
    TG_SOURCE_TEXT, TG_SOURCE_CHAR_OFFSET, TG_SOURCE_CHAR_LENGTH,
-    # Query-time provenance predicates
+    # Query-time provenance predicates (GraphRAG)
    TG_QUERY, TG_EDGE_COUNT, TG_SELECTED_EDGE, TG_REASONING, TG_CONTENT,
    # Query-time provenance predicates (DocumentRAG)
    TG_CHUNK_COUNT, TG_SELECTED_CHUNK,
    # Explainability entity types
    TG_QUESTION, TG_EXPLORATION, TG_FOCUS, TG_SYNTHESIS,
    TG_ANALYSIS, TG_CONCLUSION,
    # Question subtypes (to distinguish retrieval mechanism)
    TG_GRAPH_RAG_QUESTION, TG_DOC_RAG_QUESTION, TG_AGENT_QUESTION,
    # Agent provenance predicates
    TG_THOUGHT, TG_ACTION, TG_ARGUMENTS, TG_OBSERVATION, TG_ANSWER,
    # Named graphs
    GRAPH_DEFAULT, GRAPH_SOURCE, GRAPH_RETRIEVAL,
 )
@ -74,15 +91,26 @@ from . triples import (
    document_triples,
    derived_entity_triples,
    triple_provenance_triples,
-    # Query-time provenance triple builders
+    # Query-time provenance triple builders (GraphRAG)
    question_triples,
    exploration_triples,
    focus_triples,
    synthesis_triples,
    # Query-time provenance triple builders (DocumentRAG)
    docrag_question_triples,
    docrag_exploration_triples,
    docrag_synthesis_triples,
    # Utility
    set_graph,
 )
 # Agent provenance triple builders
 from . agent import (
    agent_session_triples,
    agent_iteration_triples,
    agent_final_triples,
 )
 # Vocabulary bootstrap
 from . vocabulary import (
    get_vocabulary_triples,
@ -107,6 +135,14 @@ __all__ = [
    "exploration_uri",
    "focus_uri",
    "synthesis_uri",
    # Agent provenance URIs
    "agent_session_uri",
    "agent_iteration_uri",
    "agent_final_uri",
    # Document RAG provenance URIs
    "docrag_question_uri",
    "docrag_exploration_uri",
    "docrag_synthesis_uri",
    # Namespaces
    "PROV", "PROV_ENTITY", "PROV_ACTIVITY", "PROV_AGENT",
    "PROV_WAS_DERIVED_FROM", "PROV_WAS_GENERATED_BY",
@ -118,19 +154,36 @@ __all__ = [
    "TG_CHUNK_SIZE", "TG_CHUNK_OVERLAP", "TG_COMPONENT_VERSION",
    "TG_LLM_MODEL", "TG_ONTOLOGY", "TG_EMBEDDING_MODEL",
    "TG_SOURCE_TEXT", "TG_SOURCE_CHAR_OFFSET", "TG_SOURCE_CHAR_LENGTH",
-    # Query-time provenance predicates
+    # Query-time provenance predicates (GraphRAG)
    "TG_QUERY", "TG_EDGE_COUNT", "TG_SELECTED_EDGE", "TG_REASONING", "TG_CONTENT",
    # Query-time provenance predicates (DocumentRAG)
    "TG_CHUNK_COUNT", "TG_SELECTED_CHUNK",
    # Explainability entity types
    "TG_QUESTION", "TG_EXPLORATION", "TG_FOCUS", "TG_SYNTHESIS",
    "TG_ANALYSIS", "TG_CONCLUSION",
    # Question subtypes
    "TG_GRAPH_RAG_QUESTION", "TG_DOC_RAG_QUESTION", "TG_AGENT_QUESTION",
    # Agent provenance predicates
    "TG_THOUGHT", "TG_ACTION", "TG_ARGUMENTS", "TG_OBSERVATION", "TG_ANSWER",
    # Named graphs
    "GRAPH_DEFAULT", "GRAPH_SOURCE", "GRAPH_RETRIEVAL",
    # Triple builders
    "document_triples",
    "derived_entity_triples",
    "triple_provenance_triples",
-    # Query-time provenance triple builders
+    # Query-time provenance triple builders (GraphRAG)
    "question_triples",
    "exploration_triples",
    "focus_triples",
    "synthesis_triples",
    # Query-time provenance triple builders (DocumentRAG)
    "docrag_question_triples",
    "docrag_exploration_triples",
    "docrag_synthesis_triples",
    # Agent provenance triple builders
    "agent_session_triples",
    "agent_iteration_triples",
    "agent_final_triples",
    # Utility
    "set_graph",
    # Vocabulary
--- a/trustgraph-base/trustgraph/provenance/agent.py
+++ b/trustgraph-base/trustgraph/provenance/agent.py
@ -0,0 +1,141 @@
 """
 Helper functions to build PROV-O triples for agent provenance.
 Agent provenance tracks the reasoning trace of ReAct agent sessions:
 - Question: The root activity with query and timestamp
 - Analysis: Each think/act/observe cycle
 - Conclusion: The final answer
 """
 import json
 from datetime import datetime
 from typing import List, Optional, Dict, Any
 from .. schema import Triple, Term, IRI, LITERAL
 from . namespaces import (
    RDF_TYPE, RDFS_LABEL,
    PROV_ACTIVITY, PROV_ENTITY, PROV_WAS_DERIVED_FROM, PROV_STARTED_AT_TIME,
    TG_QUERY, TG_THOUGHT, TG_ACTION, TG_ARGUMENTS, TG_OBSERVATION, TG_ANSWER,
    TG_QUESTION, TG_ANALYSIS, TG_CONCLUSION,
    TG_AGENT_QUESTION,
 )
 def _iri(uri: str) -> Term:
    """Create an IRI term."""
    return Term(type=IRI, iri=uri)
 def _literal(value) -> Term:
    """Create a literal term."""
    return Term(type=LITERAL, value=str(value))
 def _triple(s: str, p: str, o_term: Term) -> Triple:
    """Create a triple with IRI subject and predicate."""
    return Triple(s=_iri(s), p=_iri(p), o=o_term)
 def agent_session_triples(
    session_uri: str,
    query: str,
    timestamp: Optional[str] = None,
 ) -> List[Triple]:
    """
    Build triples for an agent session start (Question).
    Creates:
    - Activity declaration with tg:Question type
    - Query text and timestamp
    Args:
        session_uri: URI of the session (from agent_session_uri)
        query: The user's query text
        timestamp: ISO timestamp (defaults to now)
    Returns:
        List of Triple objects
    """
    if timestamp is None:
        timestamp = datetime.utcnow().isoformat() + "Z"
    return [
        _triple(session_uri, RDF_TYPE, _iri(PROV_ACTIVITY)),
        _triple(session_uri, RDF_TYPE, _iri(TG_QUESTION)),
        _triple(session_uri, RDF_TYPE, _iri(TG_AGENT_QUESTION)),
        _triple(session_uri, RDFS_LABEL, _literal("Agent Question")),
        _triple(session_uri, PROV_STARTED_AT_TIME, _literal(timestamp)),
        _triple(session_uri, TG_QUERY, _literal(query)),
    ]
 def agent_iteration_triples(
    iteration_uri: str,
    parent_uri: str,
    thought: str,
    action: str,
    arguments: Dict[str, Any],
    observation: str,
 ) -> List[Triple]:
    """
    Build triples for one agent iteration (Analysis - think/act/observe cycle).
    Creates:
    - Entity declaration with tg:Analysis type
    - wasDerivedFrom link to parent (previous iteration or session)
    - Thought, action, arguments, and observation data
    Args:
        iteration_uri: URI of this iteration (from agent_iteration_uri)
        parent_uri: URI of the parent (previous iteration or session)
        thought: The agent's reasoning/thought
        action: The tool/action name
        arguments: Arguments passed to the tool (will be JSON-encoded)
        observation: The result/observation from the tool
    Returns:
        List of Triple objects
    """
    triples = [
        _triple(iteration_uri, RDF_TYPE, _iri(PROV_ENTITY)),
        _triple(iteration_uri, RDF_TYPE, _iri(TG_ANALYSIS)),
        _triple(iteration_uri, RDFS_LABEL, _literal(f"Analysis: {action}")),
        _triple(iteration_uri, PROV_WAS_DERIVED_FROM, _iri(parent_uri)),
        _triple(iteration_uri, TG_THOUGHT, _literal(thought)),
        _triple(iteration_uri, TG_ACTION, _literal(action)),
        _triple(iteration_uri, TG_ARGUMENTS, _literal(json.dumps(arguments))),
        _triple(iteration_uri, TG_OBSERVATION, _literal(observation)),
    ]
    return triples
 def agent_final_triples(
    final_uri: str,
    parent_uri: str,
    answer: str,
 ) -> List[Triple]:
    """
    Build triples for an agent final answer (Conclusion).
    Creates:
    - Entity declaration with tg:Conclusion type
    - wasDerivedFrom link to parent (last iteration or session)
    - The answer text
    Args:
        final_uri: URI of the final answer (from agent_final_uri)
        parent_uri: URI of the parent (last iteration or session if no iterations)
        answer: The final answer text
    Returns:
        List of Triple objects
    """
    return [
        _triple(final_uri, RDF_TYPE, _iri(PROV_ENTITY)),
        _triple(final_uri, RDF_TYPE, _iri(TG_CONCLUSION)),
        _triple(final_uri, RDFS_LABEL, _literal("Conclusion")),
        _triple(final_uri, PROV_WAS_DERIVED_FROM, _iri(parent_uri)),
        _triple(final_uri, TG_ANSWER, _literal(answer)),
    ]
--- a/trustgraph-base/trustgraph/provenance/namespaces.py
+++ b/trustgraph-base/trustgraph/provenance/namespaces.py
@ -59,7 +59,7 @@ TG_SOURCE_TEXT = TG + "sourceText"
 TG_SOURCE_CHAR_OFFSET = TG + "sourceCharOffset"
 TG_SOURCE_CHAR_LENGTH = TG + "sourceCharLength"
-# Query-time provenance predicates
+# Query-time provenance predicates (GraphRAG)
 TG_QUERY = TG + "query"
 TG_EDGE_COUNT = TG + "edgeCount"
 TG_SELECTED_EDGE = TG + "selectedEdge"
@ -68,6 +68,30 @@ TG_REASONING = TG + "reasoning"
 TG_CONTENT = TG + "content"
 TG_DOCUMENT = TG + "document"  # Reference to document in librarian
 # Query-time provenance predicates (DocumentRAG)
 TG_CHUNK_COUNT = TG + "chunkCount"
 TG_SELECTED_CHUNK = TG + "selectedChunk"
 # Explainability entity types (shared)
 TG_QUESTION = TG + "Question"
 TG_EXPLORATION = TG + "Exploration"
 TG_FOCUS = TG + "Focus"
 TG_SYNTHESIS = TG + "Synthesis"
 TG_ANALYSIS = TG + "Analysis"
 TG_CONCLUSION = TG + "Conclusion"
 # Question subtypes (to distinguish retrieval mechanism)
 TG_GRAPH_RAG_QUESTION = TG + "GraphRagQuestion"
 TG_DOC_RAG_QUESTION = TG + "DocRagQuestion"
 TG_AGENT_QUESTION = TG + "AgentQuestion"
 # Agent provenance predicates
 TG_THOUGHT = TG + "thought"
 TG_ACTION = TG + "action"
 TG_ARGUMENTS = TG + "arguments"
 TG_OBSERVATION = TG + "observation"
 TG_ANSWER = TG + "answer"
 # Named graph URIs for RDF datasets
 # These separate different types of data while keeping them in the same collection
 GRAPH_DEFAULT = ""                       # Core knowledge facts (triples extracted from documents)
--- a/trustgraph-base/trustgraph/provenance/triples.py
+++ b/trustgraph-base/trustgraph/provenance/triples.py
@ -17,9 +17,15 @@ from . namespaces import (
    TG_CHUNK_INDEX, TG_CHAR_OFFSET, TG_CHAR_LENGTH,
    TG_CHUNK_SIZE, TG_CHUNK_OVERLAP, TG_COMPONENT_VERSION,
    TG_LLM_MODEL, TG_ONTOLOGY, TG_REIFIES,
-    # Query-time provenance predicates
+    # Query-time provenance predicates (GraphRAG)
    TG_QUERY, TG_EDGE_COUNT, TG_SELECTED_EDGE, TG_EDGE, TG_REASONING, TG_CONTENT,
    TG_DOCUMENT,
    # Query-time provenance predicates (DocumentRAG)
    TG_CHUNK_COUNT, TG_SELECTED_CHUNK,
    # Explainability entity types
    TG_QUESTION, TG_EXPLORATION, TG_FOCUS, TG_SYNTHESIS,
    # Question subtypes
    TG_GRAPH_RAG_QUESTION, TG_DOC_RAG_QUESTION,
 )
 from . uris import activity_uri, agent_uri, edge_selection_uri
@ -310,7 +316,9 @@ def question_triples(
    return [
        _triple(question_uri, RDF_TYPE, _iri(PROV_ACTIVITY)),
-        _triple(question_uri, RDFS_LABEL, _literal("GraphRAG question")),
+        _triple(question_uri, RDF_TYPE, _iri(TG_QUESTION)),
        _triple(question_uri, RDF_TYPE, _iri(TG_GRAPH_RAG_QUESTION)),
        _triple(question_uri, RDFS_LABEL, _literal("GraphRAG Question")),
        _triple(question_uri, PROV_STARTED_AT_TIME, _literal(timestamp)),
        _triple(question_uri, TG_QUERY, _literal(query)),
    ]
@ -339,6 +347,7 @@ def exploration_triples(
    """
    return [
        _triple(exploration_uri, RDF_TYPE, _iri(PROV_ENTITY)),
        _triple(exploration_uri, RDF_TYPE, _iri(TG_EXPLORATION)),
        _triple(exploration_uri, RDFS_LABEL, _literal("Exploration")),
        _triple(exploration_uri, PROV_WAS_GENERATED_BY, _iri(question_uri)),
        _triple(exploration_uri, TG_EDGE_COUNT, _literal(edge_count)),
@ -383,6 +392,7 @@ def focus_triples(
    """
    triples = [
        _triple(focus_uri, RDF_TYPE, _iri(PROV_ENTITY)),
        _triple(focus_uri, RDF_TYPE, _iri(TG_FOCUS)),
        _triple(focus_uri, RDFS_LABEL, _literal("Focus")),
        _triple(focus_uri, PROV_WAS_DERIVED_FROM, _iri(exploration_uri)),
    ]
@ -443,6 +453,7 @@ def synthesis_triples(
    """
    triples = [
        _triple(synthesis_uri, RDF_TYPE, _iri(PROV_ENTITY)),
        _triple(synthesis_uri, RDF_TYPE, _iri(TG_SYNTHESIS)),
        _triple(synthesis_uri, RDFS_LABEL, _literal("Synthesis")),
        _triple(synthesis_uri, PROV_WAS_DERIVED_FROM, _iri(focus_uri)),
    ]
@ -455,3 +466,120 @@ def synthesis_triples(
        triples.append(_triple(synthesis_uri, TG_CONTENT, _literal(answer_text)))
    return triples
 # Document RAG provenance triple builders
 #
 # Document RAG uses a subset of GraphRAG's model:
 #   Question    - What was asked
 #   Exploration - Chunks retrieved from document store
 #   Synthesis   - The final answer (no Focus step)
 def docrag_question_triples(
    question_uri: str,
    query: str,
    timestamp: Optional[str] = None,
 ) -> List[Triple]:
    """
    Build triples for a document RAG question activity.
    Creates:
    - Activity declaration with tg:Question type
    - Query text and timestamp
    Args:
        question_uri: URI of the question (from docrag_question_uri)
        query: The user's query text
        timestamp: ISO timestamp (defaults to now)
    Returns:
        List of Triple objects
    """
    if timestamp is None:
        timestamp = datetime.utcnow().isoformat() + "Z"
    return [
        _triple(question_uri, RDF_TYPE, _iri(PROV_ACTIVITY)),
        _triple(question_uri, RDF_TYPE, _iri(TG_QUESTION)),
        _triple(question_uri, RDF_TYPE, _iri(TG_DOC_RAG_QUESTION)),
        _triple(question_uri, RDFS_LABEL, _literal("DocumentRAG Question")),
        _triple(question_uri, PROV_STARTED_AT_TIME, _literal(timestamp)),
        _triple(question_uri, TG_QUERY, _literal(query)),
    ]
 def docrag_exploration_triples(
    exploration_uri: str,
    question_uri: str,
    chunk_count: int,
    chunk_ids: Optional[List[str]] = None,
 ) -> List[Triple]:
    """
    Build triples for a document RAG exploration entity (chunks retrieved).
    Creates:
    - Entity declaration with tg:Exploration type
    - wasGeneratedBy link to question
    - Chunk count and optional chunk references
    Args:
        exploration_uri: URI of the exploration entity
        question_uri: URI of the parent question
        chunk_count: Number of chunks retrieved
        chunk_ids: Optional list of chunk URIs/IDs
    Returns:
        List of Triple objects
    """
    triples = [
        _triple(exploration_uri, RDF_TYPE, _iri(PROV_ENTITY)),
        _triple(exploration_uri, RDF_TYPE, _iri(TG_EXPLORATION)),
        _triple(exploration_uri, RDFS_LABEL, _literal("Exploration")),
        _triple(exploration_uri, PROV_WAS_GENERATED_BY, _iri(question_uri)),
        _triple(exploration_uri, TG_CHUNK_COUNT, _literal(chunk_count)),
    ]
    # Add references to selected chunks
    if chunk_ids:
        for chunk_id in chunk_ids:
            triples.append(_triple(exploration_uri, TG_SELECTED_CHUNK, _iri(chunk_id)))
    return triples
 def docrag_synthesis_triples(
    synthesis_uri: str,
    exploration_uri: str,
    answer_text: str = "",
    document_id: Optional[str] = None,
 ) -> List[Triple]:
    """
    Build triples for a document RAG synthesis entity (final answer).
    Creates:
    - Entity declaration with tg:Synthesis type
    - wasDerivedFrom link to exploration (skips focus step)
    - Either document reference or inline content
    Args:
        synthesis_uri: URI of the synthesis entity
        exploration_uri: URI of the parent exploration entity
        answer_text: The synthesized answer text (used if no document_id)
        document_id: Optional librarian document ID (preferred over inline content)
    Returns:
        List of Triple objects
    """
    triples = [
        _triple(synthesis_uri, RDF_TYPE, _iri(PROV_ENTITY)),
        _triple(synthesis_uri, RDF_TYPE, _iri(TG_SYNTHESIS)),
        _triple(synthesis_uri, RDFS_LABEL, _literal("Synthesis")),
        _triple(synthesis_uri, PROV_WAS_DERIVED_FROM, _iri(exploration_uri)),
    ]
    if document_id:
        triples.append(_triple(synthesis_uri, TG_DOCUMENT, _iri(document_id)))
    elif answer_text:
        triples.append(_triple(synthesis_uri, TG_CONTENT, _literal(answer_text)))
    return triples
--- a/trustgraph-base/trustgraph/provenance/uris.py
+++ b/trustgraph-base/trustgraph/provenance/uris.py
@ -138,3 +138,94 @@ def edge_selection_uri(session_id: str, edge_index: int) -> str:
        URN in format: urn:trustgraph:prov:edge:{uuid}:{index}
    """
    return f"urn:trustgraph:prov:edge:{session_id}:{edge_index}"
 # Agent provenance URIs
 # These URIs use the urn:trustgraph:agent: namespace to distinguish agent
 # provenance from GraphRAG question provenance
 def agent_session_uri(session_id: str = None) -> str:
    """
    Generate URI for an agent session.
    Args:
        session_id: Optional UUID string. Auto-generates if not provided.
    Returns:
        URN in format: urn:trustgraph:agent:{uuid}
    """
    if session_id is None:
        session_id = str(uuid.uuid4())
    return f"urn:trustgraph:agent:{session_id}"
 def agent_iteration_uri(session_id: str, iteration_num: int) -> str:
    """
    Generate URI for an agent iteration.
    Args:
        session_id: The session UUID.
        iteration_num: 1-based iteration number.
    Returns:
        URN in format: urn:trustgraph:agent:{uuid}/i{num}
    """
    return f"urn:trustgraph:agent:{session_id}/i{iteration_num}"
 def agent_final_uri(session_id: str) -> str:
    """
    Generate URI for an agent final answer.
    Args:
        session_id: The session UUID.
    Returns:
        URN in format: urn:trustgraph:agent:{uuid}/final
    """
    return f"urn:trustgraph:agent:{session_id}/final"
 # Document RAG provenance URIs
 # These URIs use the urn:trustgraph:docrag: namespace to distinguish
 # document RAG provenance from graph RAG provenance
 def docrag_question_uri(session_id: str = None) -> str:
    """
    Generate URI for a document RAG question activity.
    Args:
        session_id: Optional UUID string. Auto-generates if not provided.
    Returns:
        URN in format: urn:trustgraph:docrag:{uuid}
    """
    if session_id is None:
        session_id = str(uuid.uuid4())
    return f"urn:trustgraph:docrag:{session_id}"
 def docrag_exploration_uri(session_id: str) -> str:
    """
    Generate URI for a document RAG exploration entity (chunks retrieved).
    Args:
        session_id: The session UUID.
    Returns:
        URN in format: urn:trustgraph:docrag:{uuid}/exploration
    """
    return f"urn:trustgraph:docrag:{session_id}/exploration"
 def docrag_synthesis_uri(session_id: str) -> str:
    """
    Generate URI for a document RAG synthesis entity (final answer).
    Args:
        session_id: The session UUID.
    Returns:
        URN in format: urn:trustgraph:docrag:{uuid}/synthesis
    """
    return f"urn:trustgraph:docrag:{session_id}/synthesis"
--- a/trustgraph-base/trustgraph/schema/services/agent.py
+++ b/trustgraph-base/trustgraph/schema/services/agent.py
@ -23,7 +23,9 @@ class AgentRequest:
    group: list[str] | None = None
    history: list[AgentStep] = field(default_factory=list)
    user: str = ""              # User context for multi-tenancy
-    streaming: bool = False     # NEW: Enable streaming response delivery (default false)
+    collection: str = "default" # Collection for provenance traces
    streaming: bool = False     # Enable streaming response delivery (default false)
    session_id: str = ""        # For provenance tracking across iterations
@dataclass
 class AgentResponse:
--- a/trustgraph-base/trustgraph/schema/services/retrieval.py
+++ b/trustgraph-base/trustgraph/schema/services/retrieval.py
@ -42,5 +42,7 @@ class DocumentRagQuery:
@dataclass
 class DocumentRagResponse:
    error: Error | None = None
-    response: str = ""
+    response: str | None = ""
    end_of_stream: bool = False
    explain_id: str | None = None     # Single explain URI (announced as created)
    explain_graph: str | None = None  # Named graph where explain was stored (e.g., urn:graph:retrieval)
--- a/trustgraph-cli/trustgraph/cli/list_explain_traces.py
+++ b/trustgraph-cli/trustgraph/cli/list_explain_traces.py
@ -1,8 +1,8 @@
 """
-List all GraphRAG sessions (questions) in a collection.
+List all explainability sessions (GraphRAG and Agent) in a collection.
 Queries for all questions stored in the retrieval graph and displays them
-with their session IDs and timestamps.
+with their session IDs, type (GraphRAG or Agent), and timestamps.
 Examples:
  tg-list-explain-traces -U trustgraph -C default
@ -24,8 +24,14 @@ default_collection = 'default'
 # Predicates
 TG = "https://trustgraph.ai/ns/"
 TG_QUERY = TG + "query"
 TG_QUESTION = TG + "Question"
 TG_ANALYSIS = TG + "Analysis"
 TG_EXPLORATION = TG + "Exploration"
 PROV = "http://www.w3.org/ns/prov#"
 PROV_STARTED_AT_TIME = PROV + "startedAtTime"
 PROV_WAS_DERIVED_FROM = PROV + "wasDerivedFrom"
 PROV_WAS_GENERATED_BY = PROV + "wasGeneratedBy"
 RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
 # Retrieval graph
 RETRIEVAL_GRAPH = "urn:graph:retrieval"
@ -117,8 +123,45 @@ def get_timestamp(socket, flow_id, user, collection, question_id):
    return ""
 def get_session_type(socket, flow_id, user, collection, session_id):
    """
    Get the type of session (Agent or GraphRAG).
    Both have tg:Question type, so we distinguish by URI pattern
    or by checking what's derived from it.
    """
    # Fast path: check URI pattern
    if session_id.startswith("urn:trustgraph:agent:"):
        return "Agent"
    if session_id.startswith("urn:trustgraph:question:"):
        return "GraphRAG"
    # Check what's derived from this entity
    derived = query_triples(
        socket, flow_id, user, collection,
        p=PROV_WAS_DERIVED_FROM, o=session_id, g=RETRIEVAL_GRAPH
    )
    generated = query_triples(
        socket, flow_id, user, collection,
        p=PROV_WAS_GENERATED_BY, o=session_id, g=RETRIEVAL_GRAPH
    )
    for s, p, o in derived + generated:
        child_types = query_triples(
            socket, flow_id, user, collection,
            s=s, p=RDF_TYPE, g=RETRIEVAL_GRAPH
        )
        for _, _, child_type in child_types:
            if child_type == TG_ANALYSIS:
                return "Agent"
            if child_type == TG_EXPLORATION:
                return "GraphRAG"
    return "GraphRAG"
 def list_sessions(socket, flow_id, user, collection, limit):
-    """List all GraphRAG sessions by finding questions."""
+    """List all explainability sessions (GraphRAG and Agent) by finding questions."""
    # Query for all triples with predicate = tg:query
    triples = query_triples(
        socket, flow_id, user, collection,
@ -129,9 +172,12 @@ def list_sessions(socket, flow_id, user, collection, limit):
    for question_id, _, query_text in triples:
        # Get timestamp if available
        timestamp = get_timestamp(socket, flow_id, user, collection, question_id)
        # Get session type (Agent or GraphRAG)
        session_type = get_session_type(socket, flow_id, user, collection, question_id)
        sessions.append({
            "id": question_id,
            "type": session_type,
            "question": query_text,
            "time": timestamp,
        })
@ -154,18 +200,19 @@ def truncate_text(text, max_len=60):
 def print_table(sessions):
    """Print sessions as a table."""
    if not sessions:
-        print("No GraphRAG sessions found.")
+        print("No explainability sessions found.")
        return
    rows = []
    for session in sessions:
        rows.append([
            session["id"],
-            truncate_text(session["question"], 50),
+            session.get("type", "Unknown"),
            truncate_text(session["question"], 45),
            session.get("time", "")
        ])
-    headers = ["Session ID", "Question", "Time"]
+    headers = ["Session ID", "Type", "Question", "Time"]
    print(tabulate(rows, headers=headers, tablefmt="simple"))
--- a/trustgraph-cli/trustgraph/cli/show_explain_trace.py
+++ b/trustgraph-cli/trustgraph/cli/show_explain_trace.py
@ -1,11 +1,15 @@
 """
-Show full explainability trace for a GraphRAG session.
+Show full explainability trace for a GraphRAG or Agent session.
-Given a question/session URI, displays the complete cascade:
+Given a question/session URI, displays the complete trace:
-Question -> Exploration -> Focus (edge selection) -> Synthesis (answer).
+- GraphRAG: Question -> Exploration -> Focus (edge selection) -> Synthesis (answer)
 - Agent: Session -> Iteration(s) (thought/action/observation) -> Final Answer
 The tool auto-detects the trace type based on rdf:type.
 Examples:
  tg-show-explain-trace -U trustgraph -C default "urn:trustgraph:question:abc123"
  tg-show-explain-trace -U trustgraph -C default "urn:trustgraph:agent:abc123"
  tg-show-explain-trace --max-answer 1000 "urn:trustgraph:question:abc123"
  tg-show-explain-trace --show-provenance "urn:trustgraph:question:abc123"
 """
@ -31,10 +35,25 @@ TG_REASONING = TG + "reasoning"
 TG_CONTENT = TG + "content"
 TG_DOCUMENT = TG + "document"
 TG_REIFIES = TG + "reifies"
 # Explainability entity types
 TG_QUESTION = TG + "Question"
 TG_EXPLORATION = TG + "Exploration"
 TG_FOCUS = TG + "Focus"
 TG_SYNTHESIS = TG + "Synthesis"
 TG_ANALYSIS = TG + "Analysis"
 TG_CONCLUSION = TG + "Conclusion"
 # Agent predicates
 TG_THOUGHT = TG + "thought"
 TG_ACTION = TG + "action"
 TG_ARGUMENTS = TG + "arguments"
 TG_OBSERVATION = TG + "observation"
 TG_ANSWER = TG + "answer"
 PROV = "http://www.w3.org/ns/prov#"
 PROV_STARTED_AT_TIME = PROV + "startedAtTime"
 PROV_WAS_DERIVED_FROM = PROV + "wasDerivedFrom"
 PROV_WAS_GENERATED_BY = PROV + "wasGeneratedBy"
 RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
 RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
 # Graphs
@ -280,6 +299,186 @@ def format_edge(edge, label_cache=None, socket=None, flow_id=None, user=None, co
    return f"({s_label}, {p_label}, {o_label})"
 def detect_trace_type(socket, flow_id, user, collection, entity_id):
    """
    Detect whether an entity is an agent Question or GraphRAG Question.
    Both have rdf:type = tg:Question, so we distinguish by checking
    what's derived from it:
    - Agent: has tg:Analysis or tg:Conclusion derived
    - GraphRAG: has tg:Exploration derived
    Also checks URI pattern as fallback:
    - urn:trustgraph:agent: -> agent
    - urn:trustgraph:question: -> graphrag
    Returns:
        "agent" or "graphrag"
    """
    # Check URI pattern first (fast path)
    if entity_id.startswith("urn:trustgraph:agent:"):
        return "agent"
    if entity_id.startswith("urn:trustgraph:question:"):
        return "graphrag"
    # Check what's derived from this entity
    derived = find_by_predicate_object(
        socket, flow_id, user, collection,
        PROV_WAS_DERIVED_FROM, entity_id
    )
    # Also check wasGeneratedBy (GraphRAG exploration uses this)
    generated = find_by_predicate_object(
        socket, flow_id, user, collection,
        PROV_WAS_GENERATED_BY, entity_id
    )
    all_children = derived + generated
    for child_id in all_children:
        child_types = query_triples(
            socket, flow_id, user, collection,
            s=child_id, p=RDF_TYPE, g=RETRIEVAL_GRAPH
        )
        for s, p, o in child_types:
            if o == TG_ANALYSIS or o == TG_CONCLUSION:
                return "agent"
            if o == TG_EXPLORATION:
                return "graphrag"
    # Default to graphrag
    return "graphrag"
 def build_agent_trace(socket, flow_id, user, collection, session_id, api=None, max_answer=500):
    """Build the full explainability trace for an agent session."""
    trace = {
        "session_id": session_id,
        "type": "agent",
        "question": None,
        "time": None,
        "iterations": [],
        "final_answer": None,
    }
    # Get session metadata
    props = get_node_properties(socket, flow_id, user, collection, session_id)
    trace["question"] = props.get(TG_QUERY, [None])[0]
    trace["time"] = props.get(PROV_STARTED_AT_TIME, [None])[0]
    # Find all entities derived from this session (iterations and final)
    # Start by looking for entities where prov:wasDerivedFrom = session_id
    current_uri = session_id
    iteration_num = 1
    while True:
        # Find entities derived from current
        derived_ids = find_by_predicate_object(
            socket, flow_id, user, collection,
            PROV_WAS_DERIVED_FROM, current_uri
        )
        if not derived_ids:
            break
        derived_id = derived_ids[0]
        derived_props = get_node_properties(socket, flow_id, user, collection, derived_id)
        # Check type
        types = derived_props.get(RDF_TYPE, [])
        if TG_ANALYSIS in types:
            iteration = {
                "id": derived_id,
                "iteration_num": iteration_num,
                "thought": derived_props.get(TG_THOUGHT, [None])[0],
                "action": derived_props.get(TG_ACTION, [None])[0],
                "arguments": derived_props.get(TG_ARGUMENTS, [None])[0],
                "observation": derived_props.get(TG_OBSERVATION, [None])[0],
            }
            trace["iterations"].append(iteration)
            current_uri = derived_id
            iteration_num += 1
        elif TG_CONCLUSION in types:
            answer = derived_props.get(TG_ANSWER, [None])[0]
            if answer and len(answer) > max_answer:
                answer = answer[:max_answer] + "... [truncated]"
            trace["final_answer"] = {
                "id": derived_id,
                "answer": answer,
            }
            break
        else:
            # Unknown type, stop traversal
            break
    return trace
 def print_agent_text(trace):
    """Print agent trace in text format."""
    print(f"=== Agent Session: {trace['session_id']} ===")
    print()
    if trace["question"]:
        print(f"Question: {trace['question']}")
    if trace["time"]:
        print(f"Time: {trace['time']}")
    print()
    # Analysis steps
    print("--- Analysis ---")
    iterations = trace.get("iterations", [])
    if iterations:
        for iteration in iterations:
            print(f"Analysis {iteration['iteration_num']}:")
            print(f"  Thought: {iteration.get('thought', 'N/A')}")
            print(f"  Action: {iteration.get('action', 'N/A')}")
            args = iteration.get('arguments')
            if args:
                # Try to pretty-print JSON arguments
                try:
                    import json
                    args_obj = json.loads(args)
                    args_str = json.dumps(args_obj, indent=4)
                    # Indent each line
                    args_lines = args_str.split('\n')
                    print(f"  Arguments:")
                    for line in args_lines:
                        print(f"    {line}")
                except:
                    print(f"  Arguments: {args}")
            else:
                print(f"  Arguments: N/A")
            obs = iteration.get('observation', 'N/A')
            if obs and len(obs) > 200:
                obs = obs[:200] + "... [truncated]"
            print(f"  Observation: {obs}")
            print()
    else:
        print("No analysis steps recorded")
        print()
    # Conclusion
    print("--- Conclusion ---")
    final = trace.get("final_answer")
    if final and final.get("answer"):
        print("Answer:")
        for line in final["answer"].split("\n"):
            print(f"  {line}")
    else:
        print("No conclusion recorded")
 def print_agent_json(trace):
    """Print agent trace as JSON."""
    print(json.dumps(trace, indent=2))
 def build_trace(socket, flow_id, user, collection, question_id, api=None, show_provenance=False, max_answer=500):
    """Build the full explainability trace for a question."""
    label_cache = {}
@ -530,21 +729,48 @@ def main():
        socket = api.socket()
        try:
-            trace = build_trace(
+            # Detect trace type (agent vs graphrag)
            trace_type = detect_trace_type(
                socket=socket,
                flow_id=args.flow_id,
                user=args.user,
                collection=args.collection,
-                question_id=args.question_id,
+                entity_id=args.question_id,
                api=api,
                show_provenance=args.show_provenance,
                max_answer=args.max_answer,
            )
-            if args.format == 'json':
+            if trace_type == "agent":
-                print_json(trace)
+                # Build and print agent trace
                trace = build_agent_trace(
                    socket=socket,
                    flow_id=args.flow_id,
                    user=args.user,
                    collection=args.collection,
                    session_id=args.question_id,
                    api=api,
                    max_answer=args.max_answer,
                )
                if args.format == 'json':
                    print_agent_json(trace)
                else:
                    print_agent_text(trace)
            else:
-                print_text(trace, show_provenance=args.show_provenance)
+                # Build and print GraphRAG trace (existing behavior)
                trace = build_trace(
                    socket=socket,
                    flow_id=args.flow_id,
                    user=args.user,
                    collection=args.collection,
                    question_id=args.question_id,
                    api=api,
                    show_provenance=args.show_provenance,
                    max_answer=args.max_answer,
                )
                if args.format == 'json':
                    print_json(trace)
                else:
                    print_text(trace, show_provenance=args.show_provenance)
        finally:
            socket.close()
--- a/trustgraph-flow/trustgraph/agent/react/service.py
+++ b/trustgraph-flow/trustgraph/agent/react/service.py
@ -7,6 +7,8 @@ import re
 import sys
 import functools
 import logging
 import uuid
 from datetime import datetime
 # Module logger
 logger = logging.getLogger(__name__)
@ -14,8 +16,22 @@ logger = logging.getLogger(__name__)
 from ... base import AgentService, TextCompletionClientSpec, PromptClientSpec
 from ... base import GraphRagClientSpec, ToolClientSpec, StructuredQueryClientSpec
 from ... base import RowEmbeddingsQueryClientSpec, EmbeddingsClientSpec
 from ... base import ProducerSpec
 from ... schema import AgentRequest, AgentResponse, AgentStep, Error
 from ... schema import Triples, Metadata
 # Provenance imports for agent explainability
 from trustgraph.provenance import (
    agent_session_uri,
    agent_iteration_uri,
    agent_final_uri,
    agent_session_triples,
    agent_iteration_triples,
    agent_final_triples,
    set_graph,
    GRAPH_RETRIEVAL,
 )
 from . tools import KnowledgeQueryImpl, TextCompletionImpl, McpToolImpl, PromptImpl, StructuredQueryImpl, RowEmbeddingsQueryImpl, ToolServiceImpl
 from . agent_manager import AgentManager
@ -105,6 +121,14 @@ class Processor(AgentService):
            )
        )
        # Explainability producer for agent provenance triples
        self.register_specification(
            ProducerSpec(
                name = "explainability",
                schema = Triples,
            )
        )
    async def on_tools_config(self, config, version):
        logger.info(f"Loading configuration version {version}")
@ -285,6 +309,10 @@ class Processor(AgentService):
            # Check if streaming is enabled
            streaming = getattr(request, 'streaming', False)
            # Generate or retrieve session ID for provenance tracking
            session_id = getattr(request, 'session_id', '') or str(uuid.uuid4())
            collection = getattr(request, 'collection', 'default')
            if request.history:
                history = [
                    Action(
@ -298,6 +326,27 @@ class Processor(AgentService):
            else:
                history = []
            # Calculate iteration number (1-based)
            iteration_num = len(history) + 1
            session_uri = agent_session_uri(session_id)
            # On first iteration, emit session triples
            if iteration_num == 1:
                timestamp = datetime.utcnow().isoformat() + "Z"
                triples = set_graph(
                    agent_session_triples(session_uri, request.question, timestamp),
                    GRAPH_RETRIEVAL
                )
                await flow("explainability").send(Triples(
                    metadata=Metadata(
                        id=session_uri,
                        user=request.user,
                        collection=collection,
                    ),
                    triples=triples,
                ))
                logger.debug(f"Emitted session triples for {session_uri}")
            logger.info(f"Question: {request.question}")
            if len(history) >= self.max_iterations:
@ -447,6 +496,28 @@ class Processor(AgentService):
                else:
                    f = json.dumps(act.final)
                # Emit final answer provenance triples
                final_uri = agent_final_uri(session_id)
                # Parent is last iteration, or session if no iterations
                if iteration_num > 1:
                    parent_uri = agent_iteration_uri(session_id, iteration_num - 1)
                else:
                    parent_uri = session_uri
                final_triples = set_graph(
                    agent_final_triples(final_uri, parent_uri, f),
                    GRAPH_RETRIEVAL
                )
                await flow("explainability").send(Triples(
                    metadata=Metadata(
                        id=final_uri,
                        user=request.user,
                        collection=collection,
                    ),
                    triples=final_triples,
                ))
                logger.debug(f"Emitted final triples for {final_uri}")
                if streaming:
                    # Streaming format - send end-of-dialog marker
                    # Answer chunks were already sent via answer() callback during parsing
@ -479,8 +550,37 @@ class Processor(AgentService):
            logger.debug("Send next...")
            # Emit iteration provenance triples
            iteration_uri = agent_iteration_uri(session_id, iteration_num)
            # Parent is previous iteration, or session if this is first iteration
            if iteration_num > 1:
                parent_uri = agent_iteration_uri(session_id, iteration_num - 1)
            else:
                parent_uri = session_uri
            iter_triples = set_graph(
                agent_iteration_triples(
                    iteration_uri,
                    parent_uri,
                    act.thought,
                    act.name,
                    act.arguments,
                    act.observation,
                ),
                GRAPH_RETRIEVAL
            )
            await flow("explainability").send(Triples(
                metadata=Metadata(
                    id=iteration_uri,
                    user=request.user,
                    collection=collection,
                ),
                triples=iter_triples,
            ))
            logger.debug(f"Emitted iteration triples for {iteration_uri}")
            history.append(act)
-            
+
            # Handle state transitions if tool execution was successful
            next_state = request.state
            if act.name in filtered_tools:
@ -501,7 +601,9 @@ class Processor(AgentService):
                    for h in history
                ],
                user=request.user,
                collection=collection,
                streaming=streaming,
                session_id=session_id,  # Pass session_id for provenance continuity
            )
            await next(r)
--- a/trustgraph-flow/trustgraph/retrieval/document_rag/document_rag.py
+++ b/trustgraph-flow/trustgraph/retrieval/document_rag/document_rag.py
@ -1,6 +1,20 @@
 import asyncio
 import logging
 import uuid
 from datetime import datetime
 # Provenance imports
 from trustgraph.provenance import (
    docrag_question_uri,
    docrag_exploration_uri,
    docrag_synthesis_uri,
    docrag_question_triples,
    docrag_exploration_triples,
    docrag_synthesis_triples,
    set_graph,
    GRAPH_RETRIEVAL,
 )
 # Module logger
 logger = logging.getLogger(__name__)
@ -33,7 +47,14 @@ class Query:
        return qembeds[0] if qembeds else []
    async def get_docs(self, query):
        """
        Get documents (chunks) matching the query.
        Returns:
            tuple: (docs, chunk_ids) where:
                - docs: list of document content strings
                - chunk_ids: list of chunk IDs that were successfully fetched
        """
        vectors = await self.get_vector(query)
        if self.verbose:
@ -50,11 +71,13 @@ class Query:
        # Fetch chunk content from Garage
        docs = []
        chunk_ids = []
        for match in chunk_matches:
            if match.chunk_id:
                try:
                    content = await self.rag.fetch_chunk(match.chunk_id, self.user)
                    docs.append(content)
                    chunk_ids.append(match.chunk_id)
                except Exception as e:
                    logger.warning(f"Failed to fetch chunk {match.chunk_id}: {e}")
@ -63,7 +86,7 @@ class Query:
            for doc in docs:
                logger.debug(f"  {doc[:100]}...")
-        return docs
+        return docs, chunk_ids
 class DocumentRag:
@ -86,17 +109,56 @@ class DocumentRag:
    async def query(
            self, query, user="trustgraph", collection="default",
            doc_limit=20, streaming=False, chunk_callback=None,
            explain_callback=None,
    ):
        """
        Execute a Document RAG query with optional explainability tracking.
        Args:
            query: The query string
            user: User identifier
            collection: Collection identifier
            doc_limit: Max chunks to retrieve
            streaming: Enable streaming LLM response
            chunk_callback: async def callback(chunk, end_of_stream) for streaming
            explain_callback: async def callback(triples, explain_id) for explainability
        Returns:
            str: The synthesized answer text
        """
        if self.verbose:
            logger.debug("Constructing prompt...")
        # Generate explainability URIs upfront
        session_id = str(uuid.uuid4())
        q_uri = docrag_question_uri(session_id)
        exp_uri = docrag_exploration_uri(session_id)
        syn_uri = docrag_synthesis_uri(session_id)
        timestamp = datetime.utcnow().isoformat() + "Z"
        # Emit question explainability immediately
        if explain_callback:
            q_triples = set_graph(
                docrag_question_triples(q_uri, query, timestamp),
                GRAPH_RETRIEVAL
            )
            await explain_callback(q_triples, q_uri)
        q = Query(
            rag=self, user=user, collection=collection, verbose=self.verbose,
            doc_limit=doc_limit
        )
-        docs = await q.get_docs(query)
+        docs, chunk_ids = await q.get_docs(query)
        # Emit exploration explainability after chunks retrieved
        if explain_callback:
            exp_triples = set_graph(
                docrag_exploration_triples(exp_uri, q_uri, len(chunk_ids), chunk_ids),
                GRAPH_RETRIEVAL
            )
            await explain_callback(exp_triples, exp_uri)
        if self.verbose:
            logger.debug("Invoking LLM...")
@ -104,12 +166,21 @@ class DocumentRag:
            logger.debug(f"Query: {query}")
        if streaming and chunk_callback:
            # Accumulate chunks for answer storage while forwarding to callback
            accumulated_chunks = []
            async def accumulating_callback(chunk, end_of_stream):
                accumulated_chunks.append(chunk)
                await chunk_callback(chunk, end_of_stream)
            resp = await self.prompt_client.document_prompt(
                query=query,
                documents=docs,
                streaming=True,
-                chunk_callback=chunk_callback
+                chunk_callback=accumulating_callback
            )
            # Combine all chunks into full response
            resp = "".join(accumulated_chunks)
        else:
            resp = await self.prompt_client.document_prompt(
                query=query,
@ -119,5 +190,17 @@ class DocumentRag:
        if self.verbose:
            logger.debug("Query processing complete")
        # Emit synthesis explainability after answer generated
        if explain_callback:
            answer_text = resp if resp else ""
            syn_triples = set_graph(
                docrag_synthesis_triples(syn_uri, exp_uri, answer_text),
                GRAPH_RETRIEVAL
            )
            await explain_callback(syn_triples, syn_uri)
        if self.verbose:
            logger.debug(f"Emitted explain for session {session_id}")
        return resp
--- a/trustgraph-flow/trustgraph/retrieval/document_rag/rag.py
+++ b/trustgraph-flow/trustgraph/retrieval/document_rag/rag.py
@ -11,6 +11,8 @@ import logging
 from ... schema import DocumentRagQuery, DocumentRagResponse, Error
 from ... schema import LibrarianRequest, LibrarianResponse
 from ... schema import librarian_request_queue, librarian_response_queue
 from ... schema import Triples, Metadata
 from ... provenance import GRAPH_RETRIEVAL
 from . document_rag import DocumentRag
 from ... base import FlowProcessor, ConsumerSpec, ProducerSpec
 from ... base import PromptClientSpec, EmbeddingsClientSpec
@ -78,6 +80,13 @@ class Processor(FlowProcessor):
            )
        )
        self.register_specification(
            ProducerSpec(
                name = "explainability",
                schema = Triples,
            )
        )
        # Librarian client for fetching chunk content from Garage
        librarian_request_q = params.get(
            "librarian_request_queue", default_librarian_request_queue
@ -194,6 +203,29 @@ class Processor(FlowProcessor):
            else:
                doc_limit = self.doc_limit
            # Real-time explainability callback - emits triples and IDs as they're generated
            # Triples are stored in the user's collection with a named graph (urn:graph:retrieval)
            async def send_explainability(triples, explain_id):
                # Send triples to explainability queue - stores in same collection with named graph
                await flow("explainability").send(Triples(
                    metadata=Metadata(
                        id=explain_id,
                        user=v.user,
                        collection=v.collection,  # Store in user's collection
                    ),
                    triples=triples,
                ))
                # Send explain ID and graph to response queue
                await flow("response").send(
                    DocumentRagResponse(
                        response=None,
                        explain_id=explain_id,
                        explain_graph=GRAPH_RETRIEVAL,
                    ),
                    properties={"id": id}
                )
            # Check if streaming is requested
            if v.streaming:
                # Define async callback for streaming chunks
@ -217,6 +249,7 @@ class Processor(FlowProcessor):
                    doc_limit=doc_limit,
                    streaming=True,
                    chunk_callback=send_chunk,
                    explain_callback=send_explainability,
                )
            else:
                # Non-streaming path (existing behavior)
@ -224,7 +257,8 @@ class Processor(FlowProcessor):
                    v.query,
                    user=v.user,
                    collection=v.collection,
-                    doc_limit=doc_limit
+                    doc_limit=doc_limit,
                    explain_callback=send_explainability,
                )
                await flow("response").send(