mirror of
https://github.com/trustgraph-ai/trustgraph.git
synced 2026-04-25 00:16:23 +02:00
Add unified explainability support and librarian storage for all retrieval engines Implements consistent explainability/provenance tracking across GraphRAG, DocumentRAG, and Agent retrieval engines. All large content (answers, thoughts, observations) is now stored in librarian rather than as inline literals in the knowledge graph. Explainability API: - New explainability.py module with entity classes (Question, Exploration, Focus, Synthesis, Analysis, Conclusion) and ExplainabilityClient - Quiescence-based eventual consistency handling for trace fetching - Content fetching from librarian with retry logic CLI updates: - tg-invoke-graph-rag -x/--explainable flag returns explain_id - tg-invoke-document-rag -x/--explainable flag returns explain_id - tg-invoke-agent -x/--explainable flag returns explain_id - tg-list-explain-traces uses new explainability API - tg-show-explain-trace handles all three trace types Agent provenance: - Records session, iterations (think/act/observe), and conclusion - Stores thoughts and observations in librarian with document references - New predicates: tg:thoughtDocument, tg:observationDocument DocumentRAG provenance: - Records question, exploration (chunk retrieval), and synthesis - Stores answers in librarian with document references Schema changes: - AgentResponse: added explain_id, explain_graph fields - RetrievalResponse: added explain_id, explain_graph fields - agent_iteration_triples: supports thought_document_id, observation_document_id Update tests. |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| conftest.py | ||
| README.md | ||
| test_document_embeddings_contract.py | ||
| test_message_contracts.py | ||
| test_rows_cassandra_contracts.py | ||
| test_rows_graphql_query_contracts.py | ||
| test_structured_data_contracts.py | ||
| test_translator_completion_flags.py | ||
Contract Tests for TrustGraph
This directory contains contract tests that verify service interface contracts, message schemas, and API compatibility across the TrustGraph microservices architecture.
Overview
Contract tests ensure that:
- Message schemas remain compatible across service versions
- API interfaces stay stable for consumers
- Service communication contracts are maintained
- Schema evolution doesn't break existing integrations
Test Categories
1. Pulsar Message Schema Contracts (test_message_contracts.py)
Tests the contracts for all Pulsar message schemas used in TrustGraph service communication.
Coverage:
- ✅ Text Completion Messages:
TextCompletionRequest↔TextCompletionResponse - ✅ Document RAG Messages:
DocumentRagQuery↔DocumentRagResponse - ✅ Agent Messages:
AgentRequest↔AgentResponse↔AgentStep - ✅ Graph Messages:
Chunk→Triple→Triples→EntityContext - ✅ Common Messages:
Metadata,Value,Errorschemas - ✅ Message Routing: Properties, correlation IDs, routing keys
- ✅ Schema Evolution: Backward/forward compatibility testing
- ✅ Serialization: Schema validation and data integrity
Key Features:
- Schema Validation: Ensures all message schemas accept valid data and reject invalid data
- Field Contracts: Validates required vs optional fields and type constraints
- Nested Schema Support: Tests complex schemas with embedded objects and arrays
- Routing Contracts: Validates message properties and routing conventions
- Evolution Testing: Backward compatibility and schema versioning support
Running Contract Tests
Run All Contract Tests
pytest tests/contract/ -m contract
Run Specific Contract Test Categories
# Message schema contracts
pytest tests/contract/test_message_contracts.py -v
# Specific test class
pytest tests/contract/test_message_contracts.py::TestTextCompletionMessageContracts -v
# Schema evolution tests
pytest tests/contract/test_message_contracts.py::TestSchemaEvolutionContracts -v
Run with Coverage
pytest tests/contract/ -m contract --cov=trustgraph.schema --cov-report=html
Contract Test Patterns
1. Schema Validation Pattern
@pytest.mark.contract
def test_schema_contract(self, sample_message_data):
"""Test that schema accepts valid data and rejects invalid data"""
# Arrange
valid_data = sample_message_data["SchemaName"]
# Act & Assert
assert validate_schema_contract(SchemaClass, valid_data)
# Test field constraints
instance = SchemaClass(**valid_data)
assert hasattr(instance, 'required_field')
assert isinstance(instance.required_field, expected_type)
2. Serialization Contract Pattern
@pytest.mark.contract
def test_serialization_contract(self, sample_message_data):
"""Test schema serialization/deserialization contracts"""
# Arrange
data = sample_message_data["SchemaName"]
# Act & Assert
assert serialize_deserialize_test(SchemaClass, data)
3. Evolution Contract Pattern
@pytest.mark.contract
def test_backward_compatibility_contract(self, schema_evolution_data):
"""Test that new schema versions accept old data formats"""
# Arrange
old_version_data = schema_evolution_data["SchemaName_v1"]
# Act - Should work with current schema
instance = CurrentSchema(**old_version_data)
# Assert - Required fields maintained
assert instance.required_field == expected_value
Schema Registry
The contract tests maintain a registry of all TrustGraph schemas:
schema_registry = {
# Text Completion
"TextCompletionRequest": TextCompletionRequest,
"TextCompletionResponse": TextCompletionResponse,
# Document RAG
"DocumentRagQuery": DocumentRagQuery,
"DocumentRagResponse": DocumentRagResponse,
# Agent
"AgentRequest": AgentRequest,
"AgentResponse": AgentResponse,
# Graph/Knowledge
"Chunk": Chunk,
"Triple": Triple,
"Triples": Triples,
"Value": Value,
# Common
"Metadata": Metadata,
"Error": Error,
}
Message Contract Specifications
Text Completion Service Contract
TextCompletionRequest:
required_fields: [system, prompt]
field_types:
system: string
prompt: string
TextCompletionResponse:
required_fields: [error, response, model]
field_types:
error: Error | null
response: string | null
in_token: integer | null
out_token: integer | null
model: string
Document RAG Service Contract
DocumentRagQuery:
required_fields: [query, user, collection]
field_types:
query: string
user: string
collection: string
doc_limit: integer
DocumentRagResponse:
required_fields: [error, response]
field_types:
error: Error | null
response: string | null
Agent Service Contract
AgentRequest:
required_fields: [question, history]
field_types:
question: string
plan: string
state: string
history: Array<AgentStep>
AgentResponse:
required_fields: [error]
field_types:
answer: string | null
error: Error | null
thought: string | null
observation: string | null
Best Practices
Contract Test Design
- Test Both Valid and Invalid Data: Ensure schemas accept valid data and reject invalid data
- Verify Field Constraints: Test type constraints, required vs optional fields
- Test Nested Schemas: Validate complex objects with embedded schemas
- Test Array Fields: Ensure array serialization maintains order and content
- Test Optional Fields: Verify optional field handling in serialization
Schema Evolution
- Backward Compatibility: New schema versions must accept old message formats
- Required Field Stability: Required fields should never become optional or be removed
- Additive Changes: New fields should be optional to maintain compatibility
- Deprecation Strategy: Plan deprecation path for schema changes
Error Handling
- Error Schema Consistency: All error responses use consistent Error schema
- Error Type Contracts: Error types follow naming conventions
- Error Message Format: Error messages provide actionable information
Adding New Contract Tests
When adding new message schemas or modifying existing ones:
- Add to Schema Registry: Update
conftest.pyschema registry - Add Sample Data: Create valid sample data in
conftest.py - Create Contract Tests: Follow existing patterns for validation
- Test Evolution: Add backward compatibility tests
- Update Documentation: Document schema contracts in this README
Integration with CI/CD
Contract tests should be run:
- On every commit to detect breaking changes early
- Before releases to ensure API stability
- On schema changes to validate compatibility
- In dependency updates to catch breaking changes
# CI/CD pipeline command
pytest tests/contract/ -m contract --junitxml=contract-test-results.xml
Contract Test Results
Contract tests provide:
- ✅ Schema Compatibility Reports: Which schemas pass/fail validation
- ✅ Breaking Change Detection: Identifies contract violations
- ✅ Evolution Validation: Confirms backward compatibility
- ✅ Field Constraint Verification: Validates data type contracts
This ensures that TrustGraph services can evolve independently while maintaining stable, compatible interfaces for all service communication.