mirror of
https://github.com/trustgraph-ai/trustgraph.git
synced 2026-04-25 08:26:21 +02:00
The metadata field (list of triples) in the pipeline Metadata class was redundant. Document metadata triples already flow directly from librarian to triple-store via emit_document_provenance() - they don't need to pass through the extraction pipeline. Additionally, chunker and PDF decoder were overwriting metadata to [] anyway, so any metadata passed through the pipeline was being discarded. Changes: - Remove metadata field from Metadata dataclass (schema/core/metadata.py) - Update all Metadata instantiations to remove metadata=[] parameter - Remove metadata handling from translators (document_loading, knowledge) - Remove metadata consumption from extractors (ontology, agent) - Update gateway serializers and import handlers - Update all unit, integration, and contract tests |
||
|---|---|---|
| .. | ||
| conftest.py | ||
| test_cassandra_config_integration.py | ||
| test_cassandra_storage_logic.py | ||
| test_doc_embeddings_milvus_storage.py | ||
| test_doc_embeddings_pinecone_storage.py | ||
| test_doc_embeddings_qdrant_storage.py | ||
| test_graph_embeddings_milvus_storage.py | ||
| test_graph_embeddings_pinecone_storage.py | ||
| test_graph_embeddings_qdrant_storage.py | ||
| test_memgraph_user_collection_isolation.py | ||
| test_neo4j_user_collection_isolation.py | ||
| test_row_embeddings_qdrant_storage.py | ||
| test_rows_cassandra_storage.py | ||
| test_triples_cassandra_storage.py | ||
| test_triples_falkordb_storage.py | ||
| test_triples_memgraph_storage.py | ||
| test_triples_neo4j_storage.py | ||