trustgraph/trustgraph-base/trustgraph/schema
cybermaggedon 1a7b654bd3
Add semantic pre-filter for GraphRAG edge scoring (#702)
Embed edge descriptions and compute cosine similarity against grounding
concepts to reduce the number of edges sent to expensive LLM scoring.
Controlled by the edge_score_limit parameter (default 30); skipped when
the edge count is already below the limit.
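
The pre-filter described above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the names `cosine`, `prefilter_edges`, and `embed` are hypothetical, edges are represented as plain description strings, and scoring each edge by its best (maximum) similarity to any grounding concept is an assumption, since the commit message does not say how similarities are aggregated.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def prefilter_edges(
    edges: List[str],
    concept_vectors: List[Vector],
    embed: Callable[[str], Vector],  # hypothetical embedding function
    edge_score_limit: int = 30,
) -> List[str]:
    """Keep the edge_score_limit edges most similar to the grounding concepts."""
    # Nothing would be dropped, so skip the embedding work entirely.
    if len(edges) <= edge_score_limit:
        return list(edges)

    scored = []
    for edge in edges:
        vec = embed(edge)
        # Assumption: an edge is as relevant as its best-matching concept.
        score = max((cosine(vec, c) for c in concept_vectors), default=0.0)
        scored.append((score, edge))

    # Highest similarity first; the sort is stable, so ties keep input order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [edge for _, edge in scored[:edge_score_limit]]
```

Only the edges returned here would go on to the LLM scoring step, which is where the cost saving comes from.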

Also plumbs edge_score_limit and edge_limit parameters end-to-end:
- CLI args (--edge-score-limit, --edge-limit) in both invoke and service
- Socket client: fix parameter mapping to use hyphenated wire-format keys
- Flow API, message translator, gateway all pass through correctly
- Explainable code path (_question_explainable_api) now forwards all params
- Default edge_score_limit changed from 50 to 30 based on typical subgraph
  sizes
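
As a rough illustration of the plumbing listed above, the sketch below parses the two CLI flags and builds a request body with hyphenated wire-format keys. The parser name, the `to_wire` helper, and the choice to leave `--edge-limit` unset by default are assumptions made for the example; the commit message only states the flag names and the edge_score_limit default of 30.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # "graph-rag-sketch" is a placeholder program name, not a real command.
    parser = argparse.ArgumentParser(prog="graph-rag-sketch")
    parser.add_argument(
        "--edge-score-limit", type=int, default=30,
        help="maximum number of edges passed on to LLM scoring",
    )
    parser.add_argument(
        "--edge-limit", type=int, default=None,
        help="edge limit (default not stated in the source, so left unset)",
    )
    return parser


def to_wire(args: argparse.Namespace) -> dict:
    """Map snake_case attributes to hyphenated wire keys, omitting unset values."""
    params = {
        "edge-score-limit": args.edge_score_limit,
        "edge-limit": args.edge_limit,
    }
    return {key: value for key, value in params.items() if value is not None}
```

argparse stores `--edge-score-limit` as `args.edge_score_limit`, so the explicit mapping in `to_wire` is the one place where the underscore and hyphen spellings have to be reconciled.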
2026-03-21 20:06:29 +00:00
core The id field in pipeline Metadata was being overwritten at each processing (#686) 2026-03-11 12:16:39 +00:00
knowledge Embeddings API scores (#671) 2026-03-09 10:53:44 +00:00
services Add semantic pre-filter for GraphRAG edge scoring (#702) 2026-03-21 20:06:29 +00:00
__init__.py Schema structure refactor (#451) 2025-08-04 21:42:57 +01:00
README.flows Schema structure refactor (#451) 2025-08-04 21:42:57 +01:00

                                  pdf-
                                decoder

                                   |
                                   v

                                chunker

                                   |
                ,------------------+----------- . . .
                |                  |
                v                  v

             extract-           extract-
           relationships      definitions

                |                |   |
                +----------------'   |
                |                    v
                v
                                 vectorize
             triple-
              store                  |
                                     v

                                  ge-write
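
One reading of the diagram above, written as an adjacency map, is sketched below. The `FLOW` dict and the `downstream` helper are illustrative only; the further chunker outputs that the diagram elides with ". . ." are left out, and the edge from extract-definitions to triple-store is inferred from the joined lines in the drawing.

```python
from collections import deque

# Each stage maps to the stages that consume its output.
FLOW = {
    "pdf-decoder": ["chunker"],
    "chunker": ["extract-relationships", "extract-definitions"],
    "extract-relationships": ["triple-store"],
    "extract-definitions": ["triple-store", "vectorize"],
    "vectorize": ["ge-write"],
    "triple-store": [],
    "ge-write": [],
}


def downstream(start, flow=FLOW):
    """Return every stage reachable from start, in breadth-first order."""
    seen = {start}
    order = []
    queue = deque(flow.get(start, []))
    while queue:
        node = queue.popleft()
        if node in seen:
            continue
        seen.add(node)
        order.append(node)
        queue.extend(flow.get(node, []))
    return order
```

For example, `downstream("vectorize")` yields only `["ge-write"]`, which makes it easy to check a proposed re-routing against the intended topology.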

Refactor:

[] Change vectorize
[] Re-route chunker to extract-*
[] Re-route vectorize to ge-write*
[] Re-route extract-definitions to ge-write*
[] Remove the routing from extract-relationships to ge-write