--- layout: default title: "Capability Vocabulary Technical Specification" parent: "Tech Specs" --- # Capability Vocabulary Technical Specification ## Overview Authorisation in TrustGraph is **capability-based**. Every gateway endpoint maps to exactly one *capability*; a user's roles each grant a set of capabilities; an authenticated request is permitted when the required capability is a member of the union of the caller's role capability sets. This document defines the capability vocabulary — the closed list of capability strings that the gateway recognises — and the open-source edition's role bundles. The capability mechanism is shared between open-source and potential 3rd party enterprise capability. The open-source edition ships a fixed three-role bundle (`reader`, `writer`, `admin`). Enterprise capability may define additional roles by composing their own capability bundles from the same vocabulary; no protocol, gateway, or backend-service change is required. ## Motivation The original IAM spec used hierarchical "minimum role" checks (`admin` implies `writer` implies `reader`). That shape is simple but paints the role model into a corner: any enterprise need to grant a subset of admin abilities (helpdesk that can reset passwords but not edit flows; analyst who can query but not ingest) requires a protocol-level change. A capability vocabulary decouples "what a request needs" from "what roles a user has" and makes the role table pure data. The open-source bundles can stay coarse while the enterprise role table expands without any code movement. ## Design ### Capability string format `:` or `` (for capabilities with no natural read/write split). All lowercase, kebab-case for multi-word subsystems. ### Capability list **Data plane** | Capability | Covers | |---|---| | `agent` | agent (query-only; no write counterpart) | | `graph:read` | graph-rag, graph-embeddings-query, triples-query, sparql, graph-embeddings-export, triples-export | | `graph:write` | triples-import, graph-embeddings-import | | `documents:read` | document-rag, document-embeddings-query, document-embeddings-export, entity-contexts-export, document-stream-export, library list / fetch | | `documents:write` | document-embeddings-import, entity-contexts-import, text-load, document-load, library add / replace / delete | | `rows:read` | rows-query, row-embeddings-query, nlp-query, structured-query, structured-diag | | `rows:write` | rows-import | | `llm` | text-completion, prompt (stateless invocation) | | `embeddings` | Raw text-embedding service (stateless compute; typed-data embedding stores live under their data-subject capability) | | `mcp` | mcp-tool | | `collections:read` | List / describe collections | | `collections:write` | Create / delete collections | | `knowledge:read` | List / get knowledge cores | | `knowledge:write` | Create / delete knowledge cores | **Control plane** | Capability | Covers | |---|---| | `config:read` | Read workspace config | | `config:write` | Write workspace config | | `flows:read` | List / describe flows, blueprints, flow classes | | `flows:write` | Start / stop / update flows | | `users:read` | List / get users within the workspace | | `users:write` | Create / update / disable users within the workspace | | `users:admin` | Assign / remove roles on users within the workspace | | `keys:self` | Create / revoke / list **own** API keys | | `keys:admin` | Create / revoke / list **any user's** API keys within the workspace | | `workspaces:admin` | Create / delete / disable workspaces (system-level) | | `iam:admin` | JWT signing-key rotation, IAM-level operations | | `metrics:read` | Prometheus metrics proxy | ### Open-source role bundles The open-source edition ships three roles: | Role | Capabilities | |---|---| | `reader` | `agent`, `graph:read`, `documents:read`, `rows:read`, `llm`, `embeddings`, `mcp`, `collections:read`, `knowledge:read`, `flows:read`, `config:read`, `keys:self` | | `writer` | everything in `reader` **+** `graph:write`, `documents:write`, `rows:write`, `collections:write`, `knowledge:write` | | `admin` | everything in `writer` **+** `config:write`, `flows:write`, `users:read`, `users:write`, `users:admin`, `keys:admin`, `workspaces:admin`, `iam:admin`, `metrics:read` | Open-source bundles are deliberately coarse. `workspaces:admin` and `iam:admin` live inside `admin` without a separate role; a single `admin` user holds the keys to the whole deployment. ### The `agent` capability and composition The `agent` capability is granted independently of the capabilities it composes under the hood (`llm`, `graph`, `documents`, `rows`, `mcp`, etc.). A user holding `agent` but not `llm` can still cause LLM invocations because the agent implementation chooses which services to invoke on the caller's behalf. This is deliberate. A common policy is "allow controlled access via the agent, deny raw model calls" — granting `agent` without granting `llm` expresses exactly that. An administrator granting `agent` should treat it as a grant of everything the agent composes at deployment time. ### Authorisation evaluation For a request bearing a resolved set of roles `R = {r1, r2, ...}` against an endpoint that requires capability `c`: ``` allow if c IN union(bundle(r) for r in R) ``` No hierarchy, no precedence, no role-order sensitivity. A user with a single role is the common case; a user with multiple roles gets the union of their bundles. ### Enforcement boundary Capability checks — and authentication — are applied **only at the API gateway**, on requests arriving from external callers. Operations originating inside the platform (backend service to backend service, agent to LLM, flow-svc to config-svc, bootstrap initialisers, scheduled reconcilers, autonomous flow steps) are **not capability-checked**. Backend services trust the workspace set by the gateway on inbound pub/sub messages and trust internally-originated messages without further authorisation. This policy has four consequences that are part of the spec, not accidents of implementation: 1. **The gateway is the single trust boundary for user authorisation.** Every backend service is a downstream consumer of an already-authorised workspace scope. 2. **Pub/sub carries workspace, not user identity.** Messages on the bus do not carry credentials or the identity that originated a request; they carry the resolved workspace only. This keeps the bus protocol free of secrets and aligns with the workspace resolver's role as the gateway-side narrowing step. 3. **Composition is transitive.** Granting a capability that the platform composes internally (for example, `agent`) transitively grants everything that capability composes under the hood, because the downstream calls are internal-origin and are not re-checked. The composite nature of `agent` described above is a consequence of this policy, not a special case. 4. **Internal-origin operations have no user.** Bootstrap, reconcilers, and other platform-initiated work act with system-level authority. The workspace field on such messages identifies which workspace's data is being touched, not who asked. **Trust model.** Whoever has pub/sub access is implicitly trusted to act as any workspace. Defense-in-depth within the backend is not part of this design; the security perimeter is the gateway and the bus itself (TLS / network isolation between the bus and any untrusted network). ### Unknown capabilities and unknown roles - An endpoint declaring an unknown capability is a server-side bug and fails closed (403, logged). - A user carrying a role name that is not defined in the role table is ignored for authorisation purposes and logged as a warning. Behaviour is deterministic: unknown roles contribute zero capabilities. ### Capability scope Every capability is **implicitly scoped to the caller's resolved workspace**. A `users:write` capability does not permit a user in workspace `acme` to create users in workspace `beta` — the workspace-resolver has already narrowed the request to one workspace before the capability check runs. See the IAM specification for the workspace-resolver contract. The three exceptions are the system-level capabilities `workspaces:admin` and `iam:admin`, which operate across workspaces by definition, and `metrics:read`, which returns process-level series not scoped to any workspace. ## Enterprise extensibility Enterprise editions extend the role table additively: ``` data-analyst: {query, library:read, collections:read, knowledge:read} helpdesk: {users:read, users:write, users:admin, keys:admin} data-engineer: writer + {flows:read, config:read} workspace-owner: admin − {workspaces:admin, iam:admin} ``` None of this requires a protocol change — the wire-protocol `roles` field on user records is already a set, the gateway's capability-check is already capability-based, and the capability vocabulary is closed. Enterprises may introduce roles whose bundles compose the same capabilities differently. When an enterprise introduces a new capability (e.g. for a feature that does not exist in open source), the capability string is added to the vocabulary and recognised by the gateway build that ships that feature. ## References - [Identity and Access Management Specification](iam.md) - [Architecture Principles](architecture-principles.md)