mirror of
https://github.com/trustgraph-ai/trustgraph.git
synced 2026-07-02 02:58:10 +02:00
git-subtree-dir: ai-context/trustgraph-templates git-subtree-split: 42a5fd1b678f32be378062e30451e2052ccb95dd
548 B
548 B
vLLM is a high-throughput, memory-efficient inference and serving engine for LLMs. Using PagedAttention and continuous batching, vLLM enables fully secure AI TrustGraph pipelines that aren't relying on any external APIs. No data is leaving the host environment or network.
The vLLM service must be running with the required model loaded using vllm serve. The vLLM service URL must be provided in an environment variable.
VLLM_BASE_URL=http://vllm-host:8000/v1
Replace the URL with the URL of your vLLM service, noting the v1 suffix.