trustgraph/templates/components/ollama.jsonnet

local base = import "base/base.jsonnet";
local images = import "values/images.jsonnet";
local url = import "values/url.jsonnet";
local prompts = import "prompts/slm.jsonnet";

{

    "ollama-model":: "gemma2:9b",
    "ollama-url":: "${OLLAMA_HOST}",

    "text-completion" +: {
    
        create:: function(engine)

            local container =
                engine.container("text-completion")
                    .with_image(images.trustgraph)
                    .with_command([
                        "text-completion-ollama",
                        "-p",
                        url.pulsar,
                        "-m",
                        $["ollama-model"],
                        "-r",
                        $["ollama-url"],
                    ])
                    .with_limits("0.5", "128M")
                    .with_reservations("0.1", "128M");

            local containerSet = engine.containers(
                "text-completion", [ container ]
            );

            local service =
                engine.internalService(containerSet)
                .with_port(8080, 8080, "metrics");

            engine.resources([
                containerSet,
                service,
            ])

    },

    "text-completion-rag" +: {
    
        create:: function(engine)

            local container =
                engine.container("text-completion-rag")
                    .with_image(images.trustgraph)
                    .with_command([
                        "text-completion-ollama",
                        "-p",
                        url.pulsar,
                        "-m",
                        $["ollama-model"],
                        "-r",
                        $["ollama-url"],
                        "-i",
                        "non-persistent://tg/request/text-completion-rag",
                        "-o",
                        "non-persistent://tg/response/text-completion-rag-response",
                    ])
                    .with_limits("0.5", "128M")
                    .with_reservations("0.1", "128M");

            local containerSet = engine.containers(
                "text-completion-rag", [ container ]
            );

            local service =
                engine.internalService(containerSet)
                .with_port(8080, 8080, "metrics");

            engine.resources([
                containerSet,
                service,
            ])


    }

} + prompts
Make templating work more flexibly (#44) * Restructure directory * Config loading * Variable override points in JSONNET templates, separate pulsar-manager template * Bump version * Tidy chunking * Simplified prompt overrides * Update config loader * Fix recursive chunker template 2024-08-30 17:47:35 +01:00			`local base = import "base/base.jsonnet";`
			`local images = import "values/images.jsonnet";`
			`local url = import "values/url.jsonnet";`
			`local prompts = import "prompts/slm.jsonnet";`
Refactor templates (#52) * Switching from docker compose to abstract form - should be easier to k8s later * Text loader util * Recreate templates 2024-09-05 16:40:47 +01:00
Simplify templates (#10) - Add component template files for all LLM types - Top-level templates simplified to use just components - Version to 0.6.2 2024-08-14 20:56:57 +01:00			`{`
Template rejig (#48) * document-rag / graph-rag refactor of templates * Tweaking the docs and categories * Clarify triple store vs RAG * Tweak knowledge graph linkage * Doc embedding for Qdrant * Fix document RAG on Qdrant * Fix templates * Bump version * Updated templates 2024-09-03 00:09:15 +01:00
			`"ollama-model":: "gemma2:9b",`
			`"ollama-url":: "${OLLAMA_HOST}",`

Refactor templates (#52) * Switching from docker compose to abstract form - should be easier to k8s later * Text loader util * Recreate templates 2024-09-05 16:40:47 +01:00			`"text-completion" +: {`

			`create:: function(engine)`

			`local container =`
			`engine.container("text-completion")`
			`.with_image(images.trustgraph)`
			`.with_command([`
			`"text-completion-ollama",`
			`"-p",`
			`url.pulsar,`
			`"-m",`
			`$["ollama-model"],`
			`"-r",`
			`$["ollama-url"],`
			`])`
			`.with_limits("0.5", "128M")`
			`.with_reservations("0.1", "128M");`

			`local containerSet = engine.containers(`
			`"text-completion", [ container ]`
			`);`
Simplify templates (#10) - Add component template files for all LLM types - Top-level templates simplified to use just components - Version to 0.6.2 2024-08-14 20:56:57 +01:00
K8s (#58) Added templates which produce K8s resources. With the provided GCP wrapper, it works on GCP K8s cluster. This isn't stable enough for other folks to use so will need more piloting before it can be documented and released. 2024-09-07 18:59:38 +01:00			`local service =`
			`engine.internalService(containerSet)`
			`.with_port(8080, 8080, "metrics");`

Refactor templates (#52) * Switching from docker compose to abstract form - should be easier to k8s later * Text loader util * Recreate templates 2024-09-05 16:40:47 +01:00			`engine.resources([`
			`containerSet,`
K8s (#58) Added templates which produce K8s resources. With the provided GCP wrapper, it works on GCP K8s cluster. This isn't stable enough for other folks to use so will need more piloting before it can be documented and released. 2024-09-07 18:59:38 +01:00			`service,`
Refactor templates (#52) * Switching from docker compose to abstract form - should be easier to k8s later * Text loader util * Recreate templates 2024-09-05 16:40:47 +01:00			`])`
Simplify templates (#10) - Add component template files for all LLM types - Top-level templates simplified to use just components - Version to 0.6.2 2024-08-14 20:56:57 +01:00
			`},`
Refactor templates (#52) * Switching from docker compose to abstract form - should be easier to k8s later * Text loader util * Recreate templates 2024-09-05 16:40:47 +01:00
			`"text-completion-rag" +: {`

			`create:: function(engine)`

			`local container =`
			`engine.container("text-completion-rag")`
			`.with_image(images.trustgraph)`
			`.with_command([`
			`"text-completion-ollama",`
			`"-p",`
			`url.pulsar,`
			`"-m",`
			`$["ollama-model"],`
			`"-r",`
			`$["ollama-url"],`
			`"-i",`
			`"non-persistent://tg/request/text-completion-rag",`
			`"-o",`
			`"non-persistent://tg/response/text-completion-rag-response",`
			`])`
			`.with_limits("0.5", "128M")`
			`.with_reservations("0.1", "128M");`

			`local containerSet = engine.containers(`
			`"text-completion-rag", [ container ]`
			`);`

K8s (#58) Added templates which produce K8s resources. With the provided GCP wrapper, it works on GCP K8s cluster. This isn't stable enough for other folks to use so will need more piloting before it can be documented and released. 2024-09-07 18:59:38 +01:00			`local service =`
			`engine.internalService(containerSet)`
			`.with_port(8080, 8080, "metrics");`

Refactor templates (#52) * Switching from docker compose to abstract form - should be easier to k8s later * Text loader util * Recreate templates 2024-09-05 16:40:47 +01:00			`engine.resources([`
			`containerSet,`
K8s (#58) Added templates which produce K8s resources. With the provided GCP wrapper, it works on GCP K8s cluster. This isn't stable enough for other folks to use so will need more piloting before it can be documented and released. 2024-09-07 18:59:38 +01:00			`service,`
Refactor templates (#52) * Switching from docker compose to abstract form - should be easier to k8s later * Text loader util * Recreate templates 2024-09-05 16:40:47 +01:00			`])`


			`}`

Prompt templates (#33) * Added prompt-template, allows definiton, relationships and kg query to be specified in config / command-line. * Bump version & add prompt-templates to YAMLs * Apply to graph rag flow * Break out different templates 2024-08-23 23:34:16 +01:00			`} + prompts`
Refactor templates (#52) * Switching from docker compose to abstract form - should be easier to k8s later * Text loader util * Recreate templates 2024-09-05 16:40:47 +01:00