mirror of
https://github.com/Kaelio/ktx.git
synced 2026-07-01 08:59:39 +02:00
feat: add codex llm backend for ktx runtime work (#253)
* feat: add codex sdk runner foundation * feat: parse codex runtime events * feat: expose codex runtime mcp tools * feat: add codex llm runtime * feat: wire codex llm backend * test: avoid Array.fromAsync in codex runner test * docs: document codex llm backend * fix: tighten codex runtime config ownership * fix: use codex sdk env and thread options * fix: parse codex sdk event shapes * test: add codex backend live smoke * docs: clarify codex backend isolation * fix: drive codex loop metrics from mcp events * fix: enforce codex local step budget * docs: disclose codex isolation limits * fix: count all codex agent steps and stream step callbacks live The agent-loop step budget only counted completed mcp_tool_call items, so built-in command_execution steps (which the public Codex SDK/CLI surface can still expose) never decremented the budget, letting ingest/reconciliation run past stepBudget until Codex stopped on its own. onStepFinish was also replayed only after the whole stream drained, so live work_unit_step / reconciliation progress appeared stuck until the Codex process exited. collectEvents is now the single live step accumulator: it counts every completed agent-action item via a shared isCompletedAgentStep predicate (command_execution, mcp_tool_call, file_change, web_search), fires onStepFinish as each step completes, and enforces the budget on that broader count. A no-tool turn still counts as one step. toolFailures stays MCP-specific, since a non-zero command exit is normal agent exploration, not a loop failure. * test: align ingest llm-guard assertions with codex backend The skip-llm ingest guard message now lists codex as a valid backend and mentions a Claude Code/Codex session plus a codex setup hint, but this slow suite test still asserted the pre-codex wording. Update it to match the production message (already covered by the local-bundle-runtime unit test) and add the codex setup-line assertion. * fix: treat codex error:null tool calls as success The Codex SDK serializes error: null on successful mcp_tool_call items, so the failure check (item.error !== undefined) flagged every successful tool call as failed with the empty-payload default "Codex turn failed". This killed every ingest work unit under the codex backend before it could produce a patch. Key on status === 'failed' (authoritative, always set) and only treat a populated error object as a failure. Add a regression test built from a verbatim real-SDK event capture. * fix: default codex backend to gpt-5.5 and report real probe errors The previous default gpt-5.3-codex is an API-key-only model that the OpenAI API rejects under ChatGPT-account (subscription) auth, so codex status/setup failed with a misleading "authentication is not usable" message even though auth was fine. - Default codex model is now gpt-5.5 (works on both subscription and API-key auth); the curated setup picker offers gpt-5.5 / gpt-5.4 / gpt-5.4-mini and keeps free-form entry for account-specific ids (e.g. gpt-5.3-codex-spark). - runCodexAuthProbe now distinguishes "model not available" from an auth failure and surfaces the real API error: collectEvents retains stream events when the SDK throws on a non-zero exit, and the API error JSON envelope is unwrapped to its human-readable message. - The Codex isolation warning now renders inside the clack setup frame. - Docs updated to gpt-5.5 with a note that *-codex ids require API-key auth. * fix: require llm.models.default in status and match codex probe remediation Status reported a project ready when a non-none LLM backend was configured without llm.models.default, but the runtime (resolveModelSlots) hard-requires it, so ingest/scan/memory threw after `ktx status` said the project was usable. buildLlmStatus now fails for any non-none backend missing models.default and no longer invents a fallback model for claude-code/codex. Codex probe failures now carry a category-matched fix: a model-access failure steers the user at llm.models.default instead of the auth/install remediation. runCodexAuthProbe returns the fix and status consumes it; the message stays self-sufficient so setup output is unchanged. Docs: README now lists the codex backend and local Codex auth; ktx-setup.mdx states --llm-model only accepts codex/default or gpt-*/codex-* ids. Repaired four doctor fixtures that configured a backend without models.default (the now-correctly-blocked config) and added coverage for the new behavior.
This commit is contained in:
parent
74c6076b72
commit
494618ab14
41 changed files with 2544 additions and 30 deletions
460
packages/cli/test/context/llm/codex-runtime.test.ts
Normal file
460
packages/cli/test/context/llm/codex-runtime.test.ts
Normal file
|
|
@ -0,0 +1,460 @@
|
|||
import { describe, expect, it, vi } from 'vitest';
|
||||
import { z } from 'zod';
|
||||
import {
|
||||
CodexKtxLlmRuntime,
|
||||
runCodexAuthProbe,
|
||||
} from '../../../src/context/llm/codex-runtime.js';
|
||||
|
||||
async function* events(items: unknown[]) {
|
||||
for (const item of items) {
|
||||
yield item;
|
||||
}
|
||||
}
|
||||
|
||||
function runner(items: unknown[]) {
|
||||
return {
|
||||
runStreamed: vi.fn(async () => events(items)),
|
||||
};
|
||||
}
|
||||
|
||||
/** Yields the given events, then throws — mirroring the SDK throwing on a non-zero codex exec exit. */
|
||||
function throwingRunner(items: unknown[], error: Error) {
|
||||
return {
|
||||
runStreamed: vi.fn(async () =>
|
||||
(async function* () {
|
||||
for (const item of items) {
|
||||
yield item;
|
||||
}
|
||||
throw error;
|
||||
})(),
|
||||
),
|
||||
};
|
||||
}
|
||||
|
||||
const MODEL_UNSUPPORTED_API_ERROR = JSON.stringify({
|
||||
type: 'error',
|
||||
status: 400,
|
||||
error: {
|
||||
type: 'invalid_request_error',
|
||||
message: "The 'gpt-5.3-codex' model is not supported when using Codex with a ChatGPT account.",
|
||||
},
|
||||
});
|
||||
|
||||
function budgetRunner() {
|
||||
let observedSignal: AbortSignal | undefined;
|
||||
return {
|
||||
observedSignal: () => observedSignal,
|
||||
runStreamed: vi.fn(async (input: { signal?: AbortSignal }) => {
|
||||
observedSignal = input.signal;
|
||||
return events([
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'item.started', item: { type: 'mcp_tool_call', server: 'ktx', tool: 'first', status: 'in_progress' } },
|
||||
{ type: 'item.completed', item: { type: 'mcp_tool_call', server: 'ktx', tool: 'first', status: 'completed' } },
|
||||
{ type: 'item.started', item: { type: 'mcp_tool_call', server: 'ktx', tool: 'second', status: 'in_progress' } },
|
||||
{ type: 'item.completed', item: { type: 'mcp_tool_call', server: 'ktx', tool: 'second', status: 'completed' } },
|
||||
{ type: 'turn.completed', usage: { input_tokens: 1, output_tokens: 1 } },
|
||||
]);
|
||||
}),
|
||||
};
|
||||
}
|
||||
|
||||
describe('CodexKtxLlmRuntime', () => {
|
||||
it('generates text with the role-selected model and metrics', async () => {
|
||||
const onMetrics = vi.fn();
|
||||
const fakeRunner = runner([
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'item.completed', item: { type: 'agent_message', text: 'hello' } },
|
||||
{ type: 'turn.completed', usage: { input_tokens: 3, output_tokens: 4, total_tokens: 7 } },
|
||||
]);
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex', triage: 'gpt-5.4' },
|
||||
runner: fakeRunner,
|
||||
});
|
||||
|
||||
await expect(runtime.generateText({ role: 'triage', system: 'system', prompt: 'prompt', onMetrics })).resolves.toBe('hello');
|
||||
expect(fakeRunner.runStreamed).toHaveBeenCalledWith(
|
||||
expect.objectContaining({
|
||||
projectDir: '/tmp/project',
|
||||
model: 'gpt-5.4',
|
||||
prompt: 'system\n\nprompt',
|
||||
}),
|
||||
);
|
||||
expect(onMetrics).toHaveBeenCalledWith(expect.objectContaining({ usage: { inputTokens: 3, outputTokens: 4, totalTokens: 7 } }));
|
||||
});
|
||||
|
||||
it('generates and validates structured output', async () => {
|
||||
const fakeRunner = runner([
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'item.completed', item: { type: 'agent_message', text: '{"answer":"yes"}' } },
|
||||
{ type: 'turn.completed' },
|
||||
]);
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: fakeRunner,
|
||||
});
|
||||
|
||||
await expect(
|
||||
runtime.generateObject({
|
||||
role: 'default',
|
||||
prompt: 'json',
|
||||
schema: z.object({ answer: z.string() }),
|
||||
}),
|
||||
).resolves.toEqual({ answer: 'yes' });
|
||||
expect(fakeRunner.runStreamed).toHaveBeenCalledWith(
|
||||
expect.objectContaining({
|
||||
outputSchema: expect.objectContaining({ type: 'object' }),
|
||||
}),
|
||||
);
|
||||
});
|
||||
|
||||
it('returns a structured-output error when Codex final text is invalid JSON', async () => {
|
||||
const fakeRunner = runner([
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'item.completed', item: { type: 'agent_message', text: 'not json' } },
|
||||
{ type: 'turn.completed' },
|
||||
]);
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: fakeRunner,
|
||||
});
|
||||
|
||||
await expect(
|
||||
runtime.generateObject({
|
||||
role: 'default',
|
||||
prompt: 'json',
|
||||
schema: z.object({ answer: z.string() }),
|
||||
}),
|
||||
).rejects.toThrow('Codex structured output failed validation');
|
||||
});
|
||||
|
||||
it('starts and closes a temporary MCP server for tool-backed agent loops', async () => {
|
||||
const close = vi.fn(async () => undefined);
|
||||
const startMcpServer = vi.fn(async () => ({
|
||||
url: 'http://127.0.0.1:4321/mcp',
|
||||
bearerTokenEnvVar: 'KTX_CODEX_RUNTIME_MCP_TOKEN' as const,
|
||||
bearerToken: 'token',
|
||||
close,
|
||||
}));
|
||||
const fakeRunner = runner([
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'item.started', item: { type: 'mcp_tool_call', name: 'wiki_search' } },
|
||||
{ type: 'item.completed', item: { type: 'agent_message', text: 'done' } },
|
||||
{ type: 'turn.completed', usage: { input_tokens: 1, output_tokens: 1, total_tokens: 2 } },
|
||||
]);
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: fakeRunner,
|
||||
startMcpServer,
|
||||
});
|
||||
const onStepFinish = vi.fn();
|
||||
|
||||
const result = await runtime.runAgentLoop({
|
||||
modelRole: 'default',
|
||||
systemPrompt: 'system',
|
||||
userPrompt: 'user',
|
||||
stepBudget: 5,
|
||||
telemetryTags: {},
|
||||
onStepFinish,
|
||||
toolSet: {
|
||||
aliased_wiki_tool: {
|
||||
name: 'wiki_search',
|
||||
description: 'Search wiki',
|
||||
inputSchema: z.object({ query: z.string() }),
|
||||
execute: vi.fn(),
|
||||
},
|
||||
},
|
||||
});
|
||||
|
||||
expect(result.stopReason).toBe('natural');
|
||||
expect(result.metrics).toMatchObject({ stepCount: 1, usage: { inputTokens: 1, outputTokens: 1, totalTokens: 2 } });
|
||||
expect(onStepFinish).toHaveBeenCalledWith({ stepIndex: 1, stepBudget: 5 });
|
||||
expect(startMcpServer).toHaveBeenCalledWith({ projectDir: '/tmp/project', toolSet: expect.any(Object) });
|
||||
expect(fakeRunner.runStreamed).toHaveBeenCalledWith(
|
||||
expect.objectContaining({
|
||||
env: { KTX_CODEX_RUNTIME_MCP_TOKEN: 'token' },
|
||||
configOverrides: expect.objectContaining({
|
||||
mcp_servers: expect.objectContaining({
|
||||
ktx: expect.objectContaining({
|
||||
url: 'http://127.0.0.1:4321/mcp',
|
||||
enabled_tools: ['wiki_search'],
|
||||
required: true,
|
||||
}),
|
||||
}),
|
||||
}),
|
||||
}),
|
||||
);
|
||||
expect(close).toHaveBeenCalled();
|
||||
});
|
||||
|
||||
it('returns error stop reason on turn failure', async () => {
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: runner([{ type: 'turn.failed', error: { message: 'boom' } }]),
|
||||
});
|
||||
|
||||
const result = await runtime.runAgentLoop({
|
||||
modelRole: 'default',
|
||||
systemPrompt: 'system',
|
||||
userPrompt: 'user',
|
||||
stepBudget: 5,
|
||||
telemetryTags: {},
|
||||
toolSet: {},
|
||||
});
|
||||
|
||||
expect(result.stopReason).toBe('error');
|
||||
expect(result.error?.message).toBe('boom');
|
||||
});
|
||||
|
||||
it('surfaces failed MCP tool calls as agent-loop errors', async () => {
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: runner([
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'item.started', item: { type: 'mcp_tool_call', server: 'ktx', tool: 'search', status: 'in_progress' } },
|
||||
{
|
||||
type: 'item.completed',
|
||||
item: {
|
||||
type: 'mcp_tool_call',
|
||||
server: 'ktx',
|
||||
tool: 'search',
|
||||
status: 'failed',
|
||||
error: { message: 'denied' },
|
||||
},
|
||||
},
|
||||
{ type: 'turn.completed', usage: { input_tokens: 1, output_tokens: 1 } },
|
||||
]),
|
||||
});
|
||||
|
||||
const result = await runtime.runAgentLoop({
|
||||
modelRole: 'default',
|
||||
systemPrompt: 'system',
|
||||
userPrompt: 'user',
|
||||
stepBudget: 5,
|
||||
telemetryTags: {},
|
||||
toolSet: {},
|
||||
});
|
||||
|
||||
expect(result.stopReason).toBe('error');
|
||||
expect(result.error?.message).toBe('Codex runtime tool call failed: search: denied');
|
||||
expect(result.metrics).toMatchObject({
|
||||
stepCount: 1,
|
||||
usage: { inputTokens: 1, outputTokens: 1, totalTokens: 2 },
|
||||
});
|
||||
});
|
||||
|
||||
it('returns budget and aborts the Codex stream when local MCP step budget is reached', async () => {
|
||||
const fakeRunner = budgetRunner();
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: fakeRunner,
|
||||
});
|
||||
const onStepFinish = vi.fn();
|
||||
|
||||
const result = await runtime.runAgentLoop({
|
||||
modelRole: 'default',
|
||||
systemPrompt: 'system',
|
||||
userPrompt: 'user',
|
||||
stepBudget: 1,
|
||||
telemetryTags: {},
|
||||
onStepFinish,
|
||||
toolSet: {
|
||||
first: {
|
||||
name: 'first',
|
||||
description: 'First tool',
|
||||
inputSchema: z.object({}),
|
||||
execute: vi.fn(),
|
||||
},
|
||||
},
|
||||
});
|
||||
|
||||
expect(result.stopReason).toBe('budget');
|
||||
expect(result.error).toBeUndefined();
|
||||
expect(result.metrics).toMatchObject({ stepCount: 1 });
|
||||
expect(onStepFinish).toHaveBeenCalledTimes(1);
|
||||
expect(onStepFinish).toHaveBeenCalledWith({ stepIndex: 1, stepBudget: 1 });
|
||||
expect(fakeRunner.observedSignal()?.aborted).toBe(true);
|
||||
});
|
||||
|
||||
it('counts built-in command_execution steps against the budget and aborts the stream', async () => {
|
||||
let observedSignal: AbortSignal | undefined;
|
||||
const fakeRunner = {
|
||||
observedSignal: () => observedSignal,
|
||||
runStreamed: vi.fn(async (input: { signal?: AbortSignal }) => {
|
||||
observedSignal = input.signal;
|
||||
return events([
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'item.started', item: { type: 'command_execution', command: 'ls', status: 'in_progress' } },
|
||||
{ type: 'item.completed', item: { type: 'command_execution', command: 'ls', status: 'completed', exit_code: 0 } },
|
||||
{ type: 'item.started', item: { type: 'command_execution', command: 'cat a', status: 'in_progress' } },
|
||||
{ type: 'item.completed', item: { type: 'command_execution', command: 'cat a', status: 'completed', exit_code: 0 } },
|
||||
{ type: 'item.completed', item: { type: 'command_execution', command: 'cat b', status: 'completed', exit_code: 0 } },
|
||||
{ type: 'turn.completed', usage: { input_tokens: 1, output_tokens: 1 } },
|
||||
]);
|
||||
}),
|
||||
};
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: fakeRunner,
|
||||
});
|
||||
const onStepFinish = vi.fn();
|
||||
|
||||
const result = await runtime.runAgentLoop({
|
||||
modelRole: 'default',
|
||||
systemPrompt: 'system',
|
||||
userPrompt: 'user',
|
||||
stepBudget: 2,
|
||||
telemetryTags: {},
|
||||
onStepFinish,
|
||||
toolSet: {},
|
||||
});
|
||||
|
||||
expect(result.stopReason).toBe('budget');
|
||||
expect(result.error).toBeUndefined();
|
||||
expect(result.metrics).toMatchObject({ stepCount: 2 });
|
||||
expect(onStepFinish).toHaveBeenCalledTimes(2);
|
||||
expect(onStepFinish).toHaveBeenLastCalledWith({ stepIndex: 2, stepBudget: 2 });
|
||||
expect(fakeRunner.observedSignal()?.aborted).toBe(true);
|
||||
});
|
||||
|
||||
it('fires onStepFinish live as each step completes, before the stream drains', async () => {
|
||||
const order: string[] = [];
|
||||
async function* liveEvents() {
|
||||
yield { type: 'turn.started' };
|
||||
yield { type: 'item.completed', item: { type: 'mcp_tool_call', server: 'ktx', tool: 'a', status: 'completed' } };
|
||||
order.push('yielded-after-step-1');
|
||||
yield { type: 'item.completed', item: { type: 'mcp_tool_call', server: 'ktx', tool: 'b', status: 'completed' } };
|
||||
order.push('yielded-after-step-2');
|
||||
yield { type: 'item.completed', item: { type: 'agent_message', text: 'done' } };
|
||||
yield { type: 'turn.completed', usage: { input_tokens: 1, output_tokens: 1 } };
|
||||
}
|
||||
const fakeRunner = { runStreamed: vi.fn(async () => liveEvents()) };
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: fakeRunner,
|
||||
});
|
||||
|
||||
const result = await runtime.runAgentLoop({
|
||||
modelRole: 'default',
|
||||
systemPrompt: 'system',
|
||||
userPrompt: 'user',
|
||||
stepBudget: 10,
|
||||
telemetryTags: {},
|
||||
onStepFinish: ({ stepIndex }) => {
|
||||
order.push(`step-${stepIndex}`);
|
||||
},
|
||||
toolSet: {},
|
||||
});
|
||||
|
||||
expect(result.stopReason).toBe('natural');
|
||||
expect(result.metrics).toMatchObject({ stepCount: 2 });
|
||||
expect(order).toEqual(['step-1', 'yielded-after-step-1', 'step-2', 'yielded-after-step-2']);
|
||||
});
|
||||
|
||||
it('surfaces the real Codex error event even when the SDK stream throws afterward', async () => {
|
||||
// The SDK yields the error/turn.failed events on stdout, then throws on the
|
||||
// non-zero exit. The masked exit message must not hide the real API error.
|
||||
const fakeRunner = throwingRunner(
|
||||
[
|
||||
{ type: 'thread.started', thread_id: 't' },
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'error', message: MODEL_UNSUPPORTED_API_ERROR },
|
||||
{ type: 'turn.failed', error: { message: MODEL_UNSUPPORTED_API_ERROR } },
|
||||
],
|
||||
new Error('Codex Exec exited with code 1: Reading prompt from stdin...'),
|
||||
);
|
||||
const runtime = new CodexKtxLlmRuntime({
|
||||
projectDir: '/tmp/project',
|
||||
modelSlots: { default: 'codex' },
|
||||
runner: fakeRunner,
|
||||
});
|
||||
|
||||
await expect(runtime.generateText({ role: 'default', prompt: 'hi' })).rejects.toThrow(
|
||||
'not supported when using Codex with a ChatGPT account',
|
||||
);
|
||||
});
|
||||
|
||||
it('probes Codex authentication through a minimal non-interactive turn', async () => {
|
||||
const fakeRunner = runner([
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'item.completed', item: { type: 'agent_message', text: 'ok' } },
|
||||
{ type: 'turn.completed' },
|
||||
]);
|
||||
|
||||
await expect(
|
||||
runCodexAuthProbe({
|
||||
projectDir: '/tmp/project',
|
||||
model: 'codex',
|
||||
runner: fakeRunner,
|
||||
}),
|
||||
).resolves.toEqual({ ok: true });
|
||||
});
|
||||
|
||||
it('reports an unavailable model without blaming auth when Codex rejects the model', async () => {
|
||||
const fakeRunner = throwingRunner(
|
||||
[
|
||||
{ type: 'turn.started' },
|
||||
{ type: 'turn.failed', error: { message: MODEL_UNSUPPORTED_API_ERROR } },
|
||||
],
|
||||
new Error('Codex Exec exited with code 1: Reading prompt from stdin...'),
|
||||
);
|
||||
|
||||
const result = await runCodexAuthProbe({
|
||||
projectDir: '/tmp/project',
|
||||
model: 'gpt-5.3-codex',
|
||||
runner: fakeRunner,
|
||||
});
|
||||
|
||||
expect(result.ok).toBe(false);
|
||||
if (!result.ok) {
|
||||
expect(result.message).not.toContain('authentication is not usable');
|
||||
expect(result.message).toContain('not available');
|
||||
expect(result.message).toContain('gpt-5.3-codex');
|
||||
expect(result.message).toContain('not supported when using Codex with a ChatGPT account');
|
||||
// A model-access failure must steer the user at the model config, not auth.
|
||||
expect(result.fix).toContain('llm.models.default');
|
||||
expect(result.fix).not.toContain('Authenticate Codex');
|
||||
}
|
||||
});
|
||||
|
||||
it('reports an auth failure when Codex exits without an error event', async () => {
|
||||
const fakeRunner = throwingRunner(
|
||||
[],
|
||||
new Error('Codex Exec exited with code 1: Not logged in. Run `codex login`.'),
|
||||
);
|
||||
|
||||
const result = await runCodexAuthProbe({
|
||||
projectDir: '/tmp/project',
|
||||
model: 'gpt-5.5',
|
||||
runner: fakeRunner,
|
||||
});
|
||||
|
||||
expect(result.ok).toBe(false);
|
||||
if (!result.ok) {
|
||||
expect(result.message).toContain('authentication is not usable');
|
||||
expect(result.message).toContain('Not logged in');
|
||||
expect(result.fix).toContain('Authenticate Codex');
|
||||
}
|
||||
});
|
||||
|
||||
it('rejects an unsupported model id before probing, steering at llm.models.default', async () => {
|
||||
const result = await runCodexAuthProbe({
|
||||
projectDir: '/tmp/project',
|
||||
model: 'not-a-real-model',
|
||||
});
|
||||
|
||||
expect(result.ok).toBe(false);
|
||||
if (!result.ok) {
|
||||
expect(result.message).toContain('Unsupported Codex model');
|
||||
expect(result.fix).toContain('llm.models.default');
|
||||
}
|
||||
});
|
||||
});
|
||||
Loading…
Add table
Add a link
Reference in a new issue