docs(automation): narrow v1 data model + Phase 1 scope

§9 (Data model): drop from six tables to three. v1 ships automations, automation_triggers, automation_runs only. domain_events deferred to Phase 3 (event trigger); mcp_connections/mcp_tools deferred to Phase 4 (MCP integration). Remove the table definitions for the deferred ones and replace with a deferred-tables note pointing to the consuming phase. automation_triggers.type enum narrowed to schedule|manual for v1. Webhook and event types ship with their respective phases. secret_hash column deferred to Phase 2 alongside the webhook trigger. automation_runs.cost_usd column deferred until at least one v1 capability records token-level cost — additive when reintroduced. §14 (Phase 1) reorganized into four explicit steps matching the work we're about to do: scaffolding + schemas + empty registries (step 1), then registry population (step 2), then executor (step 3), then NL authoring + UI (step 4). The current commit batch lands step 1 only.
2026-05-29 19:35:20 +02:00 · 2026-05-26 22:37:05 +02:00 · 2026-05-26 22:37:05 +02:00 · db8c472664
commit db8c472664
parent 144d702c35
1 changed files with 59 additions and 67 deletions
--- a/automation-design-plan.md
+++ b/automation-design-plan.md
@ -801,12 +801,20 @@ mechanical, additive, and free of design rewrite.
 ## 9. Data model
-Six tables. All scoped by `search_space_id` for RBAC.
+**v1 ships three tables:** `automations`, `automation_triggers`,
 `automation_runs`. All scoped by `search_space_id` for RBAC.
-The first four (`automations`, `automation_triggers`, `automation_runs`,
+The other three tables described in earlier drafts are deferred:
-`domain_events`) are the engine's own state. The last two
+
-(`mcp_connections`, `mcp_tools`) hold the durable knowledge that backs
+- `domain_events` → **deferred to Phase 3** (introduced with the event
-MCP-derived capabilities — see §3 for the lifecycle rationale.
+  trigger).
 - `mcp_connections`, `mcp_tools` → **deferred to Phase 4** (MCP
  integration).
 The deferred tables ship as-is when their consuming feature lands;
 nothing in the v1 schema needs to change to accommodate them. The three
 v1 tables form the engine's persistent state — definitions, triggers,
 and an immutable run history.
 ### `automations`
@ -828,12 +836,14 @@ MCP-derived capabilities — see §3 for the lifecycle rationale.
 | --------------- | ----------------------------------------------------------------------------- | ------------------------------------------- |
 | `id`            | int PK                                                                        |                                             |
 | `automation_id` | FK                                                                            |                                             |
-| `type`          | enum: `schedule`, `webhook`, `event`                                          |                                             |
+| `type`          | enum: `schedule`, `manual` (Phase 2/3 add `webhook`, `event`)                  |                                             |
 | `config`        | jsonb                                                                         | validated against trigger's `config_schema` |
 | `enabled`       | bool                                                                          |                                             |
 | `secret_hash`   | str / null                                                                    | for webhook bearer tokens                   |
 | `last_fired_at` | timestamp                                                                     |                                             |
 `secret_hash` (for webhook bearer tokens) is **deferred to Phase 2** with
 the webhook trigger.
 ### `automation_runs`
 | field             | type                                                                         | notes                                              |
@ -849,61 +859,25 @@ MCP-derived capabilities — see §3 for the lifecycle rationale.
 | `output`          | jsonb / null                                                                 |                                                    |
 | `artifacts`       | jsonb                                                                        | references to created artifacts                    |
 | `error`           | jsonb / null                                                                 |                                                    |
 | `cost_usd`        | decimal                                                                      | accumulated cost                                   |
 | `started_at` / `finished_at` | timestamps                                                        |                                                    |
 | `agent_session_id`| str / null                                                                   | link to LangGraph trace if agent_task was used     |
-### `domain_events`
+`cost_usd` (per-run accumulated cost) is **deferred** until at least one
 v1 capability records token-level cost. When reintroduced it lands as a
 column-only migration.
-| field             | type        | notes                                              |
+### Deferred tables
 | ----------------- | ----------- | -------------------------------------------------- |
 | `id`              | UUID PK     |                                                    |
 | `search_space_id` | FK          | scoping                                            |
 | `event_type`      | varchar     | e.g. `drive.file_added`, `automation.run.succeeded` |
 | `source_id`       | varchar     | which connector/automation/etc. produced it        |
 | `payload`         | jsonb       | matches the event type's documented schema         |
 | `created_at`      | timestamp   |                                                    |
 | `consumed_by`     | jsonb       | array of consumer_ids, for tracking + replay       |
 | `expires_at`     | timestamp   | auto-cleanup after 7 days                          |
-### `mcp_connections`
+- **`domain_events`** — the event bus backing event triggers. Ships in
  Phase 3 with the event trigger. v1 only emits `automation.run.*`
  events into application logs; the table is added when at least one
  consumer needs to subscribe to them.
 - **`mcp_connections`** / **`mcp_tools`** — see §3. Both ship in Phase 4
  alongside the MCP harvester and the two-tier registry.
-Persistent record of MCP server connections per SearchSpace.
+NL drafts are **not** a core table. They live in a generic short-TTL
-
+store (Redis or a transient table) when the NL flow is built in
-| field               | type        | notes                                              |
+Phase 3.
 | ------------------- | ----------- | -------------------------------------------------- |
 | `id`                | UUID PK     |                                                    |
 | `search_space_id`   | FK          | scoping                                            |
 | `server_url`        | text        | the MCP server's endpoint                          |
 | `transport`         | text        | `"http"`, `"stdio"`, etc.                          |
 | `name`              | text        | human-readable label (e.g., "Slack — Acme")        |
 | `access_token`      | bytea       | encrypted at rest                                  |
 | `refresh_token`     | bytea       | encrypted at rest                                  |
 | `expires_at`        | timestamp   | for OAuth tokens                                   |
 | `last_harvested_at` | timestamp   | when tool list was last refreshed                  |
 | `created_at`        | timestamp   |                                                    |
 | `created_by`        | FK → users  |                                                    |
 ### `mcp_tools`
 The tool list each connected MCP server exposes. Acts as the durable
 source for MCP capabilities — definitions reference `mcp_tools` rows by
 qualified name, and worker processes lazily build handler closures from
 this state.
 | field           | type        | notes                                            |
 | --------------- | ----------- | ------------------------------------------------ |
 | `id`            | UUID PK     |                                                  |
 | `connection_id` | FK → `mcp_connections.id` ON DELETE CASCADE | |
 | `name`          | text        | the tool name reported by the MCP server         |
 | `description`   | text        | description for the NL generator and form editor |
 | `input_schema`  | jsonb       | JSON Schema for tool arguments                   |
 | `output_schema` | jsonb       | JSON Schema for tool results                     |
 | `side_effects`  | text[]      | inferred from MCP hints + naming + admin override |
 | UNIQUE          |             | (connection_id, name)                            |
 NL drafts are **not** a core table. They live in a generic short-TTL store
 (Redis or a transient table) when the NL flow is built in Phase 3.
 ---
@ -1092,21 +1066,39 @@ which actions and triggers are available, not whether users can describe
 automations in natural language.
 ### Phase 1 — Engine MVP with NL authoring
- 4 tables + Alembic migration
+
- Capability registry with native capabilities (`search_space.query`,
+**Step 1 (current scope, this batch of commits):**
-  `search_space.fetch_document`, `agent.run`)
+- 3 tables (`automations`, `automation_triggers`, `automation_runs`) +
- `agent_task` action only
+  Alembic migration
- `schedule` trigger + manual "Run now" endpoint
+- Empty Capability, Action, Trigger registries (concrete entries land in
- Executor with retries, timeouts, budget caps
+  later steps when the consuming feature lands)
- Template engine (Jinja sandbox + 15 filters + 4 runtime limits)
+- Pydantic schemas for the automation definition envelope, the two v1
- **NL authoring flow**: Generator LLM, deterministic validator,
+  trigger configs (`schedule`, `manual`), and the one v1 action config
-  Review LLM, editable form
+  (`agent_task`)
 - Module structure under `app/automations/` (data/, schemas/,
  registries/), fully isolated from the existing codebase
 **Step 2:**
 - Register the `agent_task` action and the `schedule` / `manual`
  triggers in the registries
 - Capability registry populated with native deliverable-producing
  capabilities (chosen when this step starts)
 **Step 3:**
 - Executor (single-queue Celery task) with retries, timeouts, budget
  caps measured against `cost_usd` ledger on the run
 - Template engine (Jinja sandbox + the v1 filter allowlist + runtime
  limits)
 - Manual "Run now" endpoint
 **Step 4:**
 - NL authoring flow: Generator LLM, deterministic validator, Review LLM,
  editable form
 - Run history UI with Electric SQL streaming
 **After Phase 1**: a user can describe an automation in natural language,
 review the proposal (with summary + flagged anomalies), edit any field,
-save, and watch it run on a schedule. The Claude Routines value
+save, and watch it run on a schedule.
 proposition, on SurfSense's data, with NL-first authoring.
 ### Phase 2 — Webhooks and delivery
 - `webhook` trigger with per-automation bearer tokens