SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-07-14 22:52:15 +02:00

Author	SHA1	Message	Date
CREDO23	353755fd73	test(automations): cross-cutting registries, enums, side-effects + shared fixtures Top-level tests that span multiple submodules: - test_stores.py (7): the trigger + action registry contracts — register round-trip, unknown type → None (not raise), duplicate registration rejected, defensive snapshot from all_*. - test_definition_types.py (2): params_schema property on both ActionDefinition and TriggerDefinition reflects the Pydantic model. - test_persistence_enums.py (3): exact string values + member sets of AutomationStatus / RunStatus / TriggerType — the postgres-mirrored contract that breaks stored rows if drifted. - test_import_registrations.py (2): the bundled agent_task action and schedule trigger self-register on package import (canary for the side-effect import chain). conftest.py adds isolated_action_registry / isolated_trigger_registry fixtures: snapshot + restore of the module-level _REGISTRY dicts so tests that add their own definitions don't leak across the suite. 14 tests, pure unit.	2026-05-28 19:03:55 +02:00
CREDO23	822940b09e	test(automations/schemas): lock definition + api validation gates definition/ (29 tests): the envelope (defaults, extra=forbid, empty plan/name rejection), Inputs schema-alias roundtrip (Python schema_ ↔ wire schema), PlanStep numeric bounds + addressing-field constraints, Execution production defaults stability (10-min timeout, 2 retries, exponential backoff, drop_if_running) + closed-set Literal gates, Metadata's exceptional extra="allow" contract, and TriggerSpec type requirement. api/ (9 tests): AutomationCreate/Update cascade-validate into the nested definition, reject unknown payload fields, enforce name length; TriggerCreate exposes safe defaults (enabled=True, params={}, static_inputs={}) and rejects unknown TriggerType strings at the boundary. All pure unit, no DB.	2026-05-28 19:03:42 +02:00
CREDO23	acbeb60a43	test(automations/actions): lock agent_task helpers (auto_decide + finalize) auto_decide.build_auto_decisions (3): produces one decision per action_request entry, defaults to one decision for legacy scalar interrupts, and skips malformed interrupts silently so a misbehaving tool can't take down the whole agent_task step. finalize.extract_final_assistant_message (4): string-content AIMessage returned verbatim, list-of-parts content concatenated (skipping non-text parts like tool_use), walks back past trailing ToolMessages to find the last AIMessage, and returns None when no extractable text is present (so callers can branch on silence vs. empty). 7 tests, pure unit.	2026-05-28 19:03:29 +02:00
CREDO23	db4eef651f	test(automations/templating): lock render, filters, environment, context render.py (4): variable substitution, StrictUndefined raises on missing keys, evaluate_predicate coerces to bool, render_value walks dicts/lists and renders string leaves. filters.py (4): slugify produces URL-safe output, date formats datetime with strftime, date(None) → "" so templates can write {{ inputs.last_fired_at | date }} on first run, date(str) passes through. environment.py (4): the sandbox boundary — disallowed Jinja built-ins (e.g. pprint) raise, and the finalize hook coerces non-string outputs to predictable wire shapes (datetime → ISO, None → "", dict → JSON). context.py (1): build_run_context exposes {run, inputs, steps} with the exact shape every plan template body relies on. 13 tests total, all pure unit.	2026-05-28 19:03:22 +02:00
CREDO23	49af95b652	test(automations/runtime): lock execute_step + with_retries execute_step (6 tests): happy path, when=falsy → skipped, unknown action → ActionNotFound failure, retry budget exhaustion (attempts = 1 + max_retries), retry recovery, and template-rendering of step params against the run context. with_retries (3 tests): first-try success returns attempts=1, recovery returns the actual attempt that produced the result, and exhaustion re-raises the last exception with the handler called 1 + max_retries times. All tests use backoff="none" to keep wall-clock time zero; timeout testing is intentionally skipped (would need >= 1s per the int contract, and exhaustion already locks that any Exception triggers retry).	2026-05-28 19:03:08 +02:00
CREDO23	18b4800e49	test(automations/dispatch): lock _validate_inputs + DispatchError Cover the input-validation contract dispatch_run relies on: - no declared schema → inputs pass through unchanged (regression site that previously stripped runtime keys like fired_at / last_fired_at and broke Jinja templates). - declared schema, valid inputs → passthrough validated. - declared schema, invalid inputs → DispatchError (uniform exception type, not raw jsonschema.ValidationError). Plus the DispatchError exception identity (Exception subclass, message preserved, isinstance-friendly for the dispatch layer's consumers). 4 tests, pure unit.	2026-05-28 19:03:00 +02:00
CREDO23	2a76f43387	test(automations/triggers): lock schedule cron + params Cover the cron + IANA timezone + UTC normalization contract for the schedule trigger: next-match strictly-after, DST offset shift across spring-forward, malformed cron / unknown timezone rejection, and the ScheduleTriggerParams Pydantic gate that surfaces InvalidCronError as ValidationError at the API boundary. 8 tests, pure unit (no DB, no mocks).	2026-05-28 19:02:52 +02:00
CREDO23	91962ba879	fix automation run inputs, hitl routing, and detail UI polish	2026-05-28 02:48:47 +02:00
CREDO23	2e572d7818	feat(web): create_automation HITL approval card in chat Closes the create loop in chat: the agent describes user intent → the drafter sub-LLM produces an AutomationCreate JSON → this card surfaces a structured preview → approve persists; reject cancels. Edits flow through chat refinement (re-call with a refined intent), not in-card, so the card stays simple and the multi-turn checkpointer carries the context. Tool UI (components/tool-ui/automation/): - create-automation.tsx — entry dispatcher + ApprovalCard chrome (pending/processing/complete/rejected via useHitlPhase) + SavedCard (links to the detail page) + InvalidCard (lists drafter validation issues) + ErrorCard (verbatim message). Rejection result is hidden because the approval card itself shows the rejected phase inline. - automation-draft-preview.tsx — structured preview body: name + description + goal, triggers (humanised cron + tz + static-input keys), plan steps (step_id → action), and a collapsible raw JSON for power users. Wiring: - components/tool-ui/index.ts — re-export. - features/chat-messages/timeline/tool-registry/registry.ts — register create_automation → CreateAutomationToolUI (dynamic import, same pattern as other connector tools). - contracts/enums/toolIcons.tsx — Workflow icon + "Create automation" display name so fallback chrome (and timeline headers) are honest. Shared util: - lib/automations/describe-cron.ts — lifted from the route slice's lib/ folder since both the dashboard slice and the new approval card now render schedule descriptions. Slice imports updated; the now-empty slice lib/ folder is gone. Backend prompt fragments: - main_agent/system_prompt/.../create_automation/description.md and the tool's docstring no longer promise in-card edits. They make the refinement path explicit: if the user wants changes after seeing the draft, they reply in chat and the agent calls the tool again with a refined intent. v1 deliberately excludes: - In-card edit form / right-side edit panel — defer until we see real demand. The chat refinement loop covers the common case. - approve_always / persistent allow rules — automations are a single artifact, not a repeated mutation, so the "trust this kind of call" affordance doesn't apply.	2026-05-28 01:32:04 +02:00
CREDO23	79f0218360	rbac: surface automations permissions in the UI Backend already defined automations:create/read/update/delete/execute and seeded them on Owner/Editor/Viewer roles, but the Settings → Roles UI was missing the metadata to render them properly. - backend: add PERMISSION_DESCRIPTIONS entries for the 5 automations perms so the role editor stops falling back to "Permission for automations:create". - frontend: add automations to CATEGORY_CONFIG (Workflow icon, slotted between podcasts and connectors) so the role editor groups them as a real section. - frontend: extend the three ROLE_PRESETS — Editor and Contributor get create/read/update/execute (mirroring backend Editor); Viewer gets read. Prep work for the automations frontend; canPerform/usePermissionGate already handle the runtime gating, so no new hook is needed.	2026-05-28 00:30:40 +02:00
CREDO23	2b7d91aa03	feat(automations): add create_automation HITL tool (NL → draft → approve → save) Single tool exposed to the main agent. The main agent passes a natural-language `intent`; a focused drafter sub-LLM turns it into a full AutomationCreate JSON; that JSON is surfaced via request_approval (action_type "automation_create") so the user can edit/approve it on a frontend card; on approval the tool persists via AutomationService. Three phases, one tool call. Scope split: - main agent sees only `intent: str` (no schema knowledge leaks into the calling graph) — prompt fragments scoped accordingly. - drafter sub-LLM owns the schema + few-shot intent→JSON examples — lives in the generating graph's prompt (tools/automation/prompt.py). Files: - main_agent/tools/automation/{create.py, prompt.py, __init__.py}: new tool + drafter system prompt with two few-shot intent→JSON examples. - system_prompt/prompts/tools/create_automation/{description.md, example.md}: intent-only guidance for the main agent. - main_agent/tools/index.py: add create_automation to the main-agent allowlist. - new_chat/tools/registry.py: deferred-import factory to break the multi_agent_chat ↔ registry cycle; one ToolDefinition entry.	2026-05-28 00:12:02 +02:00
CREDO23	c0232fdcfe	refactor(automations): park manual trigger pending Run-now redesign Manual-as-a-standalone-trigger conflates "user clicks Run now" with the trigger model and forces ad-hoc input plumbing on the caller. Remove the unreachable surface so the tree reflects reality (schedule is the only v1 trigger). - Unregister `manual`: drop import from triggers/__init__.py - Delete `app/automations/triggers/manual/` - Drop `RunService.dispatch_manual` (RunService is now read-only) - Drop `POST /automations/{id}/run` and `RunDispatched` schema - Keep `TriggerType.MANUAL` Python + PG enum value (reserved, documented) to avoid an Alembic round-trip when Run-now is redesigned	2026-05-27 22:29:51 +02:00
CREDO23	8fb65d7188	fix(automations): use enum values not names for postgres enum columns	2026-05-27 21:53:07 +02:00
CREDO23	27ab367a13	feat(automations): static_inputs on triggers + vertical-slice api/services	2026-05-27 21:21:43 +02:00
CREDO23	84d99f19a2	automations(api): API request/response schemas	2026-05-27 19:10:20 +02:00
CREDO23	dd6bc30f98	move automations api into vertical slice with service layer	2026-05-27 18:56:16 +02:00
CREDO23	d84240a630	add schedule tick task and beat entry	2026-05-27 17:56:07 +02:00
CREDO23	3b1d7c4389	add cron-based schedule trigger	2026-05-27 17:56:02 +02:00
CREDO23	f08b316441	add next_fire_at to automation_triggers and croniter dep	2026-05-27 17:55:58 +02:00
CREDO23	861b91004d	refactor(automations): extract dispatch_run; move manual adapter under triggers/manual/dispatch.py	2026-05-27 17:20:23 +02:00
CREDO23	8c32455818	refactor(automations): vertical-slice actions and triggers by domain	2026-05-27 17:07:20 +02:00
CREDO23	ce45e11009	feat(automations): wire agent_task to multi_agent_chat with auto-approve loop	2026-05-27 17:02:44 +02:00
CREDO23	7ec3468113	refactor(automations): bind action handlers via ActionContext factory	2026-05-27 16:29:32 +02:00
CREDO23	f646b5cbab	feat(rbac): backfill automations permissions on existing roles	2026-05-27 15:37:25 +02:00
CREDO23	cfbe2a7fe0	feat(automations): expose POST /automations/{id}/run	2026-05-27 15:30:45 +02:00
CREDO23	3bb02d8889	feat(automations): add manual dispatch service	2026-05-27 15:30:41 +02:00
CREDO23	1366c8a711	feat(rbac): add automations permission family	2026-05-27 15:30:34 +02:00
CREDO23	b26bf0bbcf	feat(automation): register automation run celery task	2026-05-27 15:02:36 +02:00
CREDO23	273b98f350	feat(automation): expose runtime package surface	2026-05-27 15:02:36 +02:00
CREDO23	d3cda12191	feat(automation): add automation run executor	2026-05-27 15:02:36 +02:00
CREDO23	0a329e5a69	feat(automation): add per-step execution	2026-05-27 15:02:36 +02:00
CREDO23	f71a02db2f	feat(automation): add automation run repository	2026-05-27 15:02:36 +02:00
CREDO23	924a82c0b1	feat(automation): add retry policy helper	2026-05-27 15:02:36 +02:00
CREDO23	8b87d179e9	feat(automation): add recursive render_value to templating	2026-05-27 15:02:36 +02:00
CREDO23	cb42b3a84f	feat(automation): add template run context builder	2026-05-27 14:23:18 +02:00
CREDO23	de6da1b775	feat(automation): add template render and predicate evaluation	2026-05-27 14:23:17 +02:00
CREDO23	8345e79f6d	feat(automation): add sandboxed template environment	2026-05-27 14:23:17 +02:00
CREDO23	08e94ac5ca	feat(automation): add custom template filters	2026-05-27 14:23:17 +02:00
CREDO23	b4e5bf95a4	feat(automation): add template filter and test allowlist	2026-05-27 14:23:17 +02:00
CREDO23	99fd1a1338	feat(automation): register agent_task action and schedule/manual triggers	2026-05-27 13:58:57 +02:00
CREDO23	56b3e1bfc4	refactor(automation): drop Block suffix from definition components	2026-05-27 13:48:41 +02:00
CREDO23	7f4c1c25ab	feat(automation): wire SQLAlchemy relationships on both sides	2026-05-27 13:45:32 +02:00
CREDO23	7ac99b89a0	refactor(automation): drop Capability registry	2026-05-27 13:29:30 +02:00
CREDO23	9fa35f21cf	refactor(automation): rename schema config to params, drop dead fields	2026-05-27 13:29:26 +02:00
CREDO23	c8a89ccac8	refactor(automation): rename trigger model config to params	2026-05-27 13:29:22 +02:00
CREDO23	fe32cd35ed	refactor(automation): rename trigger config column to params	2026-05-27 13:29:18 +02:00
CREDO23	a4fbfd8c0d	chore(automation): tighten run.py + envelope.py docstrings Re-apply the trim style after the prior refactor commit re-introduced a multi-line docstring on AutomationRun. - AutomationRun: drop the four-line docstring explaining where per-step session ids live; move the note to a single-line inline comment right above ``step_results`` where it's actionable. - AutomationDefinition: drop the design-plan cross-reference; the module docstring already establishes what the file is. No behaviour change.	2026-05-27 11:45:04 +02:00
CREDO23	35117a952d	refactor(automation): drop agent_session_id from AutomationRun A run can contain zero, one, or N agent_task steps. A single agent_session_id at the run level holds at most one of them, so the column is the wrong shape for the data. Per-step session ids (LangGraph thread/checkpoint reference for an agent_task step) live inside step_results[i] alongside the rest of the per-step bag (status, timings, output). Each agent step records its own; non-agent steps record nothing. Run-level "primary session" is a UI concern, not a schema concern. Trade-off: trace -> run reverse lookup is now a JSONB query, not an index hit. Usually traversal goes run -> trace; if the reverse becomes hot we add a GIN index on step_results or a generated column — both additive. Changes: - AutomationRun: drop the agent_session_id column; module docstring notes where per-step session ids now live. - Migration 144: drop the column from the CREATE TABLE; downgrade unchanged. Safe to edit migration 144 in place (vs. add 145 with ALTER ... DROP): this branch has not shipped and the table has never existed in any deployed database.	2026-05-27 11:41:32 +02:00
CREDO23	f0e00bd3ee	chore(automation): trim docstrings to intent only Cut the docstrings and Field(description=...) text across the entire automations/ tree down to single-line intent statements, matching the multi_agent_chat conciseness style: - Module docstrings: one line stating what the file is. - Class docstrings: deleted when the class name + module docstring already cover intent; kept only where they add a constraint or rationale not visible in the signature. - Pydantic Field descriptions: short noun phrases / clauses, not full sentences. Reasoning that belonged in the design plan moved out of the code. - Enum values: per-value docstrings replaced with terse inline comments where the meaning isn't obvious from the name. Behaviour is unchanged. The same 33 files, same public surface, same imports — verified by re-running the 10-point registry smoke test and the 8-point schema round-trip / constraint suite from commits 9 and 10. LOC: 1180 → 691 (-42%).	2026-05-26 23:01:22 +02:00
CREDO23	7a96c0e29c	feat(automation): add empty Capability / Action / Trigger registries Three registries under app/automations/registries/, each as its own folder with the same SRP-per-file split (types.py for the dataclass, store.py for the in-memory dict + register/get/all functions). All three start empty; concrete entries land when the user signs off on which capabilities / actions / triggers to include (step 2). Capability (locked at v1-minimum five fields — see commit 2): - id, description, input_schema, output_schema, handler - CapabilityHandler = Callable[[dict[str, Any]], Awaitable[Any]] - Frozen, slotted dataclass (immutable post-registration). ActionDefinition (v1-trim of design plan §4): - type, name, description, config_schema, handler - Defers output_contract (handled per-step by agent_task's config.output_schema), uses_capabilities (no static analysis needed until >1 action ships), and produces_artifacts (deferred alongside the artifact pipeline). TriggerDefinition (declarative, no handler): - type, description, config_schema, payload_schema - No handler field — firing is a single dispatcher's responsibility, not a per-trigger one. store.py contract for all three: - register_: idempotent at process startup, raises on duplicate - get_: returns None on miss - all_: returns a defensive copy of the registry dict Verified by an inline smoke test (10 checks): empty initial state, registration and lookup work, duplicates raise, frozen dataclasses reject mutation, snapshots are copies, handlers are awaitable. Isolation invariant audit: grep across the full app/automations/ tree shows only three app. imports, all of them ``from app.db import BaseModel, TimestampMixin`` in the model files. No imports from app.agents., app.services., app.tasks., app.routes., or any other business-logic module.	2026-05-26 22:54:17 +02:00


2347 commits