SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-06-06 20:15:17 +02:00

Author	SHA1	Message	Date
CREDO23	a019f18d1c	refactor(agents): move connector_searchable_types, agent_cache, system_prompt + prompts to app/agents/shared (slice 7b) Three live shared leaves discovered while taking stock after slice 7 (all are consumed by the multi-agent stack and/or live routes, not single-agent-only): - connector_searchable_types -> shared + shim (multi-agent factory uses it) - agent_cache -> shared + shim (multi-agent runtime/agent_cache uses it) - system_prompt + prompts/ (42 .md fragments) -> shared together + shim. Repointed composer's _PROMPTS_PACKAGE to app.agents.shared.prompts so importlib.resources fragment loading keeps working; system_prompt's relative ".prompts.composer" import is preserved by moving both as a unit. Each keeps a re-export shim for the frozen chat_deepagent. After this slice, new_chat/ holds only the frozen single-agent stack (chat_deepagent, subagents/, __init__) plus shims.	2026-06-04 13:21:45 +02:00
CREDO23	13a96851ef	refactor(agents): move skills/, plugins/, plugin_loader to app/agents/shared (slice 7) - skills/ (builtin SKILL.md assets) has zero Python importers; it is read by filesystem path only. Moved the dir and restored skills_backends._default_builtin_root() to the clean parent.parent / "skills" / "builtin" form (undoing the transient path from 5c). - plugin_loader.py -> shared (frozen chat_deepagent uses it -> re-export shim). - plugins/ package -> shared (year_substituter rewired to shared.plugin_loader; docstring entry-point example updated to the shared dotted path). No shim needed (only a test imported it). Plugin discovery is via importlib entry points (group "surfsense.plugins"), not dotted-path import, and nothing is registered in pyproject, so the move does not affect runtime discovery.	2026-06-04 13:16:22 +02:00
CREDO23	aab95b9130	refactor(agents): move tools package to app/agents/shared (slice 6) Relocate the entire new_chat/tools/ package (62 files incl. registry, hitl, MCP cluster, and all connector subpackages: gmail/slack/discord/teams/drive/etc.) to the shared kernel. The package turned out to be a clean cohesive cluster: its only references to non-tools new_chat modules were comments, and its middleware deps were already flipped to shared in slice 5c. Flip 33 live importers (multi-agent, flows, routes, services, anonymous_agent, tests). Re-export shims remain for the frozen single-agent stack: a package __init__ mirroring the public surface (new_chat.__init__ imports it) plus invalid_tool + registry submodule shims (chat_deepagent imports those). Resolves slice 5c's two transient back-edges: shared/middleware/action_log (TYPE_CHECKING ToolDefinition) and tool_call_repair (local INVALID_TOOL_NAME) now point at app.agents.shared.tools.	2026-06-04 13:11:56 +02:00
CREDO23	a7fde2a48e	refactor(agents): move filesystem_backends to app/agents/shared (slice 5d) Completes slice 5. filesystem_backends was deferred from 3b because it depends on middleware.{kb_postgres_backend,multi_root_local_folder_backend}; those moved to shared in 5c, so it now relocates cleanly. Flip the 2 non-frozen importers (multi-agent factory + test); a re-export shim remains for the frozen chat_deepagent (build_backend_resolver).	2026-06-04 13:03:15 +02:00
CREDO23	227983a104	refactor(agents): move middleware package to app/agents/shared (slice 5c) Relocate the entire new_chat/middleware/ package to the shared kernel as one cohesive unit (it is live shared infrastructure: the multi-agent stack wraps nearly every middleware via multi_agent_chat/middleware/main_agent/*, and anonymous_agent consumes it too). Flip 69 live importers across both the package-path and submodule-path forms. Shims left for the frozen single-agent stack: a package __init__ re-export plus submodule shims for permission, skills_backends, and scoped_model_fallback (the three imported via submodule path by chat_deepagent/subagents). Cycle break: importing shared.middleware previously reached back into new_chat.tools at module load, which dragged in new_chat.__init__ -> chat_deepagent -> the middleware shim -> half-initialized shared.middleware. Made action_log's ToolDefinition import TYPE_CHECKING-only and tool_call_repair's INVALID_TOOL_NAME import function-local. These tools-package back-edges fully resolve in slice 6. Asset note: skills_backends._default_builtin_root now walks to app/agents/new_chat/skills/builtin (the skills/ tree migrates in slice 7).	2026-06-04 13:00:41 +02:00
CREDO23	6f488d9564	refactor(agents): move checkpointer + mention_resolver to app/agents/shared (slice 5b) Two independent leaf modules (no intra-new_chat deps, no frozen importer), consumed only by flows/routes/tests. Flipped 8 importers across both the dotted-path and module-style (from app.agents.new_chat import mention_resolver) forms. No shims needed.	2026-06-04 12:52:54 +02:00
CREDO23	dcdf8f776b	refactor(agents): move utils + document_xml to app/agents/shared (slice 5a) Two pure leaf modules with no intra-new_chat deps and no frozen importer. Moving them now (before the middleware package) pre-empts two shared->new_chat back-edges that the middleware move would otherwise create (knowledge_search->utils, kb_postgres_backend->document_xml).	2026-06-04 12:50:38 +02:00
CREDO23	946f8a8c5d	refactor(agents): move llm_config + prompt_caching to app/agents/shared (slice 4b) Relocate the mutually-dependent LLM config layer and the LiteLLM prompt-caching helper to the shared kernel as one unit, rewiring their internal cross-reference to the shared paths. Flip 21 non-frozen importers. Re-export shims remain at new_chat/{llm_config,prompt_caching}.py for the frozen single-agent stack (chat_deepagent); they will be removed when that stack is retired.	2026-06-04 12:41:52 +02:00
CREDO23	8fca2753aa	refactor(agents): move permissions to app/agents/shared (slice 4a) Relocate the permission evaluator (wildcard matcher + rule evaluation) to the shared kernel and flip 43 non-frozen importers. A re-export shim remains at new_chat/permissions.py for the frozen single-agent stack (chat_deepagent and subagents/{config,providers/linear,providers/slack}); it will be removed when that stack is retired.	2026-06-04 12:38:30 +02:00
CREDO23	3efe51e6ec	refactor(agents): move filesystem_state, path_resolver, sandbox to app/agents/shared (slice 3b) Relocate three leaf filesystem-cluster modules to the shared kernel and flip all 38 importers. No re-export shims needed (no frozen single-agent importer). This also resolves the pre-existing shared->new_chat back-edge from shared/receipt_command.py onto filesystem_state. filesystem_backends is intentionally deferred to slice 5: it depends on new_chat middleware (kb_postgres_backend, multi_root_local_folder_backend) that have not yet moved, so relocating it now would create a shared->new_chat edge.	2026-06-04 12:34:28 +02:00
CREDO23	1b536b8aee	refactor(agents): move filesystem_selection to app/agents/shared (slice 3a) Promote the filesystem mode contracts (FilesystemMode, FilesystemSelection, ClientPlatform, LocalFilesystemMount) out of `new_chat` into the cross-agent `app/agents/shared` kernel. Pure leaf consumed across the whole multi-agent filesystem middleware/tool tree, the chat flows/monolith, routes and tests. git mv (content unchanged) + flipped all ~48 importers. A re-export shim remains at new_chat/filesystem_selection.py only for the not-yet-retired single-agent (chat_deepagent). Also updated the stream parity test's annotation normalizer to strip the new app.agents.shared.filesystem_selection. prefix (the dataclasses' __module__ changed with the move), keeping monolith<->flows signature parity intact. Behavior-preserving: only import paths change. 1326 tests green.	2026-06-04 12:28:20 +02:00
CREDO23	a975754e7d	refactor(agents): move feature flags to app/agents/shared/feature_flags (slice 2b) Promote the agent feature-flag resolver (AgentFeatureFlags / get_flags) out of `new_chat` into the cross-agent `app/agents/shared` kernel. feature_flags is a pure leaf consumed across the multi-agent middleware stack, the chat routes, and tests. Moved it via git mv (content unchanged) and flipped all 37 importers to app.agents.shared.feature_flags. A thin re-export shim remains at new_chat/feature_flags.py only for the not-yet-retired single-agent (chat_deepagent); it goes away with the single-agent deletion. Behavior-preserving: only import paths change. 1243 tests green.	2026-06-04 12:23:12 +02:00
CREDO23	28b13ed25b	refactor(agents): move context schema + state reducers to app/agents/shared (slice 2) Continue promoting the shared agent toolkit out of `new_chat` into the cross-agent `app/agents/shared` kernel. - state_reducers.py: clean move (no single-agent importer); all 7 importers flipped to app.agents.shared.state_reducers. - context.py: moved to app.agents.shared.context; flipped the multi-agent, app, automations, chat-flows and monolith importers. A thin re-export shim remains at new_chat/context.py because the not-yet-retired single-agent (chat_deepagent) and the new_chat package __init__ still import it; the shim goes away with the single-agent deletion. - Updated the stream parity test's annotation normalizer to strip the new app.agents.shared.context. prefix (SurfSenseContextSchema.__module__ changed with the move), keeping monolith<->flows signature parity intact. Behavior-preserving: definitions unchanged; only import paths move. 1219 tests green.	2026-06-04 12:19:22 +02:00
CREDO23	0354f73f29	refactor(agents): move error taxonomy to app/agents/shared/errors (slice 1) First slice of promoting the shared agent toolkit out of the misnamed `new_chat` package into the cross-agent `app/agents/shared` kernel. `errors.py` is a leaf module (no intra-package deps) consumed by the multi-agent chat, the chat streaming flows/monolith, and tests — i.e. it is shared infrastructure, not single-agent code. Moved it verbatim to `app.agents.shared.errors` and flipped all 12 importers. No re-export shim remains since zero importers needed it. Behavior-preserving: identical class/enum definitions; only the import path changes. 1208 agent + chat-task tests green.	2026-06-04 12:13:55 +02:00
CREDO23	a4010a357f	refactor: extract shared connector->searchable-type mapping out of chat_deepagent The multi-agent factory reached into the single-agent factory module (chat_deepagent) for `_map_connectors_to_searchable_types`. Move this agent-agnostic helper (and its two lookup tables) into a dedicated `connector_searchable_types` module and point both factories at it. Behavior-preserving: the function body is unchanged; only its home and visibility (now public `map_connectors_to_searchable_types`) change. This removes the cross-dependency on the dying single-agent module so it can be retired later without breaking the multi-agent path.	2026-06-04 12:05:17 +02:00
CREDO23	0b006badb0	chore: remove dead agent code (vision autocomplete, linear tools, provider hints)	2026-06-04 11:44:23 +02:00
CREDO23	cb44063081	fix: repair pre-existing agent_task, gateway, and skills tests	2026-06-04 10:25:06 +02:00
CREDO23	b6710ae9af	refactor(jira): remove dead legacy indexing and write tools (mcp-only now)	2026-06-02 16:38:00 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	0f2e3c7655	refactor: anonymous/free chat experience - Enhanced lambda function formatting in `_after_commit` for better clarity. - Simplified generator expression in `_match_condition` for improved readability. - Streamlined function signature in `_eligible` for consistency. - Updated imports and refactored anonymous chat routes to use a new agent creation method. - Added a new function `_load_anon_document` to handle document loading from Redis. - Improved UI components by replacing legacy structures with modern alternatives, including alerts and separators. - Refactored quota-related components to utilize new alert structures for better user feedback. - Cleaned up unused variables and optimized component states for performance.	2026-05-31 15:58:21 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	9d1a01eb0c	refactor(automations): streamline model eligibility handling in automation creation - Removed the eligibility gate for model selection in the automation creation process, allowing users to choose models directly in the builder. - Updated the `AutomationBuilderForm` to incorporate model selection logic, ensuring that selected models are validated and preserved during automation creation and editing. - Simplified the `AutomationsContent` and `AutomationNewContent` components by eliminating unnecessary eligibility checks and alerts. - Enhanced the user experience by integrating model selection directly into the automation approval process, ensuring that only billable models are used. - Refactored related tests to cover new model selection behavior and ensure proper validation of user-selected models.	2026-05-29 20:27:40 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	409fec94c3	feat(automations): implement model eligibility checks for automation creation - Added model eligibility checks to ensure automations can only use billable models (premium or BYOK). - Introduced new API endpoint to report model eligibility status for search spaces. - Updated frontend components to display eligibility alerts and disable creation options when models are not billable. - Enhanced automation creation forms to reflect model eligibility, preventing users from submitting invalid configurations. - Implemented server-side logic to capture and preserve model preferences across automation edits, ensuring consistent behavior during execution.	2026-05-29 03:13:46 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	40ca9e6ed2	refactor: remove `search_surfsense_docs` tool and related references - Deleted the `search_surfsense_docs` tool and its associated files, streamlining the agent's toolset. - Updated various components and prompts to remove references to the now-removed tool, ensuring consistency across the codebase. - Adjusted documentation to direct users to the SurfSense documentation link for product-related queries instead.	2026-05-28 22:35:14 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	94e834134f	chore: linting	2026-05-28 19:21:29 -07:00
Rohan Verma	4dda02c06c	Merge pull request #1443 from CREDO23/feature-automations [Feat] Automation V1 — Scheduled Agent Tasks, Created via Chat (HITL) or JSON	2026-05-28 12:41:41 -07:00
CREDO23	91962ba879	fix automation run inputs, hitl routing, and detail UI polish	2026-05-28 02:48:47 +02:00
CREDO23	2e572d7818	feat(web): create_automation HITL approval card in chat Closes the create loop in chat: the agent describes user intent → the drafter sub-LLM produces an AutomationCreate JSON → this card surfaces a structured preview → approve persists; reject cancels. Edits flow through chat refinement (re-call with a refined intent), not in-card, so the card stays simple and the multi-turn checkpointer carries the context. Tool UI (components/tool-ui/automation/): - create-automation.tsx — entry dispatcher + ApprovalCard chrome (pending/processing/complete/rejected via useHitlPhase) + SavedCard (links to the detail page) + InvalidCard (lists drafter validation issues) + ErrorCard (verbatim message). Rejection result is hidden because the approval card itself shows the rejected phase inline. - automation-draft-preview.tsx — structured preview body: name + description + goal, triggers (humanised cron + tz + static-input keys), plan steps (step_id → action), and a collapsible raw JSON for power users. Wiring: - components/tool-ui/index.ts — re-export. - features/chat-messages/timeline/tool-registry/registry.ts — register create_automation → CreateAutomationToolUI (dynamic import, same pattern as other connector tools). - contracts/enums/toolIcons.tsx — Workflow icon + "Create automation" display name so fallback chrome (and timeline headers) are honest. Shared util: - lib/automations/describe-cron.ts — lifted from the route slice's lib/ folder since both the dashboard slice and the new approval card now render schedule descriptions. Slice imports updated; the now- empty slice lib/ folder is gone. Backend prompt fragments: - main_agent/system_prompt/.../create_automation/description.md and the tool's docstring no longer promise in-card edits. They make the refinement path explicit: if the user wants changes after seeing the draft, they reply in chat and the agent calls the tool again with a refined intent. v1 deliberately excludes: - In-card edit form / right-side edit panel — defer until we see real demand. The chat refinement loop covers the common case. - approve_always / persistent allow rules — automations are a single artifact, not a repeated mutation, so the "trust this kind of call" affordance doesn't apply.	2026-05-28 01:32:04 +02:00
CREDO23	2b7d91aa03	feat(automations): add create_automation HITL tool (NL → draft → approve → save) Single tool exposed to the main agent. The main agent passes a natural-language `intent`; a focused drafter sub-LLM turns it into a full AutomationCreate JSON; that JSON is surfaced via request_approval (action_type "automation_create") so the user can edit/approve it on a frontend card; on approval the tool persists via AutomationService. Three phases, one tool call. Scope split: - main agent sees only `intent: str` (no schema knowledge leaks into the calling graph) — prompt fragments scoped accordingly. - drafter sub-LLM owns the schema + few-shot intent→JSON examples — lives in the generating graph's prompt (tools/automation/prompt.py). Files: - main_agent/tools/automation/{create.py, prompt.py, __init__.py}: new tool + drafter system prompt with two few-shot intent→JSON examples. - system_prompt/prompts/tools/create_automation/{description.md, example.md}: intent-only guidance for the main agent. - main_agent/tools/index.py: add create_automation to the main-agent allowlist. - new_chat/tools/registry.py: deferred-import factory to break the multi_agent_chat ↔ registry cycle; one ToolDefinition entry.	2026-05-28 00:12:02 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	9d6e9b7e2d	feat: enhance task management and timeout configurations in multi-agent chat - Added new environment variables for controlling task execution limits, including `SURFSENSE_SUBAGENT_INVOKE_TIMEOUT_SECONDS`, `SURFSENSE_TASK_BATCH_CONCURRENCY`, and `SURFSENSE_TASK_BATCH_MAX_SIZE`. - Updated documentation to reflect new batch processing capabilities for `task` calls, allowing for concurrent execution of multiple subagent tasks. - Improved error handling and receipt generation for deliverables, ensuring consistent feedback on task status. - Refactored middleware to incorporate search space ID for better task management.	2026-05-27 14:58:10 -07:00
Anish Sarkar	a41b16b73e	feat(web): enhance chat context and mention handling with connector support	2026-05-26 21:11:53 +05:30
Rohan Verma	69388fc710	Merge pull request #1429 from CREDO23/fix-desktop-redirects [Fixes] Packaged desktop: connector redirect + linux launcher icon	2026-05-23 15:51:59 -07:00
Anish Sarkar	98e3950dc8	Merge remote-tracking branch 'upstream/dev' into feat/opentelemetry	2026-05-23 03:21:08 +05:30
CREDO23	d97b2830c5	fix: resolve desktop KB prompt self-contradiction on chunk_ids The citations fix (`cacb27e0`) added a "Chunk citations in your prose" section to system_prompt_desktop.md telling the KB subagent to always leave `evidence.chunk_ids` null and emit no `[citation:...]` markers in desktop mode, but left the pre-existing line declaring that `chunk_ids` apply to `<priority_documents>` hits. The two rules contradicted each other; the model picked one per turn. Strike the stale conditional clause and point at the dedicated section as the single source of truth. Matches the parallel line in system_prompt_cloud.md and the already-consistent system_prompt_readonly_desktop.md.	2026-05-22 17:24:57 +02:00
Anish Sarkar	dc893281ba	feat(chat): add model retry and stream lifecycle events	2026-05-22 17:48:43 +05:30
Anish Sarkar	8bca29fe0d	feat(agents): track subagent invocation telemetry	2026-05-22 13:48:57 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	cacb27e007	fix: citations in agent responses	2026-05-21 14:41:32 -07:00
Anish Sarkar	53691f9c51	feat(agents): track permission and compaction events	2026-05-21 23:02:54 +05:30
Anish Sarkar	ea3d0a6463	feat(agents): emit metrics for model and tool calls	2026-05-21 23:02:36 +05:30
CREDO23	49da7a57df	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed Resolves: surfsense_backend/app/agents/new_chat/middleware/memory_injection.py - Took both imports: upstream moved MEMORY_HARD_LIMIT/SOFT_LIMIT to app.services.memory; kept our perf-logger import for timing. Pulls in upstream changes: - Memory document feature (services/memory refactor, removal of app.agents.new_chat.memory_extraction and background extraction in stream_new_chat — agent now drives memory via update_memory tool). - BACKEND_URL env refactor across web tool-ui/editor/chat/dashboard/lib. - GitHub Actions backend test workflow + pre-commit biome bump. - Token-display polish in MessageInfoDropdown; save_memory no-update sentinel. Verified: 1723 unit tests pass, ruff clean. No semantic regression in stream_new_chat (their memory-extraction deletion and our preflight removal touch different functions).	2026-05-20 21:23:48 +02:00
CREDO23	d5ee8cc4cd	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed	2026-05-20 19:22:49 +02:00
CREDO23	704d1bf18f	refactor(mcp): per-connector cache refresh on lifecycle events Collapse the invalidate + warmup pair into a single refresh_mcp_tools_cache_for_connector(connector_id, search_space_id) helper and scope live discovery to the one connector that changed instead of the whole search space. - new mcp_tool.discover_single_mcp_connector: load one connector, refresh OAuth if needed, force live MCP discovery so its cached_tools row is rewritten; returned wrappers are discarded since the in-process LRU is rebuilt lazily on the next user query - mcp_tools_cache.refresh_mcp_tools_cache_for_connector: synchronously evicts the per-space LRU (LRU keys cannot scope finer) and schedules the per-connector prefetch via loop.create_task - routes (OAuth callback, MCP POST, MCP PUT) collapse their two back-to-back calls into a single refresh call; DELETE handlers keep using bare invalidate_mcp_tools_cache (nothing to prefetch) No new automated tests: the new functions are I/O glue (DB + network) where mocked unit tests would test implementation rather than behavior. The existing 9 unit tests for the cached_tools data shape are unchanged.	2026-05-20 17:43:27 +02:00
CREDO23	c0aa4261ac	perf(mcp): persist list_tools discovery in connector.config.cached_tools Skip the ~1-3s MCP initialize + list_tools handshake on every cache miss by reading tool definitions from the connector row we already load. Lazy populate on first miss, self-heal on corrupt cache, zero schema migration.	2026-05-20 16:11:07 +02:00
CREDO23	db8bffab38	perf(prompt-cache): enable Azure prompt_cache_key routing hint Splits the OpenAI-family gate into per-param predicates so AZURE and AZURE_OPENAI configs now receive prompt_cache_key for backend routing affinity (Microsoft auto-caches GPT-4o+ deployments at >=1024 tokens; the key clusters same-prefix requests on the same GPU pool and raises hit rate on turn 2+). prompt_cache_retention stays opted out for Azure because litellm 1.83.14's Azure transformer would drop it silently; revisit when Azure's supported params list is updated.	2026-05-20 11:58:15 +02:00
CREDO23	71dead0406	perf(kb-planner): route internal planner calls to dedicated small/fast LLM Adds an optional planner LLM role wired through KnowledgePriorityMiddleware so KB query rewriting, date extraction, and recency classification run on a cheap model (e.g. gpt-4o-mini, Haiku, Azure nano) instead of the user's chat LLM. Operators opt in by setting is_planner: true on exactly one global config; without it, behavior is unchanged.	2026-05-20 11:42:52 +02:00
Anish Sarkar	132e7b3c44	refactor: remove memory extraction functions and related components from the new chat agent	2026-05-20 14:03:28 +05:30
CREDO23	52d425f170	perf(kb-persistence): offload sync embed_texts to thread _create_document and _update_document run on the chat critical path when the filesystem subagent writes via the user's chat turn. Both called embed_texts synchronously inside an async coroutine, blocking the event loop for the duration of the embed.	2026-05-20 10:03:14 +02:00
CREDO23	4fa85a9a94	perf(kb-search): offload sync embed_texts to thread embed_texts holds a threading.Lock and runs a sync embedding call inside search_knowledge_base, an async coroutine on the KB priority middleware critical path. Blocking the event loop here stalls every other coroutine on the worker (SSE keepalives, concurrent chat requests, background tasks). Wrap in asyncio.to_thread so the embed runs on the default executor pool while the loop keeps serving.	2026-05-20 10:02:38 +02:00
CREDO23	0cdda14922	perf(kb subagent, desktop): cap evidence.content_excerpt to 500 chars	2026-05-20 09:43:36 +02:00
CREDO23	5edf0520c4	perf(kb subagent, cloud): cap evidence.content_excerpt to 500 chars	2026-05-20 09:43:32 +02:00
CREDO23	b554c600bb	perf(research subagent): cap evidence.findings and evidence.sources to bound output	2026-05-20 09:42:57 +02:00
CREDO23	6c173dc2a7	perf(teams subagent): stop echoing raw teams/channels/messages payload into evidence.items	2026-05-20 09:42:03 +02:00

1 2 3 4 5 ...

720 commits