SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-06-10 20:35:17 +02:00

Author	SHA1	Message	Date
CREDO23	ba687813c1	fix(elasticsearch): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	c26181d086	fix(airtable): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	e3afe9d7c7	fix(luma): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	8191118eb4	fix(bookstack): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	45438249b6	fix(clickup): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	f5dd8f3985	fix(github): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	f085ac59e5	fix(teams): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	791b0afe16	fix(discord): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	be8a3bcd00	fix(slack): commit failed status immediately	2026-06-10 00:10:52 +02:00
CREDO23	c47949791b	fix(confluence): fail skipped placeholders so they don't stay pending	2026-06-10 00:10:42 +02:00
CREDO23	d70d01f331	fix(linear): fail skipped placeholders so they don't stay pending	2026-06-10 00:10:42 +02:00
CREDO23	1b0912aaa3	fix(calendar): fail skipped placeholders so they don't stay pending	2026-06-10 00:10:42 +02:00
CREDO23	b2c2fc9c2e	fix(gmail): fail skipped placeholders so they don't stay pending	2026-06-10 00:10:42 +02:00
CREDO23	90b32a8880	fix(notion): fail skipped placeholders so they don't stay pending	2026-06-10 00:10:42 +02:00
CREDO23	33300e4faa	fix(dropbox): sanitize ETL reason and retry stuck pending/processing files	2026-06-10 00:10:25 +02:00
CREDO23	464e7d4554	fix(onedrive): sanitize ETL reason and retry stuck pending/processing files	2026-06-10 00:10:25 +02:00
CREDO23	c0c5f3414e	fix(google-drive): sanitize ETL reason and retry stuck pending/processing files	2026-06-10 00:10:25 +02:00
CREDO23	e45e8389dc	fix(dropbox): mark documents failed on ETL failure	2026-06-09 23:39:25 +02:00
CREDO23	82aaaa5a9f	fix(onedrive): mark documents failed on ETL failure	2026-06-09 23:39:25 +02:00
CREDO23	6fd95f82b4	fix(google-drive): mark placeholders failed on ETL failure	2026-06-09 23:39:25 +02:00
CREDO23	cb10882dc8	feat(indexers): add mark_connector_documents_failed helper	2026-06-09 23:39:25 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	ce952d2ad1	chore: linting	2026-06-09 00:42:26 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	640ef5f15d	feat(proxy): integrate Scrapling for enhanced web scraping capabilities - Replaced Playwright with Scrapling's fetchers in the web crawling and YouTube processing modules for improved performance and flexibility. - Updated proxy configuration to support dynamic proxy selection via environment variables. - Enhanced logging to track performance metrics during web scraping operations. - Refactored related modules to utilize the new proxy utilities and streamline the scraping process.	2026-06-09 00:15:10 -07:00
CREDO23	8bdfd00a15	Merge upstream/dev	2026-06-05 19:18:12 +02:00
CREDO23	0081b627e9	refactor(agents): move kb_persistence middleware into main_agent (owner) The KB-persistence impl lived in shared/middleware/ but no subagent uses it -- consumers are the main_agent builder and the boundary event loop. Colocate with its owner using the folder-per-middleware shape; __init__ re-exports the public surface. Tests that reached module internals now alias the .middleware submodule. main_agent/middleware/kb_persistence.py -> kb_persistence/builder.py shared/middleware/kb_persistence.py -> kb_persistence/middleware.py	2026-06-05 14:11:55 +02:00
CREDO23	a7a642fedc	refactor(agents): move busy_mutex middleware into main_agent (owner) The busy-mutex impl (BusyMutexMiddleware + cancel/turn-lifecycle primitives) lived in shared/middleware/ but no subagent uses it -- consumers are the main_agent builder and the boundary (turn lifecycle). Colocate with its owner using the folder-per-middleware shape; __init__ re-exports the public surface so boundary import sites only change package path: main_agent/middleware/busy_mutex.py -> busy_mutex/builder.py shared/middleware/busy_mutex.py -> busy_mutex/middleware.py	2026-06-05 14:08:45 +02:00
CREDO23	f2a61bc0ef	refactor(agents): consolidate chat runtime infra under chat/runtime Move the lower-level runtime/infra modules out of multi_agent_chat/shared/ (they were never used by subagents, so they failed the shared-by-all-siblings rule) and unify them with the already-relocated checkpointer: agents/runtime/ -> agents/chat/runtime/ mac/shared/errors.py -> chat/runtime/errors.py mac/shared/llm_config.py -> chat/runtime/llm_config.py mac/shared/prompt_caching.py -> chat/runtime/prompt_caching.py mac/shared/mention_resolver.py -> chat/runtime/mention_resolver.py mac/shared/path_resolver.py -> chat/runtime/path_resolver.py These sit below the agent packages: the boundary + agent factory + shared middleware depend on them, and they import no agent code (acyclic).	2026-06-05 13:19:24 +02:00
CREDO23	7d866a2279	refactor(agents): sink sandbox.py into filesystem subsystem shared/sandbox.py was used only by the filesystem middleware/tools (and the boundary) -- never by main_agent or subagents as shared code. Move it next to its only agent-side consumer: multi_agent_chat/shared/sandbox.py -> multi_agent_chat/shared/middleware/filesystem/sandbox.py	2026-06-05 13:15:57 +02:00
CREDO23	24b62a63b4	refactor(agents): introduce chat/ category; dissolve top-level agents/shared Recursive shared-folder rule: a shared/ must be shared by ALL siblings at its level. The kernel (context, compaction, retry_after, web_search) was shared by only 2 of the agents -- anonymous_chat + multi_agent_chat -- never by podcaster or video_presentation. Those 2 are the "chat" category, so their shared code belongs in that category's shared/, not the top-level one. app/agents/anonymous_chat/ -> app/agents/chat/anonymous_chat/ app/agents/multi_agent_chat/ -> app/agents/chat/multi_agent_chat/ app/agents/shared/ -> app/agents/chat/shared/ (anon<->mac kernel) Top-level app/agents/shared/ is gone: nothing was shared across all three categories (chat / podcaster / video_presentation). ~289 import sites rewritten (app.agents.{anonymous_chat,multi_agent_chat,shared} -> app.agents.chat.*); all moves are git renames (history preserved). app/agents/ now: chat/, podcaster/, video_presentation/, runtime/.	2026-06-05 12:54:02 +02:00
CREDO23	b7ea829371	refactor(agents): relocate boundary-only infra out of shared/ Neither module is imported by any sibling agent package, so neither belongs in the cross-agent shared kernel: - checkpointer.py -> app/agents/runtime/checkpointer.py LangGraph Postgres checkpoint saver. It's cross-agent runtime infra wired by the boundary (app lifespan + anonymous_chat & multi_agent_chat flows), not agent code. New app/agents/runtime/ layer holds boundary-wired agent infra. - shared/system_prompt.py + shared/prompts/ -> app/prompts/ The legacy single-agent prompt composer. The live agents don't use it (main_agent has its own system_prompt/ builder; anonymous_chat builds inline); its only consumer is new_llm_config_routes for displaying default instructions. Moved to the existing non-agent prompt domain: system_prompt.py -> app/prompts/default_system_instructions.py prompts/ -> app/prompts/system_prompt_composer/ app/agents/shared/ now contains only genuinely cross-agent code: context, middleware/{compaction,retry_after,dedup_tool_calls}, tools/. NOTE: get_default_system_instructions() (LLM-config UI) composes from the legacy library, which differs from what the live agents actually run -- pre-existing latent staleness, not changed here.	2026-06-05 12:36:44 +02:00
CREDO23	82c5dc5b02	refactor(agents): move mac-only modules out of the cross-agent shared kernel app/agents/shared/ is a sibling of anonymous_chat/podcaster/multi_agent_chat/ video_presentation, so it should only hold code shared across 2+ of those agents. In practice podcaster and video_presentation import nothing from it, and anonymous_chat needs only context + compaction + retry_after + web_search. Everything else was multi_agent_chat-only (the boundary just passes through). Move the multi_agent_chat-only cluster into multi_agent_chat/shared/ (files moved verbatim via git rename; ~116 import sites rewritten): errors, feature_flags, filesystem_selection, path_resolver, prompt_caching, sandbox, llm_config, mention_resolver middleware/busy_mutex, middleware/kb_persistence busy_mutex/llm_config/mention_resolver are boundary-only but import the moved modules, so they were folded in to avoid a backwards shared -> multi_agent_chat dependency. main_agent builders now import the impls directly; the shared middleware barrel keeps only the genuinely-shared compaction + retry_after. Also delete the dead leftover shared/plugins and shared/skills dirs (live copies already live under main_agent/). Remaining in app/agents/shared/: context, system_prompt(+prompts), checkpointer, middleware/{compaction,retry_after,dedup_tool_calls}, tools/. checkpointer and system_prompt are boundary-only infra pending a dedicated home decision.	2026-06-05 12:30:15 +02:00
CREDO23	f615d6b530	refactor(agents): relocate remaining MAC-only kernel (permissions, deliverable_wait) permissions.py (authorization Rule/Ruleset model) is consumed across all MAC subagents + the permissions middleware, with a single external consumer (user_tool_allowlist service) -> move to multi_agent_chat/shared/permissions.py and repoint all 42 sites. deliverable_wait.py (wait_for_deliverable) is used only by the podcast and video_presentation deliverable tools -> colocate into subagents/builtins/deliverables/. No behavior change; import-all + permission/allowlist/deliverable unit tests stay green.	2026-06-05 10:58:49 +02:00
CREDO23	714c5ffea9	refactor(agents): group tool-outcome receipts into multi_agent_chat/shared/receipts/ receipt.py (Receipt model + make_receipt) and receipt_command.py (with_receipt Command helper) are a tight pair used only by MAC subagent tools, the graph state, and the kb_persistence middleware -- no external code imports them (the streaming tool_end handler only references them in a docstring). Move both into a dedicated receipts/ package (receipts/receipt.py + receipts/command.py) and repoint importers. No behavior change; import-all + receipt/deliverable unit tests stay green.	2026-06-05 10:56:37 +02:00
CREDO23	9c845d562e	refactor(agents): colocate main-agent middleware under main_agent/ slice Vertical-slice colocation: all main-agent code should live under main_agent/ instead of being split across a parallel middleware/main_agent tree. Move multi_agent_chat/middleware/main_agent/ -> main_agent/middleware/ and its assembler middleware/stack.py -> main_agent/middleware/stack.py, so the main-agent slice is self-contained (graph, runtime, system_prompt, tools, middleware). Genuinely cross-slice middleware (middleware/shared/, middleware/subagent/) stays under multi_agent_chat/middleware/ for a later slice; the moved builders now reference it via absolute imports. Pure move + import rewrite (git-tracked renames). Verified: full unit suite green (2430 passed, 1 skipped), including test_import_all and the checkpointed-subagent middleware suite.	2026-06-04 18:03:49 +02:00
CREDO23	5b45f78a16	refactor(chat): delete legacy stream_new_chat monolith (cutover complete) The flows orchestrators (new_chat/resume_chat) are now the sole live path after the byte-for-byte differential proof, so the monolith and its monolith-vs-flows parity scaffolding are removed. - Repoint the last live importer (anonymous_chat_routes) to streaming.agent.event_loop.stream_agent_events + shared.stream_result.StreamResult (drop-in; the keyword-only fallback-commit params default to inert for anon). - Repoint e2e launcher patch targets to flows.shared.llm_bundle. - Repoint helper unit tests (chunk_parts, thinking-step ids, tool-input streaming) to their flows homes to preserve coverage. - Delete the monolith, the contract test, and the parity tests (parallel_refactor, stage_1, stage_2, orchestrator_frame) whose sole purpose was comparing against the now-removed monolith. Full suite green (2622 passed, 1 skipped); the two excluded live-app dirs (document_upload, composio) have a pre-existing, env-gated registration 404 unrelated to this change.	2026-06-04 14:35:45 +02:00
CREDO23	724bbd6deb	refactor(agents): hardwire multi-agent as the only chat factory (bucket B1) Make create_multi_agent_chat_deep_agent the unconditional agent factory in all three streaming entry points (stream_new_chat monolith + new_chat/resume flow orchestrators). Drop the MULTI_AGENT_CHAT_ENABLED branch and the now-unused create_surfsense_deep_agent / _app_config imports. The single-agent implementation (chat_deepagent.py, subagents/) is left in place; it is deleted in a later phase. Suite green (2758 passed).	2026-06-04 13:35:38 +02:00
CREDO23	227983a104	refactor(agents): move middleware package to app/agents/shared (slice 5c) Relocate the entire new_chat/middleware/ package to the shared kernel as one cohesive unit (it is live shared infrastructure: the multi-agent stack wraps nearly every middleware via multi_agent_chat/middleware/main_agent/*, and anonymous_agent consumes it too). Flip 69 live importers across both the package-path and submodule-path forms. Shims left for the frozen single-agent stack: a package __init__ re-export plus submodule shims for permission, skills_backends, and scoped_model_fallback (the three imported via submodule path by chat_deepagent/subagents). Cycle break: importing shared.middleware previously reached back into new_chat.tools at module load, which dragged in new_chat.__init__ -> chat_deepagent -> the middleware shim -> half-initialized shared.middleware. Made action_log's ToolDefinition import TYPE_CHECKING-only and tool_call_repair's INVALID_TOOL_NAME import function-local. These tools-package back-edges fully resolve in slice 6. Asset note: skills_backends._default_builtin_root now walks to app/agents/new_chat/skills/builtin (the skills/ tree migrates in slice 7).	2026-06-04 13:00:41 +02:00
CREDO23	6f488d9564	refactor(agents): move checkpointer + mention_resolver to app/agents/shared (slice 5b) Two independent leaf modules (no intra-new_chat deps, no frozen importer), consumed only by flows/routes/tests. Flipped 8 importers across both the dotted-path and module-style (from app.agents.new_chat import mention_resolver) forms. No shims needed.	2026-06-04 12:52:54 +02:00
CREDO23	946f8a8c5d	refactor(agents): move llm_config + prompt_caching to app/agents/shared (slice 4b) Relocate the mutually-dependent LLM config layer and the LiteLLM prompt-caching helper to the shared kernel as one unit, rewiring their internal cross-reference to the shared paths. Flip 21 non-frozen importers. Re-export shims remain at new_chat/{llm_config,prompt_caching}.py for the frozen single-agent stack (chat_deepagent); they will be removed when that stack is retired.	2026-06-04 12:41:52 +02:00
CREDO23	3efe51e6ec	refactor(agents): move filesystem_state, path_resolver, sandbox to app/agents/shared (slice 3b) Relocate three leaf filesystem-cluster modules to the shared kernel and flip all 38 importers. No re-export shims needed (no frozen single-agent importer). This also resolves the pre-existing shared->new_chat back-edge from shared/receipt_command.py onto filesystem_state. filesystem_backends is intentionally deferred to slice 5: it depends on new_chat middleware (kb_postgres_backend, multi_root_local_folder_backend) that have not yet moved, so relocating it now would create a shared->new_chat edge.	2026-06-04 12:34:28 +02:00
CREDO23	1b536b8aee	refactor(agents): move filesystem_selection to app/agents/shared (slice 3a) Promote the filesystem mode contracts (FilesystemMode, FilesystemSelection, ClientPlatform, LocalFilesystemMount) out of `new_chat` into the cross-agent `app/agents/shared` kernel. Pure leaf consumed across the whole multi-agent filesystem middleware/tool tree, the chat flows/monolith, routes and tests. git mv (content unchanged) + flipped all ~48 importers. A re-export shim remains at new_chat/filesystem_selection.py only for the not-yet-retired single-agent (chat_deepagent). Also updated the stream parity test's annotation normalizer to strip the new app.agents.shared.filesystem_selection. prefix (the dataclasses' __module__ changed with the move), keeping monolith<->flows signature parity intact. Behavior-preserving: only import paths change. 1326 tests green.	2026-06-04 12:28:20 +02:00
CREDO23	28b13ed25b	refactor(agents): move context schema + state reducers to app/agents/shared (slice 2) Continue promoting the shared agent toolkit out of `new_chat` into the cross-agent `app/agents/shared` kernel. - state_reducers.py: clean move (no single-agent importer); all 7 importers flipped to app.agents.shared.state_reducers. - context.py: moved to app.agents.shared.context; flipped the multi-agent, app, automations, chat-flows and monolith importers. A thin re-export shim remains at new_chat/context.py because the not-yet-retired single-agent (chat_deepagent) and the new_chat package __init__ still import it; the shim goes away with the single-agent deletion. - Updated the stream parity test's annotation normalizer to strip the new app.agents.shared.context. prefix (SurfSenseContextSchema.__module__ changed with the move), keeping monolith<->flows signature parity intact. Behavior-preserving: definitions unchanged; only import paths move. 1219 tests green.	2026-06-04 12:19:22 +02:00
CREDO23	0354f73f29	refactor(agents): move error taxonomy to app/agents/shared/errors (slice 1) First slice of promoting the shared agent toolkit out of the misnamed `new_chat` package into the cross-agent `app/agents/shared` kernel. `errors.py` is a leaf module (no intra-package deps) consumed by the multi-agent chat, the chat streaming flows/monolith, and tests — i.e. it is shared infrastructure, not single-agent code. Moved it verbatim to `app.agents.shared.errors` and flipped all 12 importers. No re-export shim remains since zero importers needed it. Behavior-preserving: identical class/enum definitions; only the import path changes. 1208 agent + chat-task tests green.	2026-06-04 12:13:55 +02:00
CREDO23	44abf56d6c	fix: restore connector mentions, free-tier fallback, orphan-row guard in new chat flow	2026-06-04 10:25:06 +02:00
Anish Sarkar	e4d7b01b09	refactor(backend): Update references from document summary LLM to agent LLM across multiple files	2026-06-04 01:50:44 +05:30
Anish Sarkar	f3866b9e7e	feat(backend): Use deterministic content in connector ingestion	2026-06-04 00:51:38 +05:30
Anish Sarkar	81fa219b30	feat(backend): Remove LLM summaries from document indexing	2026-06-04 00:50:19 +05:30
Anish Sarkar	e68b3f9532	feat(chat): fix support for mentioned connectors in new chat input state and runtime context, enhancing query rendering and context management	2026-06-03 22:51:55 +05:30
CREDO23	9dab07f775	refactor: point notification callers at notifications module	2026-06-03 18:04:47 +02:00
CREDO23	b6710ae9af	refactor(jira): remove dead legacy indexing and write tools (mcp-only now)	2026-06-02 16:38:00 +02:00

1 2 3 4 5 ...

716 commits