SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-06-06 20:15:17 +02:00

Author	SHA1	Message	Date
CREDO23	24b62a63b4	refactor(agents): introduce chat/ category; dissolve top-level agents/shared Recursive shared-folder rule: a shared/ must be shared by ALL siblings at its level. The kernel (context, compaction, retry_after, web_search) was shared by only 2 of the agents -- anonymous_chat + multi_agent_chat -- never by podcaster or video_presentation. Those 2 are the "chat" category, so their shared code belongs in that category's shared/, not the top-level one. app/agents/anonymous_chat/ -> app/agents/chat/anonymous_chat/ app/agents/multi_agent_chat/ -> app/agents/chat/multi_agent_chat/ app/agents/shared/ -> app/agents/chat/shared/ (anon<->mac kernel) Top-level app/agents/shared/ is gone: nothing was shared across all three categories (chat / podcaster / video_presentation). ~289 import sites rewritten (app.agents.{anonymous_chat,multi_agent_chat,shared} -> app.agents.chat.*); all moves are git renames (history preserved). app/agents/ now: chat/, podcaster/, video_presentation/, runtime/.	2026-06-05 12:54:02 +02:00
CREDO23	82c5dc5b02	refactor(agents): move mac-only modules out of the cross-agent shared kernel app/agents/shared/ is a sibling of anonymous_chat/podcaster/multi_agent_chat/ video_presentation, so it should only hold code shared across 2+ of those agents. In practice podcaster and video_presentation import nothing from it, and anonymous_chat needs only context + compaction + retry_after + web_search. Everything else was multi_agent_chat-only (the boundary just passes through). Move the multi_agent_chat-only cluster into multi_agent_chat/shared/ (files moved verbatim via git rename; ~116 import sites rewritten): errors, feature_flags, filesystem_selection, path_resolver, prompt_caching, sandbox, llm_config, mention_resolver middleware/busy_mutex, middleware/kb_persistence busy_mutex/llm_config/mention_resolver are boundary-only but import the moved modules, so they were folded in to avoid a backwards shared -> multi_agent_chat dependency. main_agent builders now import the impls directly; the shared middleware barrel keeps only the genuinely-shared compaction + retry_after. Also delete the dead leftover shared/plugins and shared/skills dirs (live copies already live under main_agent/). Remaining in app/agents/shared/: context, system_prompt(+prompts), checkpointer, middleware/{compaction,retry_after,dedup_tool_calls}, tools/. checkpointer and system_prompt are boundary-only infra pending a dedicated home decision.	2026-06-05 12:30:15 +02:00
CREDO23	8ae190a11d	refactor(agents): move MAC middleware impls out of shared kernel knowledge_search, memory_injection and scoped_model_fallback no longer belong in the cross-agent kernel (app/agents/shared/middleware): they are consumed only inside multi_agent_chat. Relocate each impl next to the builder that uses it: - knowledge_search.py -> multi_agent_chat/shared/middleware/ (genuinely shared: its _render_priority_message feeds kb_context_projection, used by both the main agent and the KB subagent) - memory_injection.py -> multi_agent_chat/shared/middleware/ (beside its memory.py builder) - scoped_model_fallback.py -> multi_agent_chat/shared/middleware/resilience/ (beside fallback.py/bundle.py) Impls moved verbatim (git rename). Builders/consumers now import the local sibling; main_agent knowledge_priority imports the new shared path; shared middleware barrel trimmed. Tests: repoint imports; convert the knowledge_search monkeypatch targets from brittle dotted-string form to object-based patching (monkeypatch.setattr on the imported module), which is robust to import ordering. No behavior change.	2026-06-05 12:04:31 +02:00
CREDO23	21509e7eca	refactor(agents): group filesystem backends under filesystem/backends/ The concrete filesystem backends are consumed only by the MAC filesystem layer (tools, path-resolution middleware, the resolver, skills backend) and tests -- no external app code. Group them next to the filesystem middleware they serve: - filesystem_backends.py -> filesystem/backends/resolver.py - middleware/kb_postgres_backend.py -> filesystem/backends/kb_postgres.py - middleware/local_folder_backend.py -> filesystem/backends/local_folder.py - middleware/multi_root_local_folder_backend.py -> .../multi_root_local_folder.py - document_xml.py -> filesystem/backends/document_xml.py Repoint all 21 importers. No behavior change; import-all + filesystem backend/path-resolution/knowledge-search unit tests stay green (478).	2026-06-05 11:02:26 +02:00
CREDO23	2db4ad479e	refactor(agents): colocate KB-search tool with its sole consumer; fix report ImportError shared/tools/knowledge_base.py had exactly one production consumer: the report deliverable, which imported it via `from .knowledge_base import ...` -- a sibling path that did not exist, so the report KB-search path would raise ImportError at runtime. Move the module next to report.py (subagents/builtins/deliverables/tools/) which makes that relative import valid, and move its only dependency (shared/utils.py date helpers) to multi_agent_chat/shared/date_filters.py, shared between the KB tool and the knowledge_search middleware. Drop the now-unused knowledge-base re-exports from the shared/tools barrel and repoint the integration tests. import-all + error-contract stay green.	2026-06-05 10:28:56 +02:00
CREDO23	add9e14694	refactor(agents): colocate middleware into vertical slices Eliminate the top-level multi_agent_chat/middleware/ package so each slice owns its middleware (vertical-slice colocation): - middleware/shared/ -> shared/middleware/ (cross-slice middleware) - middleware/subagent/ -> subagents/shared/middleware/ (subagent stack) - main_agent/middleware/ already colocated in Slice A The moved shared/ subtree is internally consistent (all relative imports stay within it), so only external absolute refs were rewritten. The subagent stack's ..shared.* relatives were promoted to absolute paths to the new shared/middleware/ location. multi_agent_chat/ root is now: main_agent/, shared/, subagents/. Verified: 2430 unit tests pass, 1 skipped (baseline unchanged).	2026-06-04 18:13:47 +02:00
CREDO23	1acde6a470	test(agents): cover live filesystem middleware, retire dead twin The single-agent-era filesystem middleware (app/agents/shared/middleware/ filesystem.py, ~2000 lines) was never instantiated in production, yet three unit suites validated it — an illusory guardrail while the live decomposed middleware (multi_agent_chat/middleware/shared/filesystem) was unguarded. Close the gap before reorganizing the agents module: - Add 14 integration tests driving live B's tools in desktop mode (real on-disk effects) and cloud mode (in-state staging, namespace policy). - Port all high-value dead-twin assertions onto the live path: cloud rm/rmdir staging + guard rails, KBPostgresBackend delete-view filter, mode-scoped system prompt, cwd/relative/namespace resolution, multi-root mount normalization. - Delete dead twin filesystem.py, drop its __init__ re-export, and retire its 3 dead-twin tests. Verified: test_import_all + middleware unit + FS integration all green.	2026-06-04 17:46:49 +02:00
CREDO23	aab95b9130	refactor(agents): move tools package to app/agents/shared (slice 6) Relocate the entire new_chat/tools/ package (62 files incl. registry, hitl, MCP cluster, and all connector subpackages: gmail/slack/discord/teams/drive/etc.) to the shared kernel. The package turned out to be a clean cohesive cluster: its only references to non-tools new_chat modules were comments, and its middleware deps were already flipped to shared in slice 5c. Flip 33 live importers (multi-agent, flows, routes, services, anonymous_agent, tests). Re-export shims remain for the frozen single-agent stack: a package __init__ mirroring the public surface (new_chat.__init__ imports it) plus invalid_tool + registry submodule shims (chat_deepagent imports those). Resolves slice 5c's two transient back-edges: shared/middleware/action_log (TYPE_CHECKING ToolDefinition) and tool_call_repair (local INVALID_TOOL_NAME) now point at app.agents.shared.tools.	2026-06-04 13:11:56 +02:00
CREDO23	227983a104	refactor(agents): move middleware package to app/agents/shared (slice 5c) Relocate the entire new_chat/middleware/ package to the shared kernel as one cohesive unit (it is live shared infrastructure: the multi-agent stack wraps nearly every middleware via multi_agent_chat/middleware/main_agent/*, and anonymous_agent consumes it too). Flip 69 live importers across both the package-path and submodule-path forms. Shims left for the frozen single-agent stack: a package __init__ re-export plus submodule shims for permission, skills_backends, and scoped_model_fallback (the three imported via submodule path by chat_deepagent/subagents). Cycle break: importing shared.middleware previously reached back into new_chat.tools at module load, which dragged in new_chat.__init__ -> chat_deepagent -> the middleware shim -> half-initialized shared.middleware. Made action_log's ToolDefinition import TYPE_CHECKING-only and tool_call_repair's INVALID_TOOL_NAME import function-local. These tools-package back-edges fully resolve in slice 6. Asset note: skills_backends._default_builtin_root now walks to app/agents/new_chat/skills/builtin (the skills/ tree migrates in slice 7).	2026-06-04 13:00:41 +02:00
CREDO23	fb70e23dd2	test: add agent refactor guardrail suite	2026-06-04 11:44:23 +02:00
CREDO23	3f770203ca	test: add notifications integration behavior guard	2026-06-03 21:53:06 +02:00
Anish Sarkar	cb9a0f327c	test: refactor Gmail indexer tests to utilize ComposioService and hybrid chunking	2026-05-16 21:26:40 +05:30
Anish Sarkar	a0f2563dc3	test: update Stripe and Google Calendar integration tests to use ComposioService	2026-05-16 21:13:17 +05:30
Anish Sarkar	bd452b3df4	fix(tests): improve composio module hijack in integration tests	2026-05-13 00:44:20 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c8374e6c5b	feat: improved document, folder mentions rendering Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions	2026-05-09 22:15:51 -07:00
Anish Sarkar	de6fc80dbd	chore: ran linting	2026-05-09 05:28:09 +05:30
Anish Sarkar	f7bac59a4b	test(integration): enhance Drive indexer credential resolution tests for Composio and native connectors	2026-05-09 05:26:36 +05:30
Anish Sarkar	87dd5af259	test(backend): add Composio route integration tests	2026-05-06 17:19:32 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	19b6e0a025	feat: moved chat persistance to Server Side	2026-05-04 03:06:15 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	31a372bb84	feat: updated agent harness	2026-04-28 09:22:19 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	8d50f90060	chore: linting Some checks failed Obsidian Plugin Lint / lint (push) Has been cancelled	2026-04-27 14:04:50 -07:00
Anish Sarkar	02795e08e3	feat: add server time to obsidian connect responses and enhance error handling - Included server_time_utc in the connect response schema for better synchronization. - Updated obsidian_connect function to set server_time_utc during connection handling. - Enhanced integration tests to verify the presence of server_time_utc in responses. - Improved connectivity status recovery in the sync engine for better error management.	2026-04-25 03:57:07 +05:30
Anish Sarkar	e84dc87c5b	feat(obsidian_plugin): validate binary attachments and enforce MIME type checks	2026-04-25 00:23:17 +05:30
Anish Sarkar	6ac5256431	feat: implement background processing for binary attachments in Obsidian plugin - Added a new Celery task for indexing non-markdown attachments. - Enhanced the Obsidian plugin schema to support binary attachments. - Updated routes to enqueue binary attachments for background processing. - Improved metadata handling for binary attachments during indexing. - Added tests for binary attachment processing and validation.	2026-04-22 23:00:34 +05:30
Anish Sarkar	6eeaa2db4d	feat: enhance Obsidian plugin schema with HeadingRef class	2026-04-22 20:26:58 +05:30
Anish Sarkar	3eb4d55ef5	chore: ran linting	2026-04-22 06:40:39 +05:30
Anish Sarkar	4a75603d4f	feat: implement sync notifications for Obsidian plugin - Added functionality to create and update notifications during the Obsidian sync process. - Improved handling of sync completion and failure notifications. - Updated connector naming convention in various locations for consistency.	2026-04-22 06:38:51 +05:30
Anish Sarkar	54ce2666f5	feat: implement cross-device deduplication for Obsidian connectors using vault fingerprinting and enhance connector management	2026-04-21 04:21:33 +05:30
Anish Sarkar	2d90ed0fec	feat: deactivate legacy Obsidian connectors and implement partial unique index for improved upsert handling	2026-04-21 03:18:44 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	656e061f84	feat: add processing mode support for document uploads and ETL pipeline, improded error handling ux Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions - Introduced a `ProcessingMode` enum to differentiate between basic and premium processing modes. - Updated `EtlRequest` to include a `processing_mode` field, defaulting to basic. - Enhanced ETL pipeline services to utilize the selected processing mode for Azure Document Intelligence and LlamaCloud parsing. - Modified various routes and services to handle processing mode, affecting document upload and indexing tasks. - Improved error handling and logging to include processing mode details. - Added tests to validate processing mode functionality and its impact on ETL operations.	2026-04-14 21:26:00 -07:00
CREDO23	a95bf58c8f	Make Vision LLM opt-in for uploads and connectors	2026-04-10 16:45:51 +02:00
Anish Sarkar	56c5809170	chore: ran linting	2026-04-08 18:23:03 +05:30
Anish Sarkar	37c52ce7ea	feat: implement indexing progress management in local folder indexing process and enhance related test coverage	2026-04-08 18:01:55 +05:30
Anish Sarkar	a624c86b04	refactor: update file skipping logic in Dropbox, Google Drive, and OneDrive connectors to return unsupported extension information	2026-04-07 05:11:15 +05:30
Anish Sarkar	f03bf05aaa	refactor: enhance Google Drive indexer to support file extension filtering, improving file handling and error reporting	2026-04-06 22:34:49 +05:30
Anish Sarkar	a2b3541046	chore: ran linting	2026-04-04 03:11:56 +05:30
Anish Sarkar	0d2acc665d	Merge remote-tracking branch 'upstream/dev' into feat/page-limit-connectors	2026-04-04 03:08:27 +05:30
Anish Sarkar	ce40da80ea	feat: implement page limit estimation and enforcement in file based connector indexers - Added a static method `estimate_pages_from_metadata` to `PageLimitService` for estimating page counts based on file metadata. - Integrated page limit checks in Google Drive, Dropbox, and OneDrive indexers to prevent exceeding user quotas during file indexing. - Updated relevant indexing methods to utilize the new page estimation logic and enforce limits accordingly. - Enhanced tests for page limit functionality, ensuring accurate estimation and enforcement across different file types.	2026-04-04 02:51:28 +05:30
Anish Sarkar	9c0af6569d	feat: implement page limit checks in local folder indexing to manage user page usage	2026-04-03 19:13:25 +05:30
Anish Sarkar	edda5b98cb	chore: ran linting	2026-04-03 17:38:29 +05:30
Anish Sarkar	b759bb36a9	feat: add direct conversion support for CSV, TSV, and HTML files in local folder indexing	2026-04-03 17:36:48 +05:30
Anish Sarkar	746c730b2e	chore: ran linting	2026-04-03 13:14:40 +05:30
Anish Sarkar	62b44889d1	Merge remote-tracking branch 'upstream/dev' into feat/local-folder-sync	2026-04-03 11:42:43 +05:30
Anish Sarkar	2b9d79d44c	feat: add integration tests for batch processing of local folder indexing, covering multiple file scenarios and error handling	2026-04-03 10:04:14 +05:30
Anish Sarkar	1fa8e1cc83	feat: refactor folder indexing to support batch processing of multiple files, enhancing performance and error handling	2026-04-03 10:02:36 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	62e698d8aa	refactor: streamline document upload limits and enhance handling of mentioned documents - Updated maximum file size limit to 500 MB per file. - Removed restrictions on the number of files per upload and total upload size. - Enhanced handling of user-mentioning documents in the knowledge base search middleware. - Improved document reading and processing logic to accommodate new features and optimizations.	2026-04-02 19:39:10 -07:00
Anish Sarkar	53df393cf7	refactor: streamline local folder indexing logic by removing unused imports, enhancing content hashing, and improving document creation process	2026-04-02 23:28:23 +05:30
Anish Sarkar	c27d24a117	feat: enhance folder indexing by adding root folder ID support and implement folder creation and cleanup logic	2026-04-02 22:41:45 +05:30
Anish Sarkar	caf2525ab5	fix: update folder ID collection logic to include deleted directories and adjust test cases for document titles	2026-04-02 22:29:07 +05:30
Anish Sarkar	22ee5c99cc	refactor: remove Local Folder connector and related tasks, implement new folder indexing endpoints	2026-04-02 22:21:31 +05:30

1 2 3

102 commits