SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-05-25 19:15:18 +02:00

Author	SHA1	Message	Date
Anish Sarkar	53691f9c51	feat(agents): track permission and compaction events	2026-05-21 23:02:54 +05:30
Anish Sarkar	ea3d0a6463	feat(agents): emit metrics for model and tool calls	2026-05-21 23:02:36 +05:30
CREDO23	49da7a57df	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed Resolves: surfsense_backend/app/agents/new_chat/middleware/memory_injection.py - Took both imports: upstream moved MEMORY_HARD_LIMIT/SOFT_LIMIT to app.services.memory; kept our perf-logger import for timing. Pulls in upstream changes: - Memory document feature (services/memory refactor, removal of app.agents.new_chat.memory_extraction and background extraction in stream_new_chat — agent now drives memory via update_memory tool). - BACKEND_URL env refactor across web tool-ui/editor/chat/dashboard/lib. - GitHub Actions backend test workflow + pre-commit biome bump. - Token-display polish in MessageInfoDropdown; save_memory no-update sentinel. Verified: 1723 unit tests pass, ruff clean. No semantic regression in stream_new_chat (their memory-extraction deletion and our preflight removal touch different functions).	2026-05-20 21:23:48 +02:00
CREDO23	d5ee8cc4cd	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed	2026-05-20 19:22:49 +02:00
CREDO23	704d1bf18f	refactor(mcp): per-connector cache refresh on lifecycle events Collapse the invalidate + warmup pair into a single refresh_mcp_tools_cache_for_connector(connector_id, search_space_id) helper and scope live discovery to the one connector that changed instead of the whole search space. - new mcp_tool.discover_single_mcp_connector: load one connector, refresh OAuth if needed, force live MCP discovery so its cached_tools row is rewritten; returned wrappers are discarded since the in-process LRU is rebuilt lazily on the next user query - mcp_tools_cache.refresh_mcp_tools_cache_for_connector: synchronously evicts the per-space LRU (LRU keys cannot scope finer) and schedules the per-connector prefetch via loop.create_task - routes (OAuth callback, MCP POST, MCP PUT) collapse their two back-to-back calls into a single refresh call; DELETE handlers keep using bare invalidate_mcp_tools_cache (nothing to prefetch) No new automated tests: the new functions are I/O glue (DB + network) where mocked unit tests would test implementation rather than behavior. The existing 9 unit tests for the cached_tools data shape are unchanged.	2026-05-20 17:43:27 +02:00
CREDO23	c0aa4261ac	perf(mcp): persist list_tools discovery in connector.config.cached_tools Skip the ~1-3s MCP initialize + list_tools handshake on every cache miss by reading tool definitions from the connector row we already load. Lazy populate on first miss, self-heal on corrupt cache, zero schema migration.	2026-05-20 16:11:07 +02:00
CREDO23	db8bffab38	perf(prompt-cache): enable Azure prompt_cache_key routing hint Splits the OpenAI-family gate into per-param predicates so AZURE and AZURE_OPENAI configs now receive prompt_cache_key for backend routing affinity (Microsoft auto-caches GPT-4o+ deployments at >=1024 tokens; the key clusters same-prefix requests on the same GPU pool and raises hit rate on turn 2+). prompt_cache_retention stays opted out for Azure because litellm 1.83.14's Azure transformer would drop it silently; revisit when Azure's supported params list is updated.	2026-05-20 11:58:15 +02:00
CREDO23	71dead0406	perf(kb-planner): route internal planner calls to dedicated small/fast LLM Adds an optional planner LLM role wired through KnowledgePriorityMiddleware so KB query rewriting, date extraction, and recency classification run on a cheap model (e.g. gpt-4o-mini, Haiku, Azure nano) instead of the user's chat LLM. Operators opt in by setting is_planner: true on exactly one global config; without it, behavior is unchanged.	2026-05-20 11:42:52 +02:00
Anish Sarkar	132e7b3c44	refactor: remove memory extraction functions and related components from the new chat agent	2026-05-20 14:03:28 +05:30
CREDO23	52d425f170	perf(kb-persistence): offload sync embed_texts to thread _create_document and _update_document run on the chat critical path when the filesystem subagent writes via the user's chat turn. Both called embed_texts synchronously inside an async coroutine, blocking the event loop for the duration of the embed.	2026-05-20 10:03:14 +02:00
CREDO23	4fa85a9a94	perf(kb-search): offload sync embed_texts to thread embed_texts holds a threading.Lock and runs a sync embedding call inside search_knowledge_base, an async coroutine on the KB priority middleware critical path. Blocking the event loop here stalls every other coroutine on the worker (SSE keepalives, concurrent chat requests, background tasks). Wrap in asyncio.to_thread so the embed runs on the default executor pool while the loop keeps serving.	2026-05-20 10:02:38 +02:00
Anish Sarkar	5247dc7097	feat: refine private and team memory protocols	2026-05-20 02:02:10 +05:30
Anish Sarkar	ceedd02353	refactor: extract shared memory service	2026-05-20 02:01:36 +05:30
CREDO23	b3b66e4c48	perf(new-chat): add memory_injection middleware timing log	2026-05-19 21:30:19 +02:00
CREDO23	1df40fbe31	perf(new-chat): add knowledge_tree middleware timing log	2026-05-19 21:30:14 +02:00
CREDO23	9bfba34e8e	perf(mcp): add per-call, discovery, and oauth-refresh timing logs	2026-05-19 21:29:56 +02:00
Anish Sarkar	f65bc81509	Merge remote-tracking branch 'upstream/dev' into feat/ui-revamp	2026-05-16 19:26:36 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c187b04e82	chore: linting	2026-05-15 17:33:44 -07:00
CREDO23	c8b756ae8f	hitl/wire: rename 'always' decision-type to 'approve_always' Renames the SurfSense HITL extension decision-type from "always" to "approve_always" so it sits in the same verb-first family as "approve", "reject", and "edit". The Python constant is now SURFSENSE_DECISION_APPROVE_ALWAYS; the wire value, the permission-domain decision_type, and the FE union members all match (no wire/internal mismatch). Both the multi_agent_chat permission middleware and the legacy new_chat one accept the new wire value; the FE types.ts union is updated accordingly. The "context.always" payload key is intentionally left untouched - it's the patterns-to-promote field, semantically distinct from the decision type.	2026-05-15 14:47:32 +02:00
CREDO23	a97d1548a6	multi_agent_chat/permissions: surface MCP tool metadata into ask interrupts The FE permission card needs mcp_connector_id, mcp_server, and tool_description in the interrupt context to render "Always Allow" against the right connected account. Thread the tool through the ask pipeline: - pack_subagent → build_permission_mw(tools=...) → PermissionMiddleware (tools_by_name) → request_permission_decision(tool=...) → build_permission_ask_payload(tool=...) projects card fields out of BaseTool. - mcp_tool.py: stdio path now stashes mcp_connector_id in metadata for parity with the HTTP path.	2026-05-15 11:28:06 +02:00
Anish Sarkar	01d7379914	refactor: add public URL handling for SurfSense documents across various components and schemas	2026-05-15 02:05:11 +05:30
CREDO23	246dae40a8	Merge upstream/dev into feature/multi-agent	2026-05-12 21:23:37 +02:00
CREDO23	df2afed18d	subagents/knowledge_base: wire KB specialist into orchestrator (renderer/projector split, FS middleware stack, cloud-mode gating)	2026-05-11 20:43:44 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c8374e6c5b	feat: improved document, folder mentions rendering Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions	2026-05-09 22:15:51 -07:00
CREDO23	78f4747382	refactor(chat): stream agent events via stream_output and remove parity v2 flag	2026-05-07 19:40:10 +02:00
CREDO23	9a4ee5d16b	chore: trim narrative comments and docstrings	2026-05-05 18:27:46 +02:00
CREDO23	b394dc71c7	feat(multi-agent): extend model fallback to general-purpose subagent	2026-05-05 18:12:38 +02:00
CREDO23	1745d7dccf	feat(middleware): scope model fallback to provider/network errors only	2026-05-05 18:04:47 +02:00
Rohan Verma	ce6d9233bc	Merge pull request #1335 from AnishSarkar22/fix/memory-extraction refactor(memory): streamline memory extraction	2026-05-04 17:26:27 -07:00
CREDO23	5119915f4f	Merge upstream/dev into feature/multi-agent	2026-05-05 01:44:46 +02:00
CREDO23	277bd50f37	Harden HITL for multi-step tasks: bypass internal MCP gate, full-args dedup, and decision-envelope normalization.	2026-05-04 19:25:27 +02:00
CREDO23	4ac3f0b304	Forward HITL decisions from the streaming layer to subagents via the config side-channel.	2026-05-04 18:42:58 +02:00
Anish Sarkar	e38e20b484	fix: handle empty response in forced rewrite function - Updated the `_forced_rewrite` function to strip whitespace from the extracted text and added a warning log if the response is empty, preventing potential issues with empty rewrites.	2026-05-04 12:18:09 +05:30
Anish Sarkar	b981b51ab1	Merge remote-tracking branch 'upstream/dev' into fix/memory-extraction	2026-05-04 12:03:44 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	a34f1fb25c	feat: implement agent caches and fix invalid prompt cache configs Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions - Added a new function `_warm_agent_jit_caches` to pre-warm agent caches at startup, reducing cold invocation costs. - Updated the `SurfSenseContextSchema` to include per-invocation fields for better state management during agent execution. - Introduced caching mechanisms in various tools to ensure fresh database sessions are used, improving performance and reliability. - Enhanced middleware to support new context features and improve error handling during connector and document type discovery.	2026-05-03 06:03:40 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c938d39277	feat: moved most things behind correct feature flag	2026-05-02 23:10:48 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	cea8618aed	fix: fixed composio issues	2026-05-02 21:16:03 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	47b2994ec7	feat: fixed vision/image provider specific errors and fixed podcast/video streaming Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions	2026-05-02 19:18:53 -07:00
Anish Sarkar	9975e085aa	refactor(memory): streamline memory extraction by utilizing extract_text_content utility	2026-05-02 16:10:30 +05:30
Rohan Verma	451a98936e	Merge pull request #1332 from AnishSarkar22/feat/model-pinnning-mode Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions feat: Auto-pin quality scoring, OpenRouter tier refactor and live usage sidebar	2026-05-01 15:57:19 -07:00
Anish Sarkar	25ccc959cf	feat(busy_mutex): enhance thread lock management to prevent stale middleware interference	2026-05-02 01:35:30 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	e57c3a7d0c	feat: prompt caching Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions - Updated `litellm` dependency version from `1.83.4` to `1.83.7`. - Adjusted `aiohttp` version from `3.13.5` to `3.13.4` in the lock file. - Implemented `apply_litellm_prompt_caching` in `chat_deepagent.py` to improve prompt caching. - Added model name resolution logic in `chat_deepagent.py` to ensure correct provider-variant dispatch. - Enhanced `llm_config.py` to configure prompt caching for various LLM providers. - Updated tests to verify correct model name forwarding and prompt caching behavior.	2026-05-01 05:10:53 -07:00
CREDO23	5d3b8b9ca9	Merge remote-tracking branch 'upstream/dev' into feature/multi-agent	2026-05-01 00:05:20 +02:00
Anish Sarkar	af66fbf106	refactor(chat): implement turn cancellation and status management in new chat routes for improved user experience and error handling	2026-05-01 01:47:52 +05:30
CREDO23	3908131105	Fix stale feature flags cache so specialized subagents respect env.	2026-04-29 21:36:34 +02:00
CREDO23	bf9b606a61	Wire Linear and Slack specialists and prompt routing.	2026-04-29 20:51:06 +02:00
CREDO23	41cb4a567b	Harden Linear and Slack MCP subagent permissions.	2026-04-29 20:24:21 +02:00
CREDO23	3dec2a7327	Add a Slack operations specialist subagent.	2026-04-29 16:25:36 +02:00
CREDO23	8ddfa6ac6f	Add shared safety constants for provider subagents.	2026-04-29 16:25:28 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c110f5b955	feat: improved agent streaming	2026-04-29 07:20:31 -07:00

1 2 3 4 5 ...

404 commits