SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-07-12 22:42:13 +02:00

Author	SHA1	Message	Date
Anish Sarkar	53691f9c51	feat(agents): track permission and compaction events	2026-05-21 23:02:54 +05:30
Anish Sarkar	ea3d0a6463	feat(agents): emit metrics for model and tool calls	2026-05-21 23:02:36 +05:30
CREDO23	49da7a57df	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed Resolves: surfsense_backend/app/agents/new_chat/middleware/memory_injection.py - Took both imports: upstream moved MEMORY_HARD_LIMIT/SOFT_LIMIT to app.services.memory; kept our perf-logger import for timing. Pulls in upstream changes: - Memory document feature (services/memory refactor, removal of app.agents.new_chat.memory_extraction and background extraction in stream_new_chat — agent now drives memory via update_memory tool). - BACKEND_URL env refactor across web tool-ui/editor/chat/dashboard/lib. - GitHub Actions backend test workflow + pre-commit biome bump. - Token-display polish in MessageInfoDropdown; save_memory no-update sentinel. Verified: 1723 unit tests pass, ruff clean. No semantic regression in stream_new_chat (their memory-extraction deletion and our preflight removal touch different functions).	2026-05-20 21:23:48 +02:00
CREDO23	d5ee8cc4cd	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed	2026-05-20 19:22:49 +02:00
CREDO23	704d1bf18f	refactor(mcp): per-connector cache refresh on lifecycle events Collapse the invalidate + warmup pair into a single refresh_mcp_tools_cache_for_connector(connector_id, search_space_id) helper and scope live discovery to the one connector that changed instead of the whole search space. - new mcp_tool.discover_single_mcp_connector: load one connector, refresh OAuth if needed, force live MCP discovery so its cached_tools row is rewritten; returned wrappers are discarded since the in-process LRU is rebuilt lazily on the next user query - mcp_tools_cache.refresh_mcp_tools_cache_for_connector: synchronously evicts the per-space LRU (LRU keys cannot scope finer) and schedules the per-connector prefetch via loop.create_task - routes (OAuth callback, MCP POST, MCP PUT) collapse their two back-to-back calls into a single refresh call; DELETE handlers keep using bare invalidate_mcp_tools_cache (nothing to prefetch) No new automated tests: the new functions are I/O glue (DB + network) where mocked unit tests would test implementation rather than behavior. The existing 9 unit tests for the cached_tools data shape are unchanged.	2026-05-20 17:43:27 +02:00
CREDO23	c0aa4261ac	perf(mcp): persist list_tools discovery in connector.config.cached_tools Skip the ~1-3s MCP initialize + list_tools handshake on every cache miss by reading tool definitions from the connector row we already load. Lazy populate on first miss, self-heal on corrupt cache, zero schema migration.	2026-05-20 16:11:07 +02:00
CREDO23	db8bffab38	perf(prompt-cache): enable Azure prompt_cache_key routing hint Splits the OpenAI-family gate into per-param predicates so AZURE and AZURE_OPENAI configs now receive prompt_cache_key for backend routing affinity (Microsoft auto-caches GPT-4o+ deployments at >=1024 tokens; the key clusters same-prefix requests on the same GPU pool and raises hit rate on turn 2+). prompt_cache_retention stays opted out for Azure because litellm 1.83.14's Azure transformer would drop it silently; revisit when Azure's supported params list is updated.	2026-05-20 11:58:15 +02:00
CREDO23	71dead0406	perf(kb-planner): route internal planner calls to dedicated small/fast LLM Adds an optional planner LLM role wired through KnowledgePriorityMiddleware so KB query rewriting, date extraction, and recency classification run on a cheap model (e.g. gpt-4o-mini, Haiku, Azure nano) instead of the user's chat LLM. Operators opt in by setting is_planner: true on exactly one global config; without it, behavior is unchanged.	2026-05-20 11:42:52 +02:00
Anish Sarkar	132e7b3c44	refactor: remove memory extraction functions and related components from the new chat agent	2026-05-20 14:03:28 +05:30
CREDO23	52d425f170	perf(kb-persistence): offload sync embed_texts to thread _create_document and _update_document run on the chat critical path when the filesystem subagent writes via the user's chat turn. Both called embed_texts synchronously inside an async coroutine, blocking the event loop for the duration of the embed.	2026-05-20 10:03:14 +02:00
CREDO23	4fa85a9a94	perf(kb-search): offload sync embed_texts to thread embed_texts holds a threading.Lock and runs a sync embedding call inside search_knowledge_base, an async coroutine on the KB priority middleware critical path. Blocking the event loop here stalls every other coroutine on the worker (SSE keepalives, concurrent chat requests, background tasks). Wrap in asyncio.to_thread so the embed runs on the default executor pool while the loop keeps serving.	2026-05-20 10:02:38 +02:00
CREDO23	0cdda14922	perf(kb subagent, desktop): cap evidence.content_excerpt to 500 chars	2026-05-20 09:43:36 +02:00
CREDO23	5edf0520c4	perf(kb subagent, cloud): cap evidence.content_excerpt to 500 chars	2026-05-20 09:43:32 +02:00
CREDO23	b554c600bb	perf(research subagent): cap evidence.findings and evidence.sources to bound output	2026-05-20 09:42:57 +02:00
CREDO23	6c173dc2a7	perf(teams subagent): stop echoing raw teams/channels/messages payload into evidence.items	2026-05-20 09:42:03 +02:00
CREDO23	20f7896a99	perf(luma subagent): stop echoing raw events list into evidence.items	2026-05-20 09:41:47 +02:00
CREDO23	f4e66718be	perf(discord subagent): stop echoing raw channels/messages payload into evidence.items	2026-05-20 09:41:36 +02:00
CREDO23	56d8ff89e2	perf(airtable subagent): stop echoing raw records list into evidence.items	2026-05-20 09:41:18 +02:00
CREDO23	1b2f13e25c	perf(clickup subagent): stop echoing raw tasks list into evidence.items	2026-05-20 09:41:04 +02:00
CREDO23	6be1b22ef6	perf(jira subagent): stop echoing raw issues list into evidence.items	2026-05-20 09:40:48 +02:00
CREDO23	6e5dd54bbf	perf(slack subagent): stop echoing raw messages list into evidence.items	2026-05-20 09:40:33 +02:00
CREDO23	d3d396a473	perf(linear subagent): stop echoing raw issues list into evidence.items	2026-05-20 09:40:18 +02:00
CREDO23	553becea28	perf(gmail subagent): stop echoing raw emails array into evidence.items	2026-05-20 09:40:00 +02:00
Anish Sarkar	5247dc7097	feat: refine private and team memory protocols	2026-05-20 02:02:10 +05:30
Anish Sarkar	ceedd02353	refactor: extract shared memory service	2026-05-20 02:01:36 +05:30
CREDO23	3a5e16e868	perf(calendar): stop echoing raw events into evidence.items	2026-05-19 21:30:28 +02:00
CREDO23	b3b66e4c48	perf(new-chat): add memory_injection middleware timing log	2026-05-19 21:30:19 +02:00
CREDO23	1df40fbe31	perf(new-chat): add knowledge_tree middleware timing log	2026-05-19 21:30:14 +02:00
CREDO23	bd153d3cdb	perf(multi-agent): add kb_context_projection timing log	2026-05-19 21:30:09 +02:00
CREDO23	33bfce4406	perf(subagent): add atask EXIT breakdown timing log	2026-05-19 21:30:05 +02:00
CREDO23	9e81f2a35b	perf(subagent): add subagent compile timing log	2026-05-19 21:30:01 +02:00
CREDO23	9bfba34e8e	perf(mcp): add per-call, discovery, and oauth-refresh timing logs	2026-05-19 21:29:56 +02:00
Anish Sarkar	f65bc81509	Merge remote-tracking branch 'upstream/dev' into feat/ui-revamp	2026-05-16 19:26:36 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c187b04e82	chore: linting	2026-05-15 17:33:44 -07:00
CREDO23	98b6977c68	permissions/ask: gate 'approve_always' palette entry on MCP-ness Only MCP tools have a persistence target for 'approve_always' (the connector's trusted-tools list); for native tools the decision lives only in the in-memory runtime ruleset. Reflect that in the wire palette so the FE can stay a pure renderer of allowed_decisions instead of peeking at context.mcp_connector_id to decide whether to show the 'Always Allow' button. The backend still accepts an 'approve_always' reply for any tool kind (in-memory promotion is harmless), it just doesn't advertise it when there's nowhere to persist.	2026-05-15 14:54:16 +02:00
CREDO23	c8b756ae8f	hitl/wire: rename 'always' decision-type to 'approve_always' Renames the SurfSense HITL extension decision-type from "always" to "approve_always" so it sits in the same verb-first family as "approve", "reject", and "edit". The Python constant is now SURFSENSE_DECISION_APPROVE_ALWAYS; the wire value, the permission-domain decision_type, and the FE union members all match (no wire/internal mismatch). Both the multi_agent_chat permission middleware and the legacy new_chat one accept the new wire value; the FE types.ts union is updated accordingly. The "context.always" payload key is intentionally left untouched - it's the patterns-to-promote field, semantically distinct from the decision type.	2026-05-15 14:47:32 +02:00
CREDO23	6671c91841	multi_agent_chat/permissions: persist 'always' decisions to trusted-tools list Until now an "Always Allow" reply only updated the in-memory runtime ruleset, evaporating after the session ended. Persist it to the existing connector.config['trusted_tools'] list so the next session's fetch_user_allowlist_rulesets picks it up and the user is never asked again for the same (connector, tool) pair. - TrustedToolSaver + make_trusted_tool_saver(user_id) in user_tool_allowlist: opens its own session via async_session_maker per call, logs and swallows failures (in-memory promotion is the canonical "always" path, durable persistence is opportunistic). - PermissionMiddleware._process is now pure: returns (state_update, list[_AlwaysPromotion]). aafter_model awaits the saver for each promotion; after_model discards them. Promotions are only emitted for tools whose metadata exposes mcp_connector_id, so native tools and KB FS ops are correctly skipped. - main_agent factory builds the saver once per turn and stashes it in dependencies["trusted_tool_saver"]; pack_subagent and the KB middleware stack forward it through build_permission_mw. - Renamed pm._process(state, None) call sites in two existing tests to pm.after_model(state, None) so they exercise the public hook contract instead of the now-tuple-returning private method.	2026-05-15 14:07:08 +02:00
CREDO23	a97d1548a6	multi_agent_chat/permissions: surface MCP tool metadata into ask interrupts The FE permission card needs mcp_connector_id, mcp_server, and tool_description in the interrupt context to render "Always Allow" against the right connected account. Thread the tool through the ask pipeline: - pack_subagent → build_permission_mw(tools=...) → PermissionMiddleware (tools_by_name) → request_permission_decision(tool=...) → build_permission_ask_payload(tool=...) projects card fields out of BaseTool. - mcp_tool.py: stdio path now stashes mcp_connector_id in metadata for parity with the HTTP path.	2026-05-15 11:28:06 +02:00
Anish Sarkar	01d7379914	refactor: add public URL handling for SurfSense documents across various components and schemas	2026-05-15 02:05:11 +05:30
CREDO23	ef1152b80e	multi_agent_chat/permissions: layer user allow-list into subagent compile	2026-05-14 21:57:38 +02:00
CREDO23	31d6b43a42	multi_agent_chat/shared: drop bucket types and helpers	2026-05-14 20:10:25 +02:00
CREDO23	014801c764	multi_agent_chat/loader: MCP tools as flat list[BaseTool] per agent	2026-05-14 20:10:11 +02:00
CREDO23	5a00df8e48	multi_agent_chat/builtins: KB+deliverables+memory+research adopt RULESET + flat load_tools()	2026-05-14 20:09:55 +02:00
CREDO23	3bb90124d2	multi_agent_chat/connectors: every route declares its own RULESET + flat load_tools()	2026-05-14 20:09:49 +02:00
CREDO23	d45dfbfbd6	multi_agent_chat: pack_subagent owns per-subagent PermissionMiddleware via Ruleset	2026-05-14 20:09:29 +02:00
CREDO23	67142e68b1	multi_agent_chat: scope MCP allow/ask permissions per subagent + drop "policy" synonym	2026-05-14 18:09:14 +02:00
CREDO23	0723702320	multi_agent_chat: real-graph regressions for unified HITL paths + format pass	2026-05-14 17:41:24 +02:00
CREDO23	adb52fb575	multi_agent_chat: KB owns its ruleset, drop interrupt_on duplication	2026-05-14 17:41:07 +02:00
CREDO23	d68280113b	multi_agent_chat/connectors+builtins: adopt symmetric self_gated_tool_permission_row helper	2026-05-14 17:40:59 +02:00
CREDO23	a06aec2821	multi_agent_chat/subagents: HITL umbrella + ToolKind rename	2026-05-14 17:40:29 +02:00

1 2 3 4 5 ...

685 commits