SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-07-10 22:32:16 +02:00

Author	SHA1	Message	Date
Anish Sarkar	dbb652d4f8	feat(observability): add telemetry error and event helpers	2026-05-22 17:48:01 +05:30
Anish Sarkar	0fe2bba5a8	refactor(chat): replace TooltipIconButton with Button for scroll functionality	2026-05-22 17:46:35 +05:30
Anish Sarkar	87a4dcfd05	feat(tasks): record indexing heartbeat metrics	2026-05-22 13:50:32 +05:30
Anish Sarkar	7c07c220fc	feat(connectors): add connector sync spans	2026-05-22 13:49:59 +05:30
Anish Sarkar	4e3a6dff46	feat(etl): instrument extraction spans and outcomes	2026-05-22 13:49:42 +05:30
Anish Sarkar	8bca29fe0d	feat(agents): track subagent invocation telemetry	2026-05-22 13:48:57 +05:30
Anish Sarkar	5a6b92c2b6	feat(chat): instrument streamed chat request telemetry	2026-05-22 13:48:19 +05:30
Anish Sarkar	f7f49de109	feat(observability): add chat subagent and ETL telemetry primitives	2026-05-22 13:47:50 +05:30
Anish Sarkar	21d9b1f218	fix(observability): sanitize outbound HTTP span URLs	2026-05-22 13:47:10 +05:30
Anish Sarkar	cea5605e32	feat(indexing): track indexing and connector outcomes	2026-05-21 23:03:43 +05:30
Anish Sarkar	b9d76f006d	feat(retriever): instrument knowledge base search	2026-05-21 23:03:31 +05:30
Anish Sarkar	53691f9c51	feat(agents): track permission and compaction events	2026-05-21 23:02:54 +05:30
Anish Sarkar	ea3d0a6463	feat(agents): emit metrics for model and tool calls	2026-05-21 23:02:36 +05:30
Anish Sarkar	6095b48b5f	feat(observability): add SurfSense metric helpers	2026-05-21 23:02:20 +05:30
Anish Sarkar	eb2e2b253b	feat(observability): add OpenTelemetry process bootstrap	2026-05-21 23:01:54 +05:30
Anish Sarkar	60049936e3	chore(dev): add local OpenTelemetry backend configuration	2026-05-21 23:00:56 +05:30
Anish Sarkar	15458157b9	refactor(ui): update alert variants and button styles in connector and document upload popups	2026-05-21 19:16:17 +05:30
Anish Sarkar	2fd05d720e	chore: add OpenTelemetry dependencies and update lock file	2026-05-21 17:23:41 +05:30
Anish Sarkar	862381f4e6	Merge remote-tracking branch 'upstream/dev' into feat/opentelemetry	2026-05-21 03:46:53 +05:30
Rohan Verma	5c4da79da4	Merge pull request #1422 from CREDO23/improvement-agent-speed [Improvement] Agent: faster turns and lower LLM cost	2026-05-20 14:57:19 -07:00
Anish Sarkar	6e27f63d85	refactor(env): consolidate BACKEND_URL imports in GoogleLoginButton and DocumentTabContent	2026-05-21 00:54:22 +05:30
CREDO23	49da7a57df	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed Resolves: surfsense_backend/app/agents/new_chat/middleware/memory_injection.py - Took both imports: upstream moved MEMORY_HARD_LIMIT/SOFT_LIMIT to app.services.memory; kept our perf-logger import for timing. Pulls in upstream changes: - Memory document feature (services/memory refactor, removal of app.agents.new_chat.memory_extraction and background extraction in stream_new_chat — agent now drives memory via update_memory tool). - BACKEND_URL env refactor across web tool-ui/editor/chat/dashboard/lib. - GitHub Actions backend test workflow + pre-commit biome bump. - Token-display polish in MessageInfoDropdown; save_memory no-update sentinel. Verified: 1723 unit tests pass, ruff clean. No semantic regression in stream_new_chat (their memory-extraction deletion and our preflight removal touch different functions).	2026-05-20 21:23:48 +02:00
Anish Sarkar	7a0de6d377	Merge remote-tracking branch 'upstream/dev' into feat/opentelemetry	2026-05-21 00:46:40 +05:30
Rohan Verma	883ac81ce1	Merge pull request #1420 from AnishSarkar22/fix/pnpm-version-ci fix: Update CI workflow versions and scoped test triggers	2026-05-20 12:14:47 -07:00
Rohan Verma	54716105d2	Merge pull request #1418 from VarunShukla07/fix/env-config-tool-ui-generators refactor(env): replace inline process.env reads with BACKEND_URL in tool-ui generators	2026-05-20 12:14:09 -07:00
Rohan Verma	1a6d1ced43	Merge pull request #1417 from VarunShukla07/fix/env-config-editor-chat-dashboard refactor(env): replace inline process.env reads with BACKEND_URL in editor, chat, dashboard and settings	2026-05-20 12:14:01 -07:00
Rohan Verma	c97bb81126	Merge pull request #1416 from VarunShukla07/fix/env-config-connector-forms refactor(env): replace inline process.env reads with BACKEND_URL in connector forms and hooks	2026-05-20 12:13:50 -07:00
Rohan Verma	8823574c19	Merge pull request #1415 from VarunShukla07/fix/env-config-lib-files refactor(env): replace inline process.env reads with BACKEND_URL in lib/	2026-05-20 12:13:12 -07:00
Rohan Verma	55cce4ea59	Merge pull request #1414 from AnishSarkar22/feature/memory-support-document-panel feat: improve memory extraction & add document-panel memory editing	2026-05-20 12:12:27 -07:00
Anish Sarkar	8c91c9d76b	refactor: replace ContextMenu with DropdownMenu in SearchSpaceAvatar for improved mobile and desktop interactions	2026-05-21 00:19:59 +05:30
CREDO23	d5ee8cc4cd	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed	2026-05-20 19:22:49 +02:00
CREDO23	2be3f04df5	chore(scripts): drop one-off MCP session lifetime probe The probe answered its question (informing the cached_tools persistence design). Future MCP session-pooling work, if revived, can recreate it.	2026-05-20 19:11:00 +02:00
CREDO23	704d1bf18f	refactor(mcp): per-connector cache refresh on lifecycle events Collapse the invalidate + warmup pair into a single refresh_mcp_tools_cache_for_connector(connector_id, search_space_id) helper and scope live discovery to the one connector that changed instead of the whole search space. - new mcp_tool.discover_single_mcp_connector: load one connector, refresh OAuth if needed, force live MCP discovery so its cached_tools row is rewritten; returned wrappers are discarded since the in-process LRU is rebuilt lazily on the next user query - mcp_tools_cache.refresh_mcp_tools_cache_for_connector: synchronously evicts the per-space LRU (LRU keys cannot scope finer) and schedules the per-connector prefetch via loop.create_task - routes (OAuth callback, MCP POST, MCP PUT) collapse their two back-to-back calls into a single refresh call; DELETE handlers keep using bare invalidate_mcp_tools_cache (nothing to prefetch) No new automated tests: the new functions are I/O glue (DB + network) where mocked unit tests would test implementation rather than behavior. The existing 9 unit tests for the cached_tools data shape are unchanged.	2026-05-20 17:43:27 +02:00
Anish Sarkar	58a975205d	chore: add workflow file to change detection filters for backend and frontend jobs	2026-05-20 20:06:38 +05:30
Anish Sarkar	a6a0f7a373	chore: streamline GitHub Actions workflow by removing change detection job and simplifying test conditions	2026-05-20 20:03:51 +05:30
Anish Sarkar	844b8ba609	chore: refactor GitHub Actions workflow to improve backend change detection and job dependencies	2026-05-20 19:42:47 +05:30
CREDO23	c0aa4261ac	perf(mcp): persist list_tools discovery in connector.config.cached_tools Skip the ~1-3s MCP initialize + list_tools handshake on every cache miss by reading tool definitions from the connector row we already load. Lazy populate on first miss, self-heal on corrupt cache, zero schema migration.	2026-05-20 16:11:07 +02:00
Anish Sarkar	a786574484	chore: simplify job names in GitHub Actions workflow	2026-05-20 19:29:09 +05:30
Anish Sarkar	2de8ea5501	chore: update biome version in pre-commit configuration	2026-05-20 19:25:39 +05:30
Anish Sarkar	20152b1243	chore: update Node.js version to 20 in GitHub Actions workflow	2026-05-20 18:41:05 +05:30
Anish Sarkar	ff2d621185	chore: fix the pnpm version in GitHub Actions workflow	2026-05-20 18:37:21 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	f5f2456dfd	Merge branch 'dev' of https://github.com/MODSetter/SurfSense into dev	2026-05-20 03:01:49 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	ed22da7b95	feat: bumped version to 0.0.24	2026-05-20 03:01:37 -07:00
Anish Sarkar	39c29d651f	feat: enhance token display in MessageInfoDropdown with improved visual separation	2026-05-20 15:29:41 +05:30
CREDO23	db8bffab38	perf(prompt-cache): enable Azure prompt_cache_key routing hint Splits the OpenAI-family gate into per-param predicates so AZURE and AZURE_OPENAI configs now receive prompt_cache_key for backend routing affinity (Microsoft auto-caches GPT-4o+ deployments at >=1024 tokens; the key clusters same-prefix requests on the same GPU pool and raises hit rate on turn 2+). prompt_cache_retention stays opted out for Azure because litellm 1.83.14's Azure transformer would drop it silently; revisit when Azure's supported params list is updated.	2026-05-20 11:58:15 +02:00
CREDO23	71dead0406	perf(kb-planner): route internal planner calls to dedicated small/fast LLM Adds an optional planner LLM role wired through KnowledgePriorityMiddleware so KB query rewriting, date extraction, and recency classification run on a cheap model (e.g. gpt-4o-mini, Haiku, Azure nano) instead of the user's chat LLM. Operators opt in by setting is_planner: true on exactly one global config; without it, behavior is unchanged.	2026-05-20 11:42:52 +02:00
Anish Sarkar	8c9be9796a	feat: add no-update sentinel handling to save_memory function and corresponding unit tests	2026-05-20 15:03:35 +05:30
CREDO23	c3db25302b	perf(chat): kill auto-pin preflight + speculative build, rely on reactive 429 recovery The preflight pattern probed the LLM with a 1-token ping before each cold turn (when requested_llm_config_id==0, llm_config_id<0, and the 45s healthy TTL had expired) to detect 429s before fanning out into planner/classifier/title-gen. To absorb its ~1-5s RTT cost we built the agent speculatively in parallel; on 429 we discarded the build and repinned. Three problems with that design: 1. False security. Provider rate limits are token-bucket. A 1-token ping consumes ~5 tokens; the real request consumes 10-50K. The probe can return 200 while the real call still 429s. 2. Pure overhead in the common case. On warm-agent-cache turns the probe dominates wall time: ~2.5s of TTFT pure tax for ~99% of users who never see a 429. 3. The in-stream recovery loop (catch of _is_provider_rate_limited gated by not _first_event_logged) already does the right thing reactively: mark_runtime_cooldown -> resolve_or_get_pinned_llm_config_id with exclude_config_ids={previous} -> rebuild agent -> retry the stream. Preflight was never the only safety net; it was a redundant probe in front of one. Changes: - Delete _preflight_llm, _settle_speculative_agent_build, and the _PREFLIGHT_TIMEOUT_SEC / _PREFLIGHT_MAX_TOKENS constants. - Drop the parallel agent_build_task / preflight_task plumbing in both stream_new_chat and stream_resume_chat; build the agent inline with await _build_main_agent_for_thread(...). - Drop the unused is_recently_healthy / mark_healthy imports here (still exported from auto_model_pin_service since OpenRouter catalogue refresh and a few tests reference clear_healthy). - Remove the obsolete preflight + settle-speculative tests from test_stream_new_chat_contract.py. Net: -447 LOC. ~2.5s removed from TTFT on every cold preflight-eligible turn. 429 recovery path is unchanged - same repin/rebuild/retry, just not paid in advance on the healthy path.	2026-05-20 11:03:08 +02:00
Anish Sarkar	132e7b3c44	refactor: remove memory extraction functions and related components from the new chat agent	2026-05-20 14:03:28 +05:30
Rohan Verma	4774105015	Merge pull request #1413 from mvanhorn/fix/1359-shared-isMobile-hook fix: use shared useIsMobile (768px) in SidebarSlideOutPanel (#1359)	2026-05-20 01:32:04 -07:00

1 2 3 4 5 ...

6004 commits