SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-07-10 22:32:16 +02:00

Author	SHA1	Message	Date
CREDO23	49da7a57df	Merge remote-tracking branch 'upstream/dev' into improvement-agent-speed Resolves: surfsense_backend/app/agents/new_chat/middleware/memory_injection.py - Took both imports: upstream moved MEMORY_HARD_LIMIT/SOFT_LIMIT to app.services.memory; kept our perf-logger import for timing. Pulls in upstream changes: - Memory document feature (services/memory refactor, removal of app.agents.new_chat.memory_extraction and background extraction in stream_new_chat — agent now drives memory via update_memory tool). - BACKEND_URL env refactor across web tool-ui/editor/chat/dashboard/lib. - GitHub Actions backend test workflow + pre-commit biome bump. - Token-display polish in MessageInfoDropdown; save_memory no-update sentinel. Verified: 1723 unit tests pass, ruff clean. No semantic regression in stream_new_chat (their memory-extraction deletion and our preflight removal touch different functions).	2026-05-20 21:23:48 +02:00
CREDO23	71dead0406	perf(kb-planner): route internal planner calls to dedicated small/fast LLM Adds an optional planner LLM role wired through KnowledgePriorityMiddleware so KB query rewriting, date extraction, and recency classification run on a cheap model (e.g. gpt-4o-mini, Haiku, Azure nano) instead of the user's chat LLM. Operators opt in by setting is_planner: true on exactly one global config; without it, behavior is unchanged.	2026-05-20 11:42:52 +02:00
Anish Sarkar	8c9be9796a	feat: add no-update sentinel handling to save_memory function and corresponding unit tests	2026-05-20 15:03:35 +05:30
Anish Sarkar	132e7b3c44	refactor: remove memory extraction functions and related components from the new chat agent	2026-05-20 14:03:28 +05:30
CREDO23	1791241c0c	perf(indexers): offload sync embed_text to thread across background workers Connector kb_sync_services (gmail, onedrive, google_calendar, jira), streaming indexers (discord, luma, teams) and the file-processor save path all called embed_text inside async coroutines, blocking the background worker's event loop for the duration of the embed. Wrap each call site in asyncio.to_thread so concurrent indexing tasks stop serialising on the embed.	2026-05-20 10:09:38 +02:00
CREDO23	a8de98895a	perf(revert-service): offload sync embed_texts to thread _restore_in_place_document and _reinsert_document_from_revision are async helpers invoked by the synchronous-feeling POST /api/threads/.../revert route; both ran embed_texts inline, blocking the event loop while the HTTP client waited.	2026-05-20 10:04:26 +02:00
CREDO23	32f6766cb6	fix(tokens): use canonical prompt_tokens_details path for cache fields LiteLLM normalizes every provider's cache fields onto usage.prompt_tokens_details (cached_tokens + cache_creation_tokens). The earlier fallback to usage.cache_read_input_tokens / usage.cache_creation_input_tokens was wrong: Anthropic-shaped fields only live there via a trailing setattr loop, and the canonical field name on the wrapper is cache_creation_tokens (not _input_tokens).	2026-05-20 09:55:39 +02:00
CREDO23	6090980c5e	obs(tokens): log prompt-cache read/write counts and hit ratio per LLM call	2026-05-20 09:51:44 +02:00
Anish Sarkar	a0ff86e0e8	feat: add memory document model and parsing functionality for markdown handling	2026-05-20 13:20:05 +05:30
Anish Sarkar	fe07de3f9c	chore: ran linting	2026-05-20 12:55:10 +05:30
Anish Sarkar	73043a0756	feat: enhance memory API responses with limits and update UI components for memory limit handling	2026-05-20 03:17:05 +05:30
Anish Sarkar	ceedd02353	refactor: extract shared memory service	2026-05-20 02:01:36 +05:30
CREDO23	581bbfb5c1	perf(tokens): add per-call latency to capture log	2026-05-19 21:30:25 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c187b04e82	chore: linting	2026-05-15 17:33:44 -07:00
CREDO23	4980f9f1ba	Merge remote-tracking branch 'upstream/dev' into feature/multi-agent-with-task-parallelization	2026-05-15 16:44:22 +02:00
CREDO23	6671c91841	multi_agent_chat/permissions: persist 'always' decisions to trusted-tools list Until now an "Always Allow" reply only updated the in-memory runtime ruleset, evaporating after the session ended. Persist it to the existing connector.config['trusted_tools'] list so the next session's fetch_user_allowlist_rulesets picks it up and the user is never asked again for the same (connector, tool) pair. - TrustedToolSaver + make_trusted_tool_saver(user_id) in user_tool_allowlist: opens its own session via async_session_maker per call, logs and swallows failures (in-memory promotion is the canonical "always" path, durable persistence is opportunistic). - PermissionMiddleware._process is now pure: returns (state_update, list[_AlwaysPromotion]). aafter_model awaits the saver for each promotion; after_model discards them. Promotions are only emitted for tools whose metadata exposes mcp_connector_id, so native tools and KB FS ops are correctly skipped. - main_agent factory builds the saver once per turn and stashes it in dependencies["trusted_tool_saver"]; pack_subagent and the KB middleware stack forward it through build_permission_mw. - Renamed pm._process(state, None) call sites in two existing tests to pm.after_model(state, None) so they exercise the public hook contract instead of the now-tuple-returning private method.	2026-05-15 14:07:08 +02:00
CREDO23	e99c06c887	user_tool_allowlist: extract trust-tool storage into reusable service	2026-05-14 21:20:30 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	3737118050	chore: evals	2026-05-13 14:02:26 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c8374e6c5b	feat: improved document, folder mentions rendering Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions	2026-05-09 22:15:51 -07:00
CREDO23	2ab6b1c757	Merge upstream/dev into feature/multi-agent.	2026-05-09 23:00:56 +02:00
CREDO23	e802de2333	Include optional metadata on tool and thinking-step SSE payloads.	2026-05-08 22:47:58 +02:00
CREDO23	78f4747382	refactor(chat): stream agent events via stream_output and remove parity v2 flag	2026-05-07 19:40:10 +02:00
CREDO23	7e07092f67	refactor(chat): drop alternate streaming entry path; use graph_stream	2026-05-07 19:25:20 +02:00
CREDO23	fef7621d96	Add StreamingService and interrupt correlation for chat streams.	2026-05-06 20:08:47 +02:00
CREDO23	fc429d8702	Add streaming emitter and registry for scoped SSE writes.	2026-05-06 20:08:47 +02:00
CREDO23	5510c6c314	Add typed event payload modules for the streaming service.	2026-05-06 20:08:47 +02:00
CREDO23	a9bf7ab7d2	Add SSE envelope helpers under app.services.streaming.	2026-05-06 20:08:47 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	5e87a7a251	fix: composio tool calls in composio connectors	2026-05-05 18:57:10 -07:00
CREDO23	5119915f4f	Merge upstream/dev into feature/multi-agent	2026-05-05 01:44:46 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	a34f1fb25c	feat: implement agent caches and fix invalid prompt cache configs Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions - Added a new function `_warm_agent_jit_caches` to pre-warm agent caches at startup, reducing cold invocation costs. - Updated the `SurfSenseContextSchema` to include per-invocation fields for better state management during agent execution. - Introduced caching mechanisms in various tools to ensure fresh database sessions are used, improving performance and reliability. - Enhanced middleware to support new context features and improve error handling during connector and document type discovery.	2026-05-03 06:03:40 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	e4f9d79635	feat: add preferred premium auto configuration logic and corresponding tests	2026-05-02 23:35:47 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c938d39277	feat: moved most things behind correct feature flag	2026-05-02 23:10:48 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	cea8618aed	fix: fixed composio issues	2026-05-02 21:16:03 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	47b2994ec7	feat: fixed vision/image provider specific errors and fixed podcast/video streaming Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions	2026-05-02 19:18:53 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	ae9d36d77f	feat: unified credits and its cost calculations	2026-05-02 14:34:23 -07:00
Rohan Verma	451a98936e	Merge pull request #1332 from AnishSarkar22/feat/model-pinnning-mode Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions feat: Auto-pin quality scoring, OpenRouter tier refactor and live usage sidebar	2026-05-01 15:57:19 -07:00
Anish Sarkar	cd25175b84	chore: ran linting	2026-05-02 03:36:13 +05:30
Anish Sarkar	2764fa5e30	feat(openrouter): clear healthy-status cache on catalogue refresh	2026-05-02 02:07:30 +05:30
Anish Sarkar	14686cdf82	feat(auto_pin): add short-TTL healthy-status cache for preflight reuse	2026-05-02 02:07:16 +05:30
Anish Sarkar	25ccc959cf	feat(busy_mutex): enhance thread lock management to prevent stale middleware interference	2026-05-02 01:35:30 +05:30
Anish Sarkar	f65b3be1ce	feat(auto_model_pin): implement runtime cooldown for error handling and enhance candidate selection	2026-05-02 00:57:52 +05:30
Anish Sarkar	4bef75d298	feat(auto_pin): quality-aware tier-locked selection with health gate	2026-05-01 23:38:53 +05:30
Anish Sarkar	1eedcaa551	feat(openrouter): blend per-model /endpoints health into quality score	2026-05-01 23:38:40 +05:30
Anish Sarkar	d9058b73f5	feat(auto_pin): add pure-function quality scoring module	2026-05-01 23:37:49 +05:30
Anish Sarkar	421a4d7d08	refactor(auto_model_pin): simplify thread-level pinning by removing unused fields and indexes	2026-05-01 19:32:42 +05:30
Anish Sarkar	680a1c1c38	refactor(openrouter): remove virtual openrouter/free auto-select entry	2026-05-01 18:16:47 +05:30
Anish Sarkar	4d34b56c4d	docs(router): drop reference to virtual openrouter/free in is_premium_model	2026-05-01 18:09:50 +05:30
Anish Sarkar	ccd7caf99f	feat(openrouter): derive billing tier per-model and stabilize config IDs	2026-05-01 17:42:21 +05:30
Anish Sarkar	5dd45a5740	refactor(router): add router_pool_eligible filter and rebuild() API	2026-05-01 17:41:52 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	e57c3a7d0c	feat: prompt caching Some checks are pending Build and Push Docker Images / tag_release (push) Waiting to run Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Blocked by required conditions Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Blocked by required conditions Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Blocked by required conditions - Updated `litellm` dependency version from `1.83.4` to `1.83.7`. - Adjusted `aiohttp` version from `3.13.5` to `3.13.4` in the lock file. - Implemented `apply_litellm_prompt_caching` in `chat_deepagent.py` to improve prompt caching. - Added model name resolution logic in `chat_deepagent.py` to ensure correct provider-variant dispatch. - Enhanced `llm_config.py` to configure prompt caching for various LLM providers. - Updated tests to verify correct model name forwarding and prompt caching behavior.	2026-05-01 05:10:53 -07:00

1 2 3 4 5 ...

423 commits