SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-06-26 21:39:43 +02:00

Author	SHA1	Message	Date
CREDO23	e72b17fbed	retrieval: instrument hybrid search; note deferred citation markers	2026-06-25 09:00:23 +02:00
CREDO23	4fe208557a	retrieval: add reranking wrapper and context service	2026-06-25 08:23:29 +02:00
CREDO23	407bfcd94f	retrieval: add source label and retrieved-document adapter	2026-06-25 08:23:29 +02:00
CREDO23	608192057f	retrieval: add search scope models and hybrid chunk search	2026-06-25 08:23:29 +02:00
CREDO23	26a1431e87	retrieved_context: drop document completeness concept	2026-06-25 08:23:29 +02:00
CREDO23	6bb20df510	citations: rewrite model [n] ordinals to frontend [citation:] markers	2026-06-25 06:48:25 +02:00
CREDO23	9ffbba8d8c	retrieved_context: package surface	2026-06-24 22:38:47 +02:00
CREDO23	1f5da25ef5	retrieved_context: renderer	2026-06-24 22:38:47 +02:00
CREDO23	4d68fa8998	retrieved_context: models	2026-06-24 22:38:47 +02:00
CREDO23	85b999a52d	feat(chat): add citations package surface	2026-06-24 21:35:19 +02:00
CREDO23	61b8af0af4	feat(chat): add citation registry	2026-06-24 21:35:19 +02:00
CREDO23	98b164c2d3	feat(chat): add citation entry data shapes	2026-06-24 21:35:19 +02:00
Anish Sarkar	3695e1d5c5	Merge remote-tracking branch 'upstream/dev' into feat/api-key	2026-06-23 13:09:53 +05:30
Rohan Verma	1dc3fac81d	Merge pull request #1527 from Muhammad-Ikhwan-Fathulloh/dev fix: normalize image URLs before persistence and add model selector aria-label	2026-06-23 00:08:41 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	a08de01cc7	Revert "Merge pull request #1523 from CREDO23/fix/chat-citations" This reverts commit `cd2242147a`, reversing changes made to `a4bb0a5253`.	2026-06-22 22:55:29 -07:00
Muhammad-Ikhwan-Fathulloh	2848ac6c39	fix: normalize image URLs before persistence and add model selector aria-label	2026-06-20 19:49:58 +07:00
Anish Sarkar	fd31ac34fd	Merge remote-tracking branch 'upstream/dev' into feat/api-key	2026-06-20 10:50:03 +05:30
CREDO23	73dd4e8e3a	feat: embed line-citation tokens in search hits	2026-06-19 17:37:41 +02:00
CREDO23	188ae053ac	feat: serve numbered source_markdown reads with citation preamble	2026-06-19 17:37:41 +02:00
Anish Sarkar	6fd3f8570e	refactor: streamline auth context usage across chat and automation routes	2026-06-19 21:04:21 +05:30
CREDO23	fc17b9becd	docs: rename evidence.chunk_ids to citations in desktop kb prompt	2026-06-19 17:32:45 +02:00
CREDO23	30ca0e1ef5	docs: readonly kb specialist cites line or chunk form	2026-06-19 17:32:45 +02:00
CREDO23	3c63a7bcd3	docs: kb specialist cites numbered or legacy chunk form	2026-06-19 17:32:45 +02:00
CREDO23	141801f1cc	docs: clarify web/kb/legacy citation channels	2026-06-19 17:32:45 +02:00
Anish Sarkar	096dea45d4	refactor: pass auth context through automations	2026-06-19 20:28:35 +05:30
CREDO23	1741fdc9c8	feat: numbered-read preamble and matched line ranges	2026-06-19 15:43:21 +02:00
CREDO23	7967b62b42	feat: search tool renders matched passage with lines	2026-06-19 14:53:49 +02:00
CREDO23	f2fe2e576e	feat: note writes chunk via shared span builder	2026-06-18 20:17:45 +02:00
okxint	a12cd21f2f	fix(image-gen): resolve relative URLs returned by Xinference and compatible backends Some OpenAI-compatible image backends (e.g. Xinference) return a relative URL like /files/image.png in data[0].url instead of an absolute one. Browsers cannot resolve these, causing images to fail to load. Track the provider's api_base after resolving model config via to_litellm(). When the returned URL starts with "/", extract the origin (scheme + host + port) from api_base and prepend it to produce a full absolute URL. No behaviour change for providers that return absolute URLs (OpenAI, Azure, etc). Closes #1496	2026-06-17 10:57:39 +05:30
CREDO23	32a6e54ce6	Merge remote-tracking branch 'upstream/dev' into features/documents-injestion-layered-cached	2026-06-14 11:30:33 +02:00
Anish Sarkar	c7409c8995	chore: ran linting	2026-06-13 21:59:35 +05:30
Anish Sarkar	bd4a04f2e7	feat(database-migrations): add migration to remove legacy model config tables and remove stale model connection code	2026-06-13 12:45:43 +05:30
CREDO23	052e9ef4d1	refactor(chunks): order chunk reads by (document_id, position) Presentation and citation ordering moves off Chunk.id/created_at to the explicit position column (id kept as tiebreaker). Vector and ts_rank ranking order_by clauses are untouched.	2026-06-12 18:53:21 +02:00
CREDO23	5a71769dba	fix(chunks): set position on remaining chunk insert paths document_converters, the github size-fallback chunker, revert_service restores, and the kb-persistence middleware now write explicit positions (the middleware read path also orders by position).	2026-06-12 18:53:08 +02:00
Anish Sarkar	908790e40f	Merge remote-tracking branch 'upstream/dev' into feat/unified-model-connections	2026-06-12 03:15:28 +05:30
Anish Sarkar	5d5d574550	refactor(model-connections): move backend model connections to provider capabilities	2026-06-12 02:17:22 +05:30
Anish Sarkar	831ad23c6c	fix(chat): harden image generation model routing	2026-06-11 18:22:45 +05:30
CREDO23	eb56acc407	refactor(podcasts): regenerate via brief gate, render brief inline in chat	2026-06-11 11:45:17 +02:00
CREDO23	3eb7cdb2d8	refactor(podcasts): gate chat-triggered podcast on brief review	2026-06-10 21:44:50 +02:00
Anish Sarkar	85114d2a0e	refactor(chat): rename image generation config parameters for clarity	2026-06-10 21:50:42 +05:30
Anish Sarkar	077016d6e4	refactor(images): use model connections for image generation	2026-06-10 21:48:37 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	41ff57101c	feat: made chat fast - Introduced lazy knowledge base retrieval mode, allowing the main agent to fetch KB content on demand via the `search_knowledge_base` tool, improving performance by skipping expensive pre-injection processes. - Added cross-thread caching capability, enabling reuse of compiled graphs across different user chats, reducing latency for returning users. - Updated middleware to support new lazy loading and caching features, ensuring efficient resource utilization and improved response times. - Enhanced logging for performance tracking during knowledge retrieval and agent interactions.	2026-06-09 04:45:17 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	ce952d2ad1	chore: linting	2026-06-09 00:42:26 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	0a012dbc79	feat(middleware): enhance performance logging in chat agents - Integrated performance logging in `OtelSpanMiddleware` to track model call durations even when OTel is disabled. - Added detailed performance metrics in `KnowledgePriorityMiddleware` for database operations and embedding processes, improving visibility into query performance. - Utilized `get_perf_logger` for consistent logging across middleware components.	2026-06-09 00:28:53 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	640ef5f15d	feat(proxy): integrate Scrapling for enhanced web scraping capabilities - Replaced Playwright with Scrapling's fetchers in the web crawling and YouTube processing modules for improved performance and flexibility. - Updated proxy configuration to support dynamic proxy selection via environment variables. - Enhanced logging to track performance metrics during web scraping operations. - Refactored related modules to utilize the new proxy utilities and streamline the scraping process.	2026-06-09 00:15:10 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	c2beaf1e5a	refactor(config): centralize configuration management across modules - Replaced environment variable usage with a centralized configuration system in multiple modules, including `celery_app`, `agent_cache_store`, `sandbox`, `file_storage`, and `connector_service`. - Enhanced maintainability and readability by sourcing configuration values from the `config` module instead of directly from environment variables. - Updated relevant settings to ensure consistent access to configuration values across the application.	2026-06-08 13:50:16 -07:00
CREDO23	8bdfd00a15	Merge upstream/dev	2026-06-05 19:18:12 +02:00
CREDO23	a3d05f6418	docs(agents): tighten docstrings and comments across agent module Recursive pass over the agents module to make docstrings and inline comments concise and intent-oriented: drop narration that just restates the code, condense verbose module/function docstrings, and keep only the non-obvious "why" notes. No functional code changed.	2026-06-05 17:39:38 +02:00
CREDO23	88fe213176	refactor(agents): extract subagent-invocation contract to subagents/shared The knowledge_base subagent imported subagent_invoke_config + EXCLUDED_STATE_KEYS from main_agent's checkpointed_subagent_middleware -- a subagent reaching into main-agent internals. Both symbols (plus the recursion-limit constant they need) are a subagent-invocation contract shared by the orchestrator's task middleware and any nested-invoking subagent. Move them to subagents/shared/invocation.py; config.py keeps the HITL resume side-channel and constants.py keeps the main-agent tuning knobs. All consumers (task_tool, kb tool, tests) repointed.	2026-06-05 14:18:44 +02:00
CREDO23	490bb3c5c5	refactor(agents): extract shared Google OAuth helper from gmail connector build_credentials/get_token_encryption are Google-OAuth helpers used by both the Gmail and Calendar connector tools. They lived inside gmail/tools/_helpers.py, forcing calendar -> gmail coupling. Move them to a neutral connector-level module (connectors/google_auth.py); gmail/_helpers.py re-exports them under the legacy private names so existing gmail tools are untouched, and calendar now imports the shared module directly.	2026-06-05 14:14:32 +02:00

59 commits
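
Note on a12cd21f2f (fix(image-gen)): the commit message describes prepending the origin of the provider's api_base when an image backend returns a relative URL such as /files/image.png. A minimal sketch of that idea follows; it is not the code from the commit. The function name absolutize_image_url and the example api_base value are hypothetical, and the guard for protocol-relative URLs ("//host/...") is an extra assumption that the commit message does not mention.

```python
from urllib.parse import urlsplit


def absolutize_image_url(url: str, api_base: str) -> str:
    """Hypothetical helper: turn a relative image URL into an absolute one.

    Absolute URLs are returned unchanged, matching the "no behaviour
    change" note in the commit message.
    """
    # Only paths that start with a single "/" are treated as relative.
    # Skipping "//..." (protocol-relative) is an assumption of this sketch.
    if not url.startswith("/") or url.startswith("//"):
        return url

    parts = urlsplit(api_base)
    # Without a scheme and host there is no origin to prepend.
    if not parts.scheme or not parts.netloc:
        return url

    # For a typical api_base, netloc is "host" or "host:port".
    origin = f"{parts.scheme}://{parts.netloc}"
    return origin + url


if __name__ == "__main__":
    # Example api_base is made up for illustration.
    print(absolutize_image_url("/files/image.png", "http://localhost:9997/v1"))
```

The path component of api_base (here /v1) is dropped on purpose, since the commit message speaks of the origin only (scheme + host + port).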