SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-04-25 16:56:22 +02:00

Author	SHA1	Message	Date
DESKTOP-RTLN3BA\$punk	40a091f8cc	feat: enhance knowledge base search and document retrieval - Introduced a mechanism to identify degenerate queries that lack meaningful search signals, improving search accuracy. - Implemented a fallback method for browsing recent documents when queries are degenerate, ensuring relevant results are returned. - Added limits on the number of chunks fetched per document to optimize performance and prevent excessive data loading. - Updated the ConnectorService to allow for reusable query embeddings, enhancing efficiency in search operations. - Enhanced LLM router service to support context window fallbacks, improving robustness during context window limitations.	2026-02-28 19:40:24 -08:00
DESKTOP-RTLN3BA\$punk	d959a6a6c8	feat: optimize document upload process and enhance memory management - Increased maximum file upload limit from 10 to 50 to improve user experience. - Implemented batch processing for document uploads to avoid proxy timeouts, splitting files into manageable chunks. - Enhanced garbage collection in chat streaming functions to prevent memory leaks and improve performance. - Added memory delta tracking in system snapshots for better monitoring of resource usage. - Updated LLM router and service configurations to prevent unbounded internal accumulation and improve efficiency.	2026-02-28 17:22:34 -08:00
DESKTOP-RTLN3BA\$punk	f4b2ab0899	feat: enhance caching mechanisms to prevent memory leaks - Improved in-memory rate limiting by evicting timestamps outside the current window and cleaning up empty keys. - Updated LLM router service to cache context profiles and avoid redundant computations. - Introduced cache eviction logic for MCP tools and sandbox instances to manage memory usage effectively. - Added garbage collection triggers in chat streaming functions to reclaim resources promptly.	2026-02-27 17:56:00 -08:00
DESKTOP-RTLN3BA\$punk	0e723a5b8b	feat: perf optimizations - improved search_knowledgebase_tool - Added new endpoint to batch-fetch comments for multiple messages, reducing the number of API calls. - Introduced CommentBatchRequest and CommentBatchResponse schemas for handling batch requests and responses. - Updated chat_comments_service to validate message existence and permissions before fetching comments. - Enhanced frontend with useBatchCommentsPreload hook to optimize comment loading for assistant messages.	2026-02-27 17:19:25 -08:00
DESKTOP-RTLN3BA\$punk	664c43ca13	feat: add performance logging middleware and enhance performance tracking across services - Introduced RequestPerfMiddleware to log request performance metrics, including slow request thresholds. - Updated various services and retrievers to utilize the new performance logging utility for better tracking of execution times. - Enhanced existing methods with detailed performance logs for operations such as embedding, searching, and indexing. - Removed deprecated logging setup in stream_new_chat and replaced it with the new performance logger.	2026-02-27 16:32:30 -08:00
DESKTOP-RTLN3BA\$punk	1e4b8d3e89	feat: enhance document formatting and context management for LLM tools - Introduced dynamic character budget calculation for document formatting based on model's context window. - Updated `format_documents_for_context` to respect character limits and improve output quality. - Added `max_input_tokens` parameter to various functions to facilitate context-aware processing. - Enhanced error handling for context overflow in LLM router service.	2026-02-26 20:47:19 -08:00
DESKTOP-RTLN3BA\$punk	e9892c8fe9	feat: added configable summary calculation and various improvements - Replaced direct embedding calls with a utility function across various components to streamline embedding logic. - Added enable_summary flag to several models and routes to control summary generation behavior.	2026-02-26 18:24:57 -08:00
CREDO23	a96d18ec31	Fix GitHub Models provider: use github prefix, fix migration, add Zod enum	2026-02-09 19:14:16 +02:00
CREDO23	a9d6c41964	Add GITHUB_MODELS to provider maps	2026-02-09 17:34:25 +02:00
akc	bcc77c2864	fix: also update PROVIDER_MAP in llm_router_service.py for Auto mode	2026-01-31 01:50:05 +03:00
DESKTOP-RTLN3BA\$punk	6fb656fd8f	hotpatch(cloud): add llm load balancing	2026-01-29 15:28:31 -08:00

11 commits
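
Commit f4b2ab0899 mentions in-memory rate limiting that evicts timestamps outside the current window and cleans up empty keys. The sketch below illustrates that general technique only; it is not taken from the SurfSense source. The class name `SlidingWindowRateLimiter`, its methods, and its parameters are all hypothetical, and the actual implementation may differ.

```python
import time
from collections import deque
from typing import Deque, Dict, Optional


class SlidingWindowRateLimiter:
    """Hypothetical sliding-window limiter; names are illustrative assumptions."""

    def __init__(self, max_requests: int, window_seconds: float) -> None:
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        # One deque of request timestamps per key, oldest first.
        self._hits: Dict[str, Deque[float]] = {}

    def _evict(self, key: str, now: float) -> Optional[Deque[float]]:
        """Drop timestamps outside the window; delete the key if none remain."""
        hits = self._hits.get(key)
        if hits is None:
            return None
        cutoff = now - self.window_seconds
        while hits and hits[0] <= cutoff:
            hits.popleft()
        if not hits:
            # Without this, keys that were seen once would accumulate forever.
            del self._hits[key]
            return None
        return hits

    def allow(self, key: str, now: Optional[float] = None) -> bool:
        """Record a request for `key` and report whether it is within the limit."""
        now = time.monotonic() if now is None else now
        hits = self._evict(key, now)
        if hits is not None and len(hits) >= self.max_requests:
            return False
        if hits is None:
            hits = self._hits[key] = deque()
        hits.append(now)
        return True

    def sweep(self, now: Optional[float] = None) -> int:
        """Evict stale entries for every key; return how many keys were removed."""
        now = time.monotonic() if now is None else now
        before = len(self._hits)
        for key in list(self._hits):
            self._evict(key, now)
        return before - len(self._hits)

    def tracked_keys(self) -> int:
        return len(self._hits)
```

Eviction on each `allow` call bounds the per-key deque by `max_requests`, while a periodic `sweep` handles keys that never send another request and would otherwise stay in memory.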