Commit graph

39 commits

Author SHA1 Message Date
Anish Sarkar
293de6876a feat: implement fuzzy search in mention document 2026-01-17 20:46:47 +05:30
Anish Sarkar
b001b65067 feat: add pg_trgm indexes and lightweight document title search
- Introduced pg_trgm extension and GIN trigram indexes for efficient document title searches, enhancing performance for mention picker functionality.
- Implemented a new API endpoint for lightweight document title searches, returning only essential fields.
- Updated frontend components to utilize the new title search feature with throttling for improved user experience.
- Added necessary schemas and types for the new search functionality.
2026-01-17 20:45:10 +05:30
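The trigram matching that this commit relies on can be illustrated in plain Python. This is only a rough sketch of how pg_trgm scores two strings, assuming ASCII titles; the function names `trigrams` and `similarity` are hypothetical and are not taken from the repository, and the real extension handles locales and word characters with more care.

```python
import re


def trigrams(text: str) -> set[str]:
    """Approximate pg_trgm's trigram extraction for ASCII text."""
    grams: set[str] = set()
    # pg_trgm lowercases the input and treats non-alphanumerics as word breaks.
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        # Each word is padded with two leading spaces and one trailing space.
        padded = f"  {word} "
        grams.update(padded[i:i + 3] for i in range(len(padded) - 2))
    return grams


def similarity(a: str, b: str) -> float:
    """Shared trigrams divided by all distinct trigrams (0.0 to 1.0)."""
    ta, tb = trigrams(a), trigrams(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)
```

A misspelled query such as "reprot" still shares some trigrams with "report", which is why a trigram index can serve a fuzzy title lookup where an exact prefix match would return nothing.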
DESKTOP-RTLN3BA\$punk
c768730b8c feat: fixed issues of note management
Issues Fixed

- Missing pagination fields in API response schemas (page, page_size, has_more)
- NOTE enum missing from frontend Zod schema
- Missing fields in DocumentRead response construction (content_hash, updated_at)
- BlockNote slash menu clipped by overflow-hidden CSS
- Sidebar click conflicts - hidden action buttons intercepting clicks
- Rewrote All Notes sidebar - replaced fragile custom portal with shadcn Sheet
- Missing translation keys for new UI strings
- Missing NOTE retrieval logic in researcher agent
- Added search to All Notes sidebar
- Removed frontend logging - was causing toasters on every page refresh
- Added backend logging to document reindex Celery task
2025-12-17 00:09:43 -08:00
WayChan
3c423436cc fix: wrong field retrieved for content when saving extension document. 2025-12-04 00:31:50 +00:00
WayChan
081080233a fix: saving document from browser extension fails due to missing and mismatched fields in backend data models 2025-12-03 15:32:32 +00:00
DESKTOP-RTLN3BA\$punk
e9d32c3516 feat: Implement Role-Based Access Control (RBAC) for search space resources.
- Introduce granular permissions for documents, chats, podcasts, and logs.
- Update routes to enforce permission checks for creating, reading, updating, and deleting resources.
- Refactor user and search space interactions to align with RBAC model, removing ownership checks in favor of permission validation.
2025-11-27 22:45:04 -08:00
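The shift from ownership checks to permission validation described in this commit might look roughly like the sketch below. Everything in it is an assumption made for illustration: the `Permission` values, the `ROLE_PERMISSIONS` mapping, and the role names are hypothetical and do not reflect the project's actual schema.

```python
from enum import Enum


class Permission(str, Enum):
    # Hypothetical permission names, one per resource and action.
    DOCUMENTS_READ = "documents:read"
    DOCUMENTS_DELETE = "documents:delete"
    CHATS_CREATE = "chats:create"


# Hypothetical role-to-permission mapping for a single search space.
ROLE_PERMISSIONS: dict[str, set[Permission]] = {
    "viewer": {Permission.DOCUMENTS_READ},
    "editor": {Permission.DOCUMENTS_READ, Permission.CHATS_CREATE},
    "owner": set(Permission),
}


def has_permission(role: str, permission: Permission) -> bool:
    """Return True if the role grants the permission; unknown roles grant nothing."""
    return permission in ROLE_PERMISSIONS.get(role, set())


def require_permission(role: str, permission: Permission) -> None:
    """Raise instead of returning False, as a route guard would."""
    if not has_permission(role, permission):
        raise PermissionError(f"role {role!r} lacks {permission.value}")
```

In a route handler, a guard like `require_permission` would run before the resource is touched, so the handler no longer needs to compare the caller against the resource owner.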
samkul-swe
6d19e0fad8 Fixing search logic 2025-11-22 13:33:16 -08:00
DESKTOP-RTLN3BA\$punk
ecd07d6155 refactor: update API endpoint paths to remove trailing slashes
- Modified various FastAPI route definitions to remove trailing slashes for consistency across the application.
- Updated corresponding fetch calls in the frontend to align with the new endpoint structure.
- Ensured that all affected routes maintain their functionality without trailing slashes.
2025-10-31 01:33:01 -07:00
DESKTOP-RTLN3BA\$punk
18adf79649 feat(fix): document type filtering 2025-10-21 21:53:55 -07:00
DESKTOP-RTLN3BA\$punk
c80bbfa867 feat: added celery and removed background_tasks for MQ's
- removed pre commit hooks
- updated docker setup
- updated github docker actions
- updated docs
2025-10-20 00:30:00 -07:00
DESKTOP-RTLN3BA\$punk
70b547c9c9 chore: updated docs & refactored sst_service.py 2025-10-15 14:31:38 -07:00
Rohan Verma
5ed9aa2b0b
Merge pull request #387 from nabthebest135/local-stt
local STT implementation with Faster-Whisper
2025-10-15 14:08:09 -07:00
Nabhan
9b72ec65b5 fix: address code review feedback for STT implementation
- Add header to local STT transcription for consistency
- Add empty text validation for external STT path
- Refactor external STT to eliminate duplication in atranscription calls
- Ensure both local and external paths have consistent error handling
2025-10-13 14:26:36 +05:00
Differ
917cf4f398 feat: add Chinese LLM providers support with auto-fill API Base URL
- Add support for DeepSeek, Qwen (Alibaba), Kimi (Moonshot), and GLM (Zhipu)
- Implement auto-fill API Base URL when selecting Chinese LLM providers
- Add smart validation and warnings for missing API endpoints
- Fix session state management in task logging service
- Add comprehensive Chinese setup documentation
- Add database migration for new LLM provider enums

Closes #383
2025-10-12 19:10:46 +08:00
Nabhan
15ba2b86f6 fix: add defensive dictionary access and error handling for local STT
- Use .get() for safe dictionary access instead of direct key access
- Add explicit try-catch for local STT transcription failures
- Validate transcription result is not empty
- Provide clear error messages for corrupted audio files
- Match error handling pattern with external STT service
2025-10-12 11:14:12 +05:00
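The defensive pattern listed in this commit can be sketched as follows. The function name `extract_transcript` and the `"text"` key are assumptions for illustration; the actual result structure in the repository may differ.

```python
def extract_transcript(result: dict) -> str:
    """Pull the transcript out of an STT result without risking a KeyError."""
    # .get() returns None for a missing key; "or ''" also covers an explicit None.
    text = (result.get("text") or "").strip()
    if not text:
        # An empty transcript is treated as a failure with a clear message.
        raise ValueError(
            "Transcription returned no text; the audio file may be corrupted or silent"
        )
    return text
```

Raising a descriptive `ValueError` here lets the caller handle local and external STT failures through the same error path.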
Nabhan
504399ad01 refactor: eliminate duplicated STT service condition check
- Compute stt_service_type once and reuse
- Follow DRY principles
- Improve code maintainability
2025-10-12 11:13:30 +05:00
Nabhan
cf0e265107 refactor: integrate local STT with existing upload flow
- Simplify STT_SERVICE config to local/MODEL_SIZE format
- Remove separate STT routes, integrate with document upload
- Add local STT support to audio file processing pipeline
- Remove React component, use existing upload interface
- Support both local Faster-Whisper and external STT services
- Tested with real speech: 99% accuracy, 2.87s processing
2025-10-12 10:50:55 +05:00
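One way to read the `local/MODEL_SIZE` format mentioned in this commit is sketched below. The helper `parse_stt_service` and its return shape are hypothetical; only the `local/` prefix convention comes from the commit message.

```python
def parse_stt_service(value: str) -> tuple[str, str]:
    """Split an STT_SERVICE value into (service_type, model).

    "local/<size>" selects local Faster-Whisper with the given model size;
    any other value is passed through unchanged as an external service name.
    """
    prefix, sep, rest = value.partition("/")
    if prefix == "local" and sep and rest:
        return ("local", rest)
    return ("external", value)
```

Computing the service type once from this value, then branching on it, avoids repeating the prefix check at every call site.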
Natsume Ryuhane
797fe26f53
Implemented server-side pagination
Enabled search space file management panel to use server-side pagination
2025-10-01 13:05:22 -07:00
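Server-side pagination of this kind usually returns one slice of rows plus enough metadata for the client to request the next one. The sketch below is a minimal in-memory version; the `Page` type, the zero-based `page` index, and the field names are assumptions, and a real endpoint would apply the offset and limit in the database query instead of slicing a list.

```python
from dataclasses import dataclass


@dataclass
class Page:
    items: list
    total: int
    page: int
    page_size: int
    has_more: bool


def paginate(rows: list, page: int, page_size: int) -> Page:
    """Return one zero-based page of rows with pagination metadata."""
    if page < 0 or page_size <= 0:
        raise ValueError("page must be >= 0 and page_size must be > 0")
    start = page * page_size
    items = rows[start:start + page_size]
    # More rows remain if this page does not reach the end of the data.
    has_more = start + len(items) < len(rows)
    return Page(items, len(rows), page, page_size, has_more)
```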
DESKTOP-RTLN3BA\$punk
76732c36ba feat: added jump to source referencing of citations 2025-08-23 18:48:18 -07:00
DESKTOP-RTLN3BA\$punk
1c4c61eb04 feat: Fixed Document Summary Content across connectors and processors 2025-08-18 20:51:48 -07:00
DESKTOP-RTLN3BA\$punk
5aa52375c3 refactor: refactored background_tasks & indexing_tasks 2025-08-12 15:28:13 -07:00
Utkarsh-Patel-13
d359a59f6d Fixed all ruff lint and formatting errors 2025-07-24 14:43:48 -07:00
MSI\ModSetter
9e8929ed2d refactor: Update import path for TaskLoggingService in documents_routes.py 2025-07-21 06:20:44 -07:00
MSI\ModSetter
931fafa403 refactor: Remove deprecated document processing services and update imports
- Deleted the document_processing module and its associated docling_service.
- Updated imports in documents_routes.py and background_tasks.py to reflect the new service structure.
- Ensured compatibility with the task logging system by adjusting type hints for log entries.
2025-07-21 06:19:37 -07:00
Abdullah 3li
f117d94ef7 fix: Resolve merge conflict in documents_routes.py
- Integrated Docling ETL service with new task logging system
- Maintained consistent logging pattern across all ETL services
- Added progress and success/failure logging for Docling processing
2025-07-21 10:43:15 +03:00
Abdullah 3li
aa00822169 feat: Add Docling support as ETL_SERVICE option
- Added DOCLING as third ETL_SERVICE option (alongside UNSTRUCTURED/LLAMACLOUD)
- Implemented add_received_file_document_using_docling function
- Added Docling processing logic in documents_routes.py
- Enhanced chunking with configurable overlap support
- Added comprehensive document processing service
- Supports both CPU and GPU processing with user selection

Addresses #161 - Add Docling Support as an ETL_SERVICE
Follows same pattern as LlamaCloud integration (PR #123)
2025-07-20 11:42:55 +03:00
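Chunking with configurable overlap, as mentioned in this commit, can be sketched with a simple character window. This is not the project's chunker: the function `chunk_text` is hypothetical and works on characters, whereas a production chunker would more likely split on tokens or sentences.

```python
def chunk_text(text: str, chunk_size: int, overlap: int) -> list[str]:
    """Split text into windows of chunk_size that share `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap  # how far each window advances
    chunks: list[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once a window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks
```

The overlap means that a sentence cut at one chunk boundary still appears whole, or nearly so, in the neighbouring chunk, at the cost of storing some text twice.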
MSI\ModSetter
1eb072cc69 feat(BACKEND): Added Log Management System for better bug tracking
- Background tasks are now logged so non-technical users can easily track failure points.
2025-07-16 01:10:33 -07:00
DESKTOP-RTLN3BA\$punk
21fb231683 fix: Markdown & Text files as default support. 2025-07-07 22:55:51 -07:00
DESKTOP-RTLN3BA\$punk
a85f7920a9 feat: added configurable LLMs 2025-06-09 15:50:15 -07:00
DESKTOP-RTLN3BA\$punk
73751c0eb1 feat: Removed Hard Dependency on Unstructured.io
- Added Llamaparse Support :)
2025-05-30 19:17:19 -07:00
DESKTOP-RTLN3BA\$punk
a8080d2dc7 feat: Added Speech to Text support.
- Supports audio & video files.
- Will be useful for YouTube videos that don't have transcripts.
2025-05-13 21:13:53 -07:00
DESKTOP-RTLN3BA\$punk
1586a0bd78 chore: Added direct handling for markdown files.
- Fixed podcast imports.
2025-05-07 22:04:57 -07:00
DESKTOP-RTLN3BA\$punk
2008b07304 fix: Docs & Chats in other search spaces 2025-04-17 23:19:56 -07:00
DESKTOP-RTLN3BA\$punk
b43272a115 feat(youtube): integrate YouTube video processing connector
- Added support for processing YouTube videos, including transcript extraction and document creation.
- Implemented a new background task for adding YouTube video documents.
- Enhanced the connector service to search for YouTube videos and return relevant results.
- Updated frontend components to include YouTube video options in the dashboard and connector sources.
- Added necessary dependencies for YouTube transcript API.
2025-04-11 15:05:17 -07:00
DESKTOP-RTLN3BA\$punk
1609e59086 YouTube video processing utils 2025-04-09 18:46:10 -07:00
DESKTOP-RTLN3BA\$punk
8cd1264d3f feat: Updated the extension for SurfSense v0.0.6 2025-03-26 20:02:53 -07:00
DESKTOP-RTLN3BA\$punk
ee0c518553 not-integrated: Add DocumentHybridSearchRetriever 2025-03-20 22:56:24 -07:00
DESKTOP-RTLN3BA\$punk
709aa6f303 feat: Added Docker Support and missing dependencies. 2025-03-20 18:52:06 -07:00
DESKTOP-RTLN3BA\$punk
da23012970 feat: SurfSense v0.0.6 init 2025-03-14 18:53:14 -07:00