Commit graph

46 commits

Author SHA1 Message Date
CREDO23
929445afd9 feat: use batch embedding in IndexingPipelineService.index 2026-03-09 16:13:44 +02:00
Anish Sarkar
b2bf00e11a chore: ran linting 2026-02-28 02:28:03 +05:30
Anish Sarkar
ce82807f16 test: enhance reindexing tests for UploadDocumentAdapter 2026-02-28 02:18:02 +05:30
Anish Sarkar
37f76a8533 test: add should_summarize parameter to file upload adapter tests 2026-02-28 01:44:41 +05:30
Anish Sarkar
23a98d802c refactor: implement UploadDocumentAdapter for file indexing and reindexing 2026-02-28 01:38:32 +05:30
DESKTOP-RTLN3BA\$punk
a4dc84d1ab feat: add should_summarize parameter to task dispatchers
- Introduced should_summarize parameter in TaskDispatcher and CeleryTaskDispatcher to control summary generation.
- Updated InlineTaskDispatcher to support the new parameter for document processing.
2026-02-26 19:12:37 -08:00
Anish Sarkar
836d5293df refactor: remove unused TestStatusPolling class from document upload integration tests 2026-02-27 01:52:35 +05:30
Anish Sarkar
fd032f3709 refactor: simplify and clarify documentation in document upload integration tests 2026-02-27 01:48:25 +05:30
Anish Sarkar
7c09958ddc refactor: enhance document upload integration tests for API contract validation 2026-02-27 01:24:20 +05:30
Anish Sarkar
1068ea25a7 refactor: standardize test database configuration across test files 2026-02-27 00:45:51 +05:30
Anish Sarkar
f09b5b0ea4 refactor: replace hardcoded embedding dimension with dynamic configuration
- Updated the embedding dimension in test configurations to use the value from the application config, enhancing maintainability and consistency across tests.
2026-02-27 00:17:39 +05:30
Anish Sarkar
223c2de0d2 refactor: update database connection handling in test configurations 2026-02-27 00:05:21 +05:30
Anish Sarkar
3393e435f9 feat: implement task dispatcher for document processing
- Introduced a TaskDispatcher abstraction to decouple the upload endpoint from Celery, allowing for easier testing with synchronous implementations.
- Updated the create_documents_file_upload function to utilize the new dispatcher for task management.
- Removed direct Celery task imports from the upload function, enhancing modularity.
- Added integration tests for document upload, including page limit enforcement and file size restrictions.
2026-02-26 23:55:47 +05:30
Anish Sarkar
bf60a5049f feat: add end-to-end test for document searchability after upload
- Introduced a new test class to verify that uploaded documents appear in search results once their status is ready.
- Implemented assertions to ensure the uploaded document's ID is present in the search response.
2026-02-26 03:33:37 +05:30
Anish Sarkar
9ccee054a5 chore: ran linting 2026-02-26 03:05:20 +05:30
Anish Sarkar
7332be956e refactor: reorganize test fixtures and helpers for improved clarity and maintainability 2026-02-26 03:04:56 +05:30
Anish Sarkar
f59a70f7a5 Merge remote-tracking branch 'upstream/dev' into feat/document-test 2026-02-26 02:22:10 +05:30
Anish Sarkar
d6e442b466 fix: Improve document upload tests by adding assertions for document IDs and handling transient transport errors 2026-02-26 01:50:57 +05:30
Anish Sarkar
25df3dff64 chore: ran linting 2026-02-26 01:29:25 +05:30
Anish Sarkar
380c1c3877 fix: Refactor document ID usage in file processing to improve clarity 2026-02-26 01:28:09 +05:30
Anish Sarkar
a57ab02900 feat: Implement file upload limits and page limit enforcement in backend
- Added constants for maximum files per upload, per-file size, and total upload size.
- Enhanced document upload route to validate file counts and sizes, returning appropriate HTTP errors.
- Introduced end-to-end tests for upload limits and page limit enforcement, ensuring correct behavior under various scenarios.
- Updated test helpers to support notification retrieval for page limit exceeded scenarios.
2026-02-26 01:25:34 +05:30
CREDO23
0de74f4bf7 add docstrings to all indexing pipeline tests 2026-02-25 20:30:31 +02:00
CREDO23
4293910e8e plug file upload into indexing pipeline adapter and add integration tests 2026-02-25 20:20:52 +02:00
CREDO23
cad400be1b add file upload adapter and make index() return refreshed document 2026-02-25 19:56:59 +02:00
Anish Sarkar
853750e6e9 chore: ran linting 2026-02-25 20:08:20 +05:30
Anish Sarkar
a7df26d451 refactor: remove redundant status assertion 2026-02-25 20:07:34 +05:30
Anish Sarkar
c564e5f768 feat: Enhance test document purging by implementing direct database access 2026-02-25 19:51:29 +05:30
Anish Sarkar
c3273af20b feat: Add fixture to purge all test search space documents before a new session 2026-02-25 18:59:23 +05:30
Anish Sarkar
03d8788241 chore: Enhance pytest configuration and add custom markers in pyproject.toml 2026-02-25 17:51:33 +05:30
Anish Sarkar
4ff712578d refactor: Enhance test utilities for document upload by integrating search space handling
- Updated test fixtures to include search space ID retrieval for improved document upload tests.
- Refactored authentication and document upload functions to accept search space ID as a parameter.
- Removed hardcoded search space ID references to streamline test configurations.
2026-02-25 17:29:09 +05:30
CREDO23
ca870cf660 add fallback document sumary 2026-02-25 13:47:36 +02:00
Anish Sarkar
41eb68663a feat: Add end-to-end tests for document upload pipeline and shared test utilities
- Introduced new test files for end-to-end testing of document uploads, including support for .txt, .md, and .pdf formats.
- Created shared fixtures and helper functions for authentication, document management, and cleanup.
- Added sample documents for testing purposes.
- Established a conftest.py file to provide reusable fixtures across test modules.
2026-02-25 16:39:45 +05:30
CREDO23
36d1fba75f fix: isolate per-document errors in prepare_for_indexing 2026-02-25 13:00:34 +02:00
CREDO23
e6b7ce7345 fix: handle IntegrityError in prepare_for_indexing and add within-batch content dedup test 2026-02-25 12:03:00 +02:00
CREDO23
1b4ed35de3 fix: correct test fixtures and add missing summarizer tests 2026-02-25 11:15:48 +02:00
CREDO23
c5ae62140d fix: rescue stuck documents with unchanged content on next indexing run 2026-02-25 11:13:25 +02:00
CREDO23
0363cb9c17 fix: updated_at on title change, LLM fallback, stale chunks deleted on re-index 2026-02-25 08:40:13 +02:00
CREDO23
af22fa7c88 refactor: remove redundant and low-value tests, enforce connector_id and created_by_id constraints 2026-02-25 08:29:53 +02:00
CREDO23
5b616eac5a fix: plug all gaps found in deep review of indexing pipeline 2026-02-25 02:20:44 +02:00
CREDO23
46c7ccd70b rename test_index.py to test_index_document.py 2026-02-25 01:43:21 +02:00
CREDO23
61e50834e6 feat: implement and test index method 2026-02-25 01:40:30 +02:00
CREDO23
497ed681d5 feat: implement and test index happy path 2026-02-25 00:30:11 +02:00
CREDO23
579a9e2cb5 feat: implement and test prepare_for_indexing 2026-02-25 00:06:34 +02:00
CREDO23
a0134a5830 test: add document hashing unit tests and clean up conftest mocks 2026-02-24 22:48:40 +02:00
CREDO23
d5e10bd8f9 test: add ConnectorDocument unit tests and factory fixture 2026-02-24 22:20:08 +02:00
CREDO23
10a6ba6924 test: bootstrap pytest environment for backend 2026-02-24 18:19:56 +02:00