40 commits

Author SHA1 Message Date
CREDO23
c6cb754aac refactor: update the webcrawler index to compare hashes without metadata 2025-12-17 18:44:58 +02:00
DESKTOP-RTLN3BA\$punk
8c9aa68faa feat: update document tracking to use 'updated_at' timestamp instead of 'last_edited_at' 2025-12-12 01:32:14 -08:00
Differ
500bc60d02 fix: add input validation, retry limit, code formatting, and exclude i18n from secret detection 2025-12-05 09:58:49 +08:00
Differ
6b1b8d0f2e feat: add BookStack connector for wiki documentation indexing 2025-12-04 14:08:44 +08:00
DESKTOP-RTLN3BA\$punk
ab6ea7e0ab feat(UI): reorganized connectors 2025-11-26 13:44:38 -08:00
DESKTOP-RTLN3BA\$punk
8f30cfd69a chore(lint): ruff checks 2025-11-26 13:22:31 -08:00
samkul-swe
121e2f0c0e Renaming resources 2025-11-22 19:19:00 -08:00
samkul-swe
896e410e2a Webcrawler connector draft 2025-11-21 23:27:21 -08:00
DESKTOP-RTLN3BA\$punk
a3a5b13f48 chore: linting 2025-11-03 16:00:58 -08:00
DESKTOP-RTLN3BA\$punk
e65d74f2e2 refactor: added batch commits and Increased task time limits in celery_app.py
- Increased task time limits in celery_app.py for longer processing times.
- Enhanced pagination logic in NotionHistoryConnector to handle large result sets.
- Implemented batch commits every 10 documents across various indexers (Airtable, ClickUp, Confluence, Discord, GitHub, Google Calendar, Gmail, JIRA, Linear, Luma, Notion, Slack) to improve performance and reduce database load.
- Updated final commit logging for clarity on total documents processed.
2025-11-03 15:57:19 -08:00
DESKTOP-RTLN3BA\$punk
0e6669ac4e fix: celery_app path and gmail indexing 2025-10-21 21:11:41 -07:00
DESKTOP-RTLN3BA\$punk
5b957ec21c feat: bumped version to v0.0.8 2025-10-16 22:44:12 -07:00
Anish Sarkar
bbb2abfc02 fix: ran formatter as per coderabbitai 2025-10-17 02:44:44 +05:30
Anish Sarkar
0ff1b586a2 feat: update Elasticsearch integration and logging
- revised Elasticsearch connector enum revision IDs
- added `TaskLoggingService` to elasticsearch_indexer
- integrated Elasticsearch into prompts.py as requested
2025-10-17 02:21:56 +05:30
Anish Sarkar
82438c7396 refactor: streamline Elasticsearch indexing by removing unused services and integrating document chunking, also added documentation 2025-10-16 17:48:28 +05:30
Anish Sarkar
929035f802 Merge remote-tracking branch 'upstream/main' into feature/elasticsearch-connector 2025-10-16 16:24:37 +05:30
DESKTOP-RTLN3BA\$punk
c99cd710ea feat: add unique identifier hash for documents to prevent duplicates across various connectors 2025-10-14 21:11:19 -07:00
DESKTOP-RTLN3BA\$punk
31982cea9a chore: removed content trunking for better UI 2025-10-14 14:19:48 -07:00
Anish Sarkar
72e8d98f40 feat: enhance Elasticsearch connector to handle missing index configuration 2025-10-12 10:10:19 +05:30
Anish Sarkar
55d752e3c8 feat: added elasticsearch connector 2025-10-12 09:39:04 +05:30
DESKTOP-RTLN3BA\$punk
633ea3ac0f feat: moved LLMConfigs from User to SearchSpaces
- RBAC soon??
- Updated various services and routes to handle search space-specific LLM preferences.
- Modified frontend components to pass search space ID for LLM configuration management.
- Removed onboarding page and settings page as part of the refactor.
2025-10-10 00:50:29 -07:00
DESKTOP-RTLN3BA\$punk
aea09a5dad feat: Moved searchconnectors association from user to searchspace
- Need to move llm configs to searchspace
2025-10-08 21:13:01 -07:00
DESKTOP-RTLN3BA\$punk
94367e4226 chore: linting and formatting 2025-09-28 22:26:26 -07:00
Rohan Verma
ef361e16b4
Merge pull request #337 from samkul-swe/feature/add-luma-connector
[Feature] Add Luma connector
2025-09-28 22:14:15 -07:00
samkul-swe
9d2b808e66 Added Luma connector 2025-09-28 14:59:10 -07:00
CREDO23
8f9f66b7f8 handle token refreshing when expired 2025-09-21 21:14:03 +02:00
Rohan Verma
662212d4e2
Merge pull request #295 from CREDO23/feature/airtable-connector
[Feature] Add Airtable connector
2025-09-03 12:49:14 -07:00
Rohan Verma
c2030cec48
Merge pull request #275 from CREDO23/improvement/persist-refreshed-token-in-google-related-connector
[Improvement] Google connectors | Update the connector config after refreshing the token
2025-08-26 18:47:36 -07:00
CREDO23
45d2c18c16 update airtable indexer 2025-08-26 19:17:46 +02:00
CREDO23
55d0cc4d0d Add airtable indexer 2025-08-26 15:42:42 +02:00
CREDO23
ecbb1f27e0 clean up 2025-08-26 11:53:27 +02:00
CREDO23
85664f2ff8 update the connector config after refreshing google calendar access token 2025-08-26 11:49:31 +02:00
DESKTOP-RTLN3BA\$punk
3b87ecc3c5 fix: made notion indexing async 2025-08-21 14:43:04 -07:00
DESKTOP-RTLN3BA\$punk
f443a6636f fix: slack indexing
- Individual messages as Documents instead of concatenating them.
2025-08-21 14:23:52 -07:00
CREDO23
9711af2b72 refresh the token when expired 2025-08-21 01:09:13 +02:00
CREDO23
b0b6df0971 updated the connector config after refreshing the token 2025-08-20 20:32:08 +02:00
DESKTOP-RTLN3BA\$punk
1c4c61eb04 feat: Fixed Document Summary Content across connectors and processors 2025-08-18 20:51:48 -07:00
CREDO23
089c9d1625 use new indexer files structure 2025-08-15 10:11:50 +02:00
DESKTOP-RTLN3BA\$punk
54374bd7be ruff format 2025-08-12 15:33:17 -07:00
DESKTOP-RTLN3BA\$punk
5aa52375c3 refactor: refactored background_tasks & indexing_tasks 2025-08-12 15:28:13 -07:00