SurfSense

mirror of https://github.com/MODSetter/SurfSense.git synced 2026-06-26 21:39:43 +02:00

Author	SHA1	Message	Date
CREDO23	2beafbdec8	agent: retire eager KB priority/planner path and its dead flags The pull-based KB design (on-demand search_knowledge_base tool + pre-injected workspace tree) fully replaced the old eager retrieval path. Remove its last remnants: - Delete KnowledgePriorityMiddleware (knowledge_search.py) and its tests. - Drop the kb_priority state field + reducer default; trim KbContextProjectionMiddleware to project only workspace_tree_text. - Remove the now-dead feature flags enable_kb_priority_preinjection and enable_kb_planner_runnable across backend (flags, route schema, tests, env examples) and frontend (settings toggle, zod schema). - Scrub <priority_documents> and stale KnowledgePriorityMiddleware references from prompts, docstrings, and the ADR. No functional change: nothing wrote kb_priority and neither flag gated live behavior after the cutover. Full backend suite green (pre-existing unrelated failures aside).	2026-06-25 18:37:14 +02:00
CREDO23	ce15016533	citations: consolidate prompts, retire eager path, refresh ADR Rewrite the main-agent citation contract to a single [n] channel and sync the orphaned system_prompt_composer surface to match; drop stale [citation:chunk_id] / <chunk_index> references from dynamic_context and provider hints. Reuse the shared hybrid search in the deliverables report (citations omitted for now) and delete the orphaned report KB helper. Remove the dead eager KnowledgePriorityMiddleware wiring (knowledge_priority + stack) and its legacy browse test. Update ADR 0001 to reflect the cutover.	2026-06-25 15:27:09 +02:00
CREDO23	c98bdea5cf	search-kb: on-demand KB tool on the [n] spine; drop kb_matched_chunk_ids The main agent's search_knowledge_base tool runs the hybrid spine, renders a <retrieved_context> of numbered [n] passages, and persists the registry. KB subagent prompts teach citing [n] from <document view="full"> reads (evidence.chunk_ids -> evidence.citations). Delete the now-unused search->read highlighting hand-off: the kb_matched_chunk_ids state field, its reducer default, the tool's _matched_chunk_ids writer, and the dead KnowledgePriorityMiddleware writes.	2026-06-25 15:26:39 +02:00
CREDO23	852ab3a576	retrieval: add hybrid search behavior tests	2026-06-25 09:15:59 +02:00
Anish Sarkar	3695e1d5c5	Merge remote-tracking branch 'upstream/dev' into feat/api-key	2026-06-23 13:09:53 +05:30
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	a08de01cc7	Revert "Merge pull request #1523 from CREDO23/fix/chat-citations" This reverts commit `cd2242147a`, reversing changes made to `a4bb0a5253`.	2026-06-22 22:55:29 -07:00
Anish Sarkar	fd31ac34fd	Merge remote-tracking branch 'upstream/dev' into feat/api-key	2026-06-20 10:50:03 +05:30
Anish Sarkar	1e8baa10ec	refactor(routes): replace user variable with auth context in tests	2026-06-20 03:34:40 +05:30
Anish Sarkar	af5a112212	refactor(auth): replace user variable with auth context in integration and unit tests	2026-06-20 03:11:00 +05:30
Anish Sarkar	b3fa96bef8	test(auth): cover PAT fail-closed authorization	2026-06-20 02:13:05 +05:30
CREDO23	31a14190e3	fix: update upload conftest mock to span-aware chunker	2026-06-19 19:36:26 +02:00
CREDO23	773f913f06	test: cover by-chunk span and line-range resolve	2026-06-19 15:31:44 +02:00
CREDO23	a2a92c592f	test: assert hybrid search returns chunk spans	2026-06-19 14:53:49 +02:00
CREDO23	c376fbaf61	test: seed chunk spans in retriever fixture	2026-06-19 14:53:49 +02:00
CREDO23	5a315eafd3	test: verify note write chunk spans	2026-06-18 20:17:45 +02:00
CREDO23	03012c3077	test: span-aware paragraph chunker fixture	2026-06-18 20:06:33 +02:00
CREDO23	a7cf9bd946	test: mock span chunker in reindex test	2026-06-18 20:06:33 +02:00
CREDO23	12e948cad1	test: mock span chunker in integration fixtures	2026-06-18 20:06:33 +02:00
CREDO23	60fff66ee0	test: verify chunk span persistence on index	2026-06-18 20:06:33 +02:00
CREDO23	b446897638	test: editor read paths never reconstruct body from chunks	2026-06-18 19:23:49 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	55f91a29d5	chore: linting	2026-06-17 22:31:36 -07:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	b89866541e	chore: linting	2026-06-17 20:50:07 -07:00
CREDO23	2db1615195	test long filename document processing notifications	2026-06-17 15:06:05 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	0fe650fd8e	Merge commit '`7ce409c580`' into dev	2026-06-16 22:48:14 -07:00
Dmitry Maranik	81fc467187	test(connectors): regression tests for cross-search-space index authorization Two integration tests pinning the connector index endpoint's authorization: - cross-space index (attacker owns space B, connector lives in victim's space A, request passes search_space_id=B) is rejected with 404 at the search-space reconciliation, before the permission check (which would otherwise pass for the attacker's own space). - same-space index authorizes check_permission against the connector's own search space, not the caller-supplied query param. Mirrors the existing tests/integration harness (direct handler calls with the savepoint-rolled-back db_session; check_permission patched so the test needs no real RBAC wiring).	2026-06-16 16:18:40 -07:00
CREDO23	7a415b61ea	test: align QuotaInsufficientError fixtures with balance_micros API Billable calls now raise quota errors with balance_micros instead of used_micros/limit_micros; update mocks so CI passes on main.	2026-06-16 23:56:11 +02:00
CREDO23	38991c7db8	test(podcasts): update integration fixtures for second-based duration	2026-06-16 23:38:28 +02:00
CREDO23	1048d0afc3	test(podcasts): cover public stream missing-object 404	2026-06-16 20:09:08 +02:00
CREDO23	810ded2dde	test(podcasts): cover in-flight 409 and missing-object 404	2026-06-16 20:09:08 +02:00
CREDO23	86a8833fb4	test(podcasts): add exists to fake storage backend	2026-06-16 20:09:08 +02:00
CREDO23	dcebfc4756	Merge remote-tracking branch 'upstream/dev' into features/documents-injestion-layered-cached	2026-06-12 19:35:34 +02:00
CREDO23	311570b4f0	test(indexing): cover the edit path and make integration caches hermetic Real-DB tests assert unchanged chunk rows survive edits, only new text is embedded, removed rows are deleted with positions compacted, and the kill switch restores full-replace. An autouse fixture disables the ETL/embedding caches so a developer's .env can't leak cache hits into unrelated tests.	2026-06-12 18:53:21 +02:00
CREDO23	412493ae08	test(embedding-cache): add integration tests for service, repository, and store Covers the public cache surface against real Postgres and a real local file backend (no mocks): recall miss, remember->recall vector/text/order round-trip, the dimension-mismatch refusal, the repository SQL behind eviction and dedup (size sum, coldest ordering, TTL cutoff, duplicate-key no-op, reuse counter), and the blob store save/load round-trip and delete.	2026-06-12 17:33:21 +02:00
CREDO23	8cf578d965	test(index-cache): add unit tests and repoint embed/chunk patch targets	2026-06-12 16:48:18 +02:00
CREDO23	99cf212c31	test: fix auth-mode mismatch and stale QuotaInsufficientError kwargs Pin AUTH_TYPE=LOCAL (and REGISTRATION_ENABLED=TRUE) in the test bootstrap so the email/password auth routers mount during integration tests regardless of a developer's .env=GOOGLE; without this the upload tests 404 on registration. Also update three tests to the current QuotaInsufficientError signature (balance_micros) after used_micros/limit_micros were removed.	2026-06-12 12:19:49 +02:00
CREDO23	d5e0280097	test(etl-cache): cover two-phase eviction task on real infra	2026-06-12 11:54:36 +02:00
CREDO23	1460173dad	test(etl-cache): cover extract_with_cache end-to-end	2026-06-12 11:50:57 +02:00
CREDO23	c49a0f1233	test(etl-cache): cover store, service, and repository on real infra	2026-06-12 11:50:57 +02:00
Rohan Verma	4c28ba5295	Merge pull request #1487 from CREDO23/improvement-podcast-graph [Feat] Podcast: Backend-owned language offering for the brief form	2026-06-12 00:58:02 -07:00
CREDO23	0c7e5dee8b	test(podcast): align quota error kwargs with wallet refactor	2026-06-12 07:38:38 +02:00
CREDO23	402ae6befe	test(podcast): languages endpoint	2026-06-12 07:38:38 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	05190da0a9	chore: linting	2026-06-11 15:31:43 -07:00
CREDO23	41f4a58663	Merge remote-tracking branch 'upstream/dev' into improvement-podcast-graph # Conflicts: # surfsense_backend/app/tasks/celery_tasks/podcast_tasks.py	2026-06-11 23:14:49 +02:00
CREDO23	ca9b157676	fix(podcasts): keep legacy episodes readable and guard regenerate	2026-06-11 12:43:07 +02:00
CREDO23	aa7f14d94f	feat(podcasts): add revert-regeneration and surface cancel on the live card	2026-06-11 12:31:42 +02:00
CREDO23	eb56acc407	refactor(podcasts): regenerate via brief gate, render brief inline in chat	2026-06-11 11:45:17 +02:00
CREDO23	11a6b178a0	refactor(podcasts): drop transcript gate, add regenerate-from-ready and voice previews	2026-06-11 10:42:13 +02:00
CREDO23	c84525897b	test(podcasts): relocate stateful tests to integration Move the lifecycle service, Celery task bodies, and mark_failed coverage out of DB-faking unit tests and into integration tests against a real Postgres, faking only true externals (broker, object store, TTS, ffmpeg, billing, LLM). Add HTTP slices for cancel, voices, scoping, and public-chat streaming. The unit tier is now fake-free pure logic with no session doubles.	2026-06-11 06:27:00 +02:00
$DESKTOP-RTLN3BA\$punk$ DESKTOP-RTLN3BA\$punk	a7407502d3	feat(refactor): refactor payment system to implement unified credit wallet. - Updated environment variables and - configurations for credit purchases via Stripe, replacing legacy page pack system. - Introduced auto-reload feature for credit top-ups and modified database models to track credit transactions. - Updated notification system to handle insufficient credits and auto-reload failures. - Adjusted API routes and schemas to reflect changes in credit management.	2026-06-10 16:49:03 -07:00
CREDO23	59c1cf14c7	test(indexers): cover mark_connector_documents_failed behavior	2026-06-10 00:11:00 +02:00

1 2 3 4

158 commits