cybermaggedon
52b133fc86
Collection delete pt. 3 ( #542 )
...
* Fixing collection deletion
* Fixing collection management param error
* Always test for collections
* Add Cassandra collection table
* Updated tech spec for explicit creation/deletion
* Remove implicit collection creation
* Fix up collection tracking in all processors
2025-09-30 16:02:33 +01:00
cybermaggedon
13ff7d765d
Collection management ( #520 )
...
* Tech spec
* Refactored Cassanda knowledge graph for single table
* Collection management, librarian services to manage metadata and collection deletion
2025-09-18 15:57:52 +01:00
cybermaggedon
dd70aade11
Implement logging strategy ( #444 )
...
* Logging strategy and convert all prints() to logging invocations
2025-07-30 23:18:38 +01:00
cybermaggedon
f37decea2b
Increase storage test coverage ( #435 )
...
* Fixing storage and adding tests
* PR pipeline only runs quick tests
2025-07-15 09:33:35 +01:00
cybermaggedon
a9197d11ee
Feature/configure flows ( #345 )
...
- Keeps processing in different flows separate so that data can go to different stores / collections etc.
- Potentially supports different processing flows
- Tidies the processing API with common base-classes for e.g. LLMs, and automatic configuration of 'clients' to use the right queue names in a flow
2025-04-22 20:21:38 +01:00
cybermaggedon
f350abb415
Maint/asyncio ( #305 )
...
* Move to asyncio services, even though everything is largely sync
2025-02-11 23:24:46 +00:00
cybermaggedon
6aa212061d
Fix/document embeddings ( #247 )
...
* Update schema for doc embeddings
* Rename embeddings-vectorize to graph-embeddings
* Added document-embeddings processor (broken, needs fixing)
* Added scripts
* Fixed DE queue schema
* Add missing DE process
* Fix doc RAG processing, put graph-rag and doc-rag in appropriate component files.
2025-01-04 21:51:28 +00:00
cybermaggedon
319f9ac04a
Feature/pinecone integration ( #170 )
...
* Added Pinecone for GE write & query
* Add templates
* Doc embedding support
2024-11-22 23:48:21 +00:00