From 2b0cd8728103fa0e578225dbb6cb3d3676a51ff4 Mon Sep 17 00:00:00 2001 From: Mingtian Zhang Date: Thu, 3 Apr 2025 13:38:46 +0800 Subject: [PATCH 1/2] Update README.md --- README.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.md b/README.md index 08fac90..89c6180 100644 --- a/README.md +++ b/README.md @@ -1,9 +1,9 @@ # PageIndex ### **Document Index System for Reasoning-Based RAG** -Frustrated with vector database retrieval accuracy for long professional documents? You need a reasoning-based native index for your RAG system. +Are you frustrated with vector database retrieval accuracy for long professional documents? You need a reasoning-based native index for your RAG system. -Traditional vector-based retrieval relies heavily on semantic similarity. But when working with professional documents that require domain expertise and multi-step reasoning, similarity search often falls short. +Traditional vector-based retrieval relies heavily on semantic similarity. However, when working with professional documents that require domain expertise and multi-step reasoning, similarity search often falls short. **Reasoning-Based RAG** offers a better alternative: enabling LLMs to *think* and *reason* their way to the most relevant document sections. Inspired by **AlphaGo**, we leverage **tree search** to perform structured document retrieval. @@ -27,7 +27,7 @@ It’s ideal for: financial reports, regulatory filings, academic textbooks, leg Enables LLMs to traverse documents logically—like an intelligent, LLM-optimized table of contents. - **Precise Page Referencing** - Every node contains its own summary and start/end page physical index, allowing pinpoint retrieval. + Every node contains its summary and start/end page physical index, allowing pinpoint retrieval. - **Chunk-Free Segmentation** No arbitrary chunking. Nodes follow the natural structure of the document. @@ -145,4 +145,4 @@ Need customized support for your documents or reasoning-based RAG system? :loudspeaker: [Join our Discord](https://discord.com/invite/nnyyEdT2RG) -:envelope: [Leave us a Message](https://ii2abc2jejf.typeform.com/to/meB40zV0) +:envelope: [Leave us a message](https://ii2abc2jejf.typeform.com/to/meB40zV0) From 09621109a746ee0affb8d8aba1541ad2914c47df Mon Sep 17 00:00:00 2001 From: Mingtian Zhang Date: Thu, 3 Apr 2025 13:45:33 +0800 Subject: [PATCH 2/2] Update README.md --- README.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/README.md b/README.md index 89c6180..ec5acf2 100644 --- a/README.md +++ b/README.md @@ -139,6 +139,14 @@ PageIndex’s hierarchical indexing enabled precise navigation and extraction of 👉 See full [benchmark results](https://github.com/VectifyAI/Mafin2.5-FinanceBench) for detailed comparisons and performance metrics. +## 🚧 Notice + +This project is in its early beta development, and all progress will remain open and transparent. +Due to the non-deterministic nature of large language models (LLMs) and the diverse structures of PDF documents, you may encounter bugs or instability during usage. + +We welcome you to raise issues, reach out with questions, or contribute directly to the project. +Together, let's push forward the revolution of reasoning-based RAG systems. + ## 📬 Contact Us Need customized support for your documents or reasoning-based RAG system?