mirror of
https://github.com/VectifyAI/PageIndex.git
synced 2026-04-24 23:56:21 +02:00
Update README
This commit is contained in:
parent
0ba6206ef0
commit
8f1ed7783b
1 changed files with 26 additions and 20 deletions
46
README.md
46
README.md
|
|
@ -16,10 +16,10 @@
|
||||||
<p align="center"><b>Reasoning-based RAG ◦ No Vector DB ◦ No Chunking ◦ Human-like Retrieval</b></p>
|
<p align="center"><b>Reasoning-based RAG ◦ No Vector DB ◦ No Chunking ◦ Human-like Retrieval</b></p>
|
||||||
|
|
||||||
<h4 align="center">
|
<h4 align="center">
|
||||||
<a href="https://vectify.ai">🏠 Homepage</a> •
|
<a href="https://vectify.ai">🌐 Homepage</a> •
|
||||||
<a href="https://chat.pageindex.ai">🖥️ Chat Platform</a> •
|
<a href="https://chat.pageindex.ai">🖥️ Chat Platform</a> •
|
||||||
<a href="https://pageindex.ai/developer">🔌 MCP</a> •
|
<a href="https://pageindex.ai/developer">🔌 MCP & API</a> •
|
||||||
<a href="https://docs.pageindex.ai">📚 Docs</a> •
|
<a href="https://docs.pageindex.ai">📖 Docs</a> •
|
||||||
<a href="https://discord.com/invite/VuXuf29EUj">💬 Discord</a> •
|
<a href="https://discord.com/invite/VuXuf29EUj">💬 Discord</a> •
|
||||||
<a href="https://ii2abc2jejf.typeform.com/to/tK3AXl8T">✉️ Contact</a>
|
<a href="https://ii2abc2jejf.typeform.com/to/tK3AXl8T">✉️ Contact</a>
|
||||||
</h4>
|
</h4>
|
||||||
|
|
@ -30,9 +30,9 @@
|
||||||
<details open>
|
<details open>
|
||||||
<summary><h2>📢 Updates</h2></summary>
|
<summary><h2>📢 Updates</h2></summary>
|
||||||
|
|
||||||
- 🔥 [**Agentic Vectorless RAG**](https://github.com/VectifyAI/PageIndex/blob/main/examples/agentic_vectorless_rag_demo.py): A simple *agentic, vectorless RAG* [example](https://github.com/VectifyAI/PageIndex/blob/main/examples/agentic_vectorless_rag_demo.py) with self-hosted PageIndex, using OpenAI Agents SDK.
|
- 🔥 [**Agentic Vectorless RAG**](https://github.com/VectifyAI/PageIndex/blob/main/examples/agentic_vectorless_rag_demo.py) — A simple *agentic, vectorless RAG* [example](https://github.com/VectifyAI/PageIndex/blob/main/examples/agentic_vectorless_rag_demo.py) with self-hosted PageIndex, using OpenAI Agents SDK.
|
||||||
- [PageIndex Chat](https://chat.pageindex.ai): A Human-like document analysis agent [platform](https://chat.pageindex.ai) for professional long documents. Also available via [MCP](https://pageindex.ai/developer) or [API](https://pageindex.ai/developer).
|
- [PageIndex Chat](https://chat.pageindex.ai) — Human-like document analysis agent [platform](https://chat.pageindex.ai) for professional long documents. Also available via [MCP](https://pageindex.ai/developer) or [API](https://pageindex.ai/developer).
|
||||||
- [PageIndex Framework](https://pageindex.ai/blog/pageindex-intro): The PageIndex framework — an *agentic, in-context tree index* that enables LLMs to perform *reasoning-based, human-like retrieval* over long documents.
|
- [PageIndex Framework](https://pageindex.ai/blog/pageindex-intro) — Deep dive into PageIndex: an *agentic, in-context tree index* that enables LLMs to perform *reasoning-based, human-like retrieval* over long documents.
|
||||||
|
|
||||||
<!-- **🧪 Cookbooks:**
|
<!-- **🧪 Cookbooks:**
|
||||||
- [Vectorless RAG](https://docs.pageindex.ai/cookbook/vectorless-rag-pageindex): A minimal, hands-on example of reasoning-based RAG using PageIndex. No vectors, no chunking, and human-like retrieval.
|
- [Vectorless RAG](https://docs.pageindex.ai/cookbook/vectorless-rag-pageindex): A minimal, hands-on example of reasoning-based RAG using PageIndex. No vectors, no chunking, and human-like retrieval.
|
||||||
|
|
@ -66,11 +66,11 @@ Compared to traditional vector-based RAG, **PageIndex** features:
|
||||||
- **Human-like Retrieval**: Simulates how human experts navigate and extract knowledge from complex documents.
|
- **Human-like Retrieval**: Simulates how human experts navigate and extract knowledge from complex documents.
|
||||||
- **Better Explainability and Traceability**: Retrieval is based on reasoning — traceable and interpretable, with page and section references. No more opaque, approximate vector search (“vibe retrieval”).
|
- **Better Explainability and Traceability**: Retrieval is based on reasoning — traceable and interpretable, with page and section references. No more opaque, approximate vector search (“vibe retrieval”).
|
||||||
|
|
||||||
PageIndex powers a reasoning-based RAG system that achieved **state-of-the-art** [98.7% accuracy](https://github.com/VectifyAI/Mafin2.5-FinanceBench) on FinanceBench, demonstrating superior performance over vector-based RAG solutions in professional document analysis (see our [blog post](https://vectify.ai/blog/Mafin2.5) for details).
|
PageIndex powers a reasoning-based RAG system that achieved **state-of-the-art** [98.7% accuracy](https://github.com/VectifyAI/Mafin2.5-FinanceBench) on FinanceBench, demonstrating superior performance over vector-based RAG solutions in professional document analysis. See our [blog post](https://vectify.ai/blog/Mafin2.5) for details.
|
||||||
|
|
||||||
### 📍 Explore PageIndex
|
### 📍 Explore PageIndex
|
||||||
|
|
||||||
To learn more, please see a detailed introduction of the [PageIndex framework](https://pageindex.ai/blog/pageindex-intro). Check out this GitHub repo for open-source code, and the [cookbooks](https://docs.pageindex.ai/cookbook), [tutorials](https://docs.pageindex.ai/tutorials), and [blog](https://pageindex.ai/blog) for additional usage guides and examples.
|
To learn more, please see a detailed introduction to the [PageIndex framework](https://pageindex.ai/blog/pageindex-intro). Check out this GitHub repo for open-source code, and the [cookbooks](https://docs.pageindex.ai/cookbook), [tutorials](https://docs.pageindex.ai/tutorials), and [blog](https://pageindex.ai/blog) for additional usage guides and examples.
|
||||||
|
|
||||||
The PageIndex service is available as a ChatGPT-style [chat platform](https://chat.pageindex.ai), or can be integrated via [MCP](https://pageindex.ai/developer) or [API](https://pageindex.ai/developer).
|
The PageIndex service is available as a ChatGPT-style [chat platform](https://chat.pageindex.ai), or can be integrated via [MCP](https://pageindex.ai/developer) or [API](https://pageindex.ai/developer).
|
||||||
|
|
||||||
|
|
@ -191,9 +191,9 @@ python3 run_pageindex.py --md_path /path/to/your/document.md
|
||||||
> Note: in this mode, we use "#" to determine node headings and their levels. For example, "##" is level 2, "###" is level 3, etc. Make sure your markdown file is formatted correctly. If your Markdown file was converted from a PDF or HTML, we don't recommend using this mode, since most existing conversion tools cannot preserve the original hierarchy. Instead, use our [PageIndex OCR](https://pageindex.ai/blog/ocr), which is designed to preserve the original hierarchy, to convert the PDF to a markdown file and then use this mode.
|
> Note: in this mode, we use "#" to determine node headings and their levels. For example, "##" is level 2, "###" is level 3, etc. Make sure your markdown file is formatted correctly. If your Markdown file was converted from a PDF or HTML, we don't recommend using this mode, since most existing conversion tools cannot preserve the original hierarchy. Instead, use our [PageIndex OCR](https://pageindex.ai/blog/ocr), which is designed to preserve the original hierarchy, to convert the PDF to a markdown file and then use this mode.
|
||||||
</details>
|
</details>
|
||||||
|
|
||||||
### Agentic Vectorless RAG Example
|
## Agentic Vectorless RAG: An Example
|
||||||
|
|
||||||
For a simple, end-to-end _**agentic vectorless RAG**_ example using PageIndex (with OpenAI Agents SDK), see [`examples/agentic_vectorless_rag_demo.py`](examples/agentic_vectorless_rag_demo.py).
|
For a simple, end-to-end _**agentic vectorless RAG**_ example using PageIndex with OpenAI Agents SDK, see [`examples/agentic_vectorless_rag_demo.py`](examples/agentic_vectorless_rag_demo.py).
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
# Install optional dependency
|
# Install optional dependency
|
||||||
|
|
@ -238,14 +238,21 @@ Explore the full [benchmark results](https://github.com/VectifyAI/Mafin2.5-Finan
|
||||||
|
|
||||||
# 🧭 Resources
|
# 🧭 Resources
|
||||||
|
|
||||||
* 🧪 [Cookbooks](https://docs.pageindex.ai/cookbook/vectorless-rag-pageindex): hands-on, runnable examples and advanced use cases.
|
|
||||||
* 📖 [Tutorials](https://docs.pageindex.ai/doc-search): practical guides and strategies, including *Document Search* and *Tree Search*.
|
|
||||||
* 📝 [Blog](https://pageindex.ai/blog): technical articles, research insights, and product updates.
|
* 📝 [Blog](https://pageindex.ai/blog): technical articles, research insights, and product updates.
|
||||||
* 🔧 [Developer](https://pageindex.ai/developer): MCP setup, API docs, and integration guides.
|
* 🔧 [Developer](https://pageindex.ai/developer): MCP setup, API docs, and integration guides.
|
||||||
|
* 🧪 [Cookbooks](https://docs.pageindex.ai/cookbook): hands-on, runnable examples and advanced use cases.
|
||||||
|
* 📖 [Tutorials](https://docs.pageindex.ai/tutorials): practical guides and strategies, including *Document Search* and *Tree Search*.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
# ⭐ Support Us
|
# ⭐ Support Us
|
||||||
|
|
||||||
|
Leave us a star 🌟 if you like our project. Thank you!
|
||||||
|
|
||||||
|
<p>
|
||||||
|
<img src="https://github.com/user-attachments/assets/eae4ff38-48ae-4a7c-b19f-eab81201d794" width="80%">
|
||||||
|
</p>
|
||||||
|
|
||||||
Please cite this work as:
|
Please cite this work as:
|
||||||
```
|
```
|
||||||
Mingtian Zhang, Yu Tang and PageIndex Team,
|
Mingtian Zhang, Yu Tang and PageIndex Team,
|
||||||
|
|
@ -268,19 +275,18 @@ PageIndex Blog, Sep 2025.
|
||||||
```
|
```
|
||||||
</details>
|
</details>
|
||||||
|
|
||||||
Leave us a star 🌟 if you like our project. Thank you!
|
|
||||||
|
|
||||||
<p>
|
|
||||||
<img src="https://github.com/user-attachments/assets/eae4ff38-48ae-4a7c-b19f-eab81201d794" width="80%">
|
|
||||||
</p>
|
|
||||||
|
|
||||||
### Connect with Us
|
### Connect with Us
|
||||||
|
|
||||||
[](https://x.com/PageIndexAI)
|
<div align="center">
|
||||||
[](https://www.linkedin.com/company/vectify-ai/)
|
|
||||||
[](https://discord.com/invite/VuXuf29EUj)
|
[](https://x.com/PageIndexAI) 
|
||||||
|
[](https://www.linkedin.com/company/vectify-ai/) 
|
||||||
|
[](https://discord.com/invite/VuXuf29EUj) 
|
||||||
[](https://ii2abc2jejf.typeform.com/to/tK3AXl8T)
|
[](https://ii2abc2jejf.typeform.com/to/tK3AXl8T)
|
||||||
|
|
||||||
|
</div>
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
© 2026 [Vectify AI](https://vectify.ai)
|
© 2026 [Vectify AI](https://vectify.ai)
|
||||||
|
|
|
||||||
Loading…
Add table
Add a link
Reference in a new issue