diff --git a/README.md b/README.md index 1ed8ee1..138f2ac 100644 --- a/README.md +++ b/README.md @@ -27,10 +27,21 @@ --- -## ๐Ÿšจ **New Releases:** +### ๐Ÿšจ New Releases: - ๐Ÿ“– [**PageIndex Chat**](https://chat.pageindex.ai): World's first human-like document analyst agent, designed for professional long documents. - ๐Ÿ”Œ [**PageIndex MCP**](https://pageindex.ai/mcp): Bring PageIndex into Claude, Cursor, or any MCP-enabled agent. Chat with long PDFs the reasoning-based, human-like way. +### ๐Ÿ“ข Recent Updates + +#### ๐Ÿ“ Articles: +* ๐Ÿงฉ [**โ€œPageIndex: Next-Gen Vectorless, Reasoning-based RAGโ€**](https://pageindex.ai/blog/pageindex-intro): Introduces the **PageIndex** framework โ€” an **agentic in-context index** that enables LLMs to perform **reasoning-based, human-like retrieval** over long documents, without vectors or chunking. +* ๐Ÿงพ [**โ€œDo We Still Need OCR?โ€**](https://pageindex.ai/blog/do-we-need-ocr): Explores how vision-based, reasoning-native RAG challenges the traditional OCR pipeline โ€” and why the future of document AI might be *vectorless* and *vision-based*. + +#### ๐Ÿงช **Cookbooks:** +* [**Vectorless RAG**](https://github.com/VectifyAI/PageIndex/blob/main/cookbook/pageindex_RAG_simple.ipynb): A minimal, hands-on example of reasoning-based RAG using **PageIndex** โ€” no vectors, no chunking, and human-like retrieval. +* [**Vision-based Vectorless RAG**](https://github.com/VectifyAI/PageIndex/blob/main/cookbook/vision_RAG_pageindex.ipynb): Experience OCR-free document understanding through PageIndexโ€™s visual retrieval workflow โ€” retrieving and reasoning directly over PDF page images. + + # ๐Ÿ“‘ Introduction to PageIndex Are you frustrated with vector database retrieval accuracy for long professional documents? Traditional vector-based RAG relies on semantic *similarity* rather than true *relevance*. But **similarity โ‰  relevance** โ€” what we truly need in retrieval is **relevance**, and that requires **reasoning**. When working with professional documents that demand domain expertise and multi-step reasoning, similarity search often falls short. @@ -162,7 +173,7 @@ python3 run_pageindex.py --md_path /path/to/your/document.md --- -# โ˜๏ธ Improved Tree Generation with PageIndex OCR + # ๐Ÿ“ˆ Case Study: Mafin 2.5 on FinanceBench