fix notebook

This commit is contained in:
Ray 2025-08-22 18:30:59 +08:00
parent 95e2cfa892
commit 48cbf4e93b

View file

@ -43,15 +43,15 @@
"metadata": {}, "metadata": {},
"source": [ "source": [
"## PageIndex Introduction\n", "## PageIndex Introduction\n",
"PageIndex is a new **vectorless RAG** framework. It conduct retrieval in two steps: \n", "PageIndex is a new **reasoning-based**, **vectorless RAG** framework that performs retrieval in two steps: \n",
"1. Generate a tree structure to index documents \n", "1. Generate a tree structure index of documents \n",
"2. Perform reasoning-based retrieval through tree search \n", "2. Perform reasoning-based retrieval through tree search \n",
"\n", "\n",
"<div align=\"center\">\n", "<div align=\"center\">\n",
" <img src=\"https://pageindex.ai/static/images/vectorless_rag_workflow.png\" width=\"70%\">\n", " <img src=\"https://pageindex.ai/static/images/vectorless_rag_workflow.png\" width=\"70%\">\n",
"</div>\n", "</div>\n",
"\n", "\n",
"Compared to classic vector-based RAG, PageIndex features:\n", "Compared to traditional vector-based RAG, PageIndex features:\n",
"- **No Vectors Needed**: Uses document structure and LLM reasoning for retrieval.\n", "- **No Vectors Needed**: Uses document structure and LLM reasoning for retrieval.\n",
"- **No Chunking Needed**: Documents are organized into natural sections rather than artificial chunks.\n", "- **No Chunking Needed**: Documents are organized into natural sections rather than artificial chunks.\n",
"- **Human-like Retrieval**: Simulates how human experts navigate and extract knowledge from complex documents. \n", "- **Human-like Retrieval**: Simulates how human experts navigate and extract knowledge from complex documents. \n",
@ -64,7 +64,7 @@
"source": [ "source": [
"## 📝 Notebook Overview\n", "## 📝 Notebook Overview\n",
"\n", "\n",
"This notebook demonstrates a simple example of **vectorless RAG** with PageIndex through the following steps:\n", "This notebook demonstrates a simple example of **vectorless RAG** with PageIndex. You will learn how to:\n",
"- [x] Build a PageIndex tree structure of a document\n", "- [x] Build a PageIndex tree structure of a document\n",
"- [x] Perform reasoning-based retrieval with tree search\n", "- [x] Perform reasoning-based retrieval with tree search\n",
"- [x] Generate answers based on the retrieved context\n", "- [x] Generate answers based on the retrieved context\n",
@ -201,7 +201,7 @@
"id": "Mzd1VWjwMUJL" "id": "Mzd1VWjwMUJL"
}, },
"source": [ "source": [
"#### 1.1 Submit a document with PageIndex SDK" "#### 1.1 Submit a document for generating PageIndex tree"
] ]
}, },
{ {
@ -225,7 +225,7 @@
} }
], ],
"source": [ "source": [
"# You can also use our GitHub repo to generate PageIndex structure\n", "# You can also use our GitHub repo to generate PageIndex tree\n",
"# https://github.com/VectifyAI/PageIndex\n", "# https://github.com/VectifyAI/PageIndex\n",
"\n", "\n",
"pdf_url = \"https://arxiv.org/pdf/2501.12948.pdf\"\n", "pdf_url = \"https://arxiv.org/pdf/2501.12948.pdf\"\n",