mirror of
https://github.com/VectifyAI/PageIndex.git
synced 2026-04-25 08:06:22 +02:00
Simplify root directory
This commit is contained in:
parent
d7d5aed668
commit
e5ac754828
10 changed files with 4 additions and 20 deletions
37
examples/tutorials/doc-search/metadata.md
Normal file
37
examples/tutorials/doc-search/metadata.md
Normal file
|
|
@ -0,0 +1,37 @@
|
|||
|
||||
|
||||
## Document Search by Metadata
|
||||
<callout>PageIndex with metadata support is in closed beta. Fill out this form to request early access to this feature.</callout>
|
||||
|
||||
For documents that can be easily distinguished by metadata, we recommend using metadata to search the documents.
|
||||
This method is ideal for the following document types:
|
||||
- Financial reports categorized by company and time period
|
||||
- Legal documents categorized by case type
|
||||
- Medical records categorized by patient or condition
|
||||
- And many others
|
||||
|
||||
In such cases, you can search documents by leveraging their metadata. A popular method is to use "Query to SQL" for document retrieval.
|
||||
|
||||
|
||||
### Example Pipeline
|
||||
|
||||
#### PageIndex Tree Generation
|
||||
Upload all documents into PageIndex to get their `doc_id`.
|
||||
|
||||
#### Set up SQL tables
|
||||
|
||||
Store documents along with their metadata and the PageIndex `doc_id` in a database table.
|
||||
|
||||
#### Query to SQL
|
||||
|
||||
Use an LLM to transform a user’s retrieval request into a SQL query to fetch relevant documents.
|
||||
|
||||
#### Retrieve with PageIndex
|
||||
|
||||
Use the PageIndex `doc_id` of the retrieved documents to perform further retrieval via the PageIndex retrieval API.
|
||||
|
||||
## 💬 Help & Community
|
||||
Contact us if you need any advice on conducting document searches for your use case.
|
||||
|
||||
- 🤝 [Join our Discord](https://discord.gg/VuXuf29EUj)
|
||||
- 📨 [Leave us a message](https://ii2abc2jejf.typeform.com/to/meB40zV0)
|
||||
Loading…
Add table
Add a link
Reference in a new issue