PageIndex/examples/tutorials/doc-search/metadata.md
2026-03-27 03:30:13 +08:00

37 lines
No EOL
1.3 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

## Document Search by Metadata
<callout>PageIndex with metadata support is in closed beta. Fill out this form to request early access to this feature.</callout>
For documents that can be easily distinguished by metadata, we recommend using metadata to search the documents.
This method is ideal for the following document types:
- Financial reports categorized by company and time period
- Legal documents categorized by case type
- Medical records categorized by patient or condition
- And many others
In such cases, you can search documents by leveraging their metadata. A popular method is to use "Query to SQL" for document retrieval.
### Example Pipeline
#### PageIndex Tree Generation
Upload all documents into PageIndex to get their `doc_id`.
#### Set up SQL tables
Store documents along with their metadata and the PageIndex `doc_id` in a database table.
#### Query to SQL
Use an LLM to transform a users retrieval request into a SQL query to fetch relevant documents.
#### Retrieve with PageIndex
Use the PageIndex `doc_id` of the retrieved documents to perform further retrieval via the PageIndex retrieval API.
## 💬 Help & Community
Contact us if you need any advice on conducting document searches for your use case.
- 🤝 [Join our Discord](https://discord.gg/VuXuf29EUj)
- 📨 [Leave us a message](https://ii2abc2jejf.typeform.com/to/meB40zV0)