PageIndex

mirror of https://github.com/VectifyAI/PageIndex.git synced 2026-07-03 20:41:02 +02:00

Author	SHA1	Message	Date
Shreyansh Dubey	f413c66fee	fix: prevent KeyError crash and context exhaustion in TOC processing (#188 ) * fix: prevent KeyError crash and context exhaustion in TOC processing - Use .get() with safe defaults for all LLM response dict accesses - Optimize extract_toc_content retry loop to grow chat_history incrementally instead of rebuilding with full accumulated response - Optimize toc_transformer retry loop to use chat_history instead of re-embedding the entire raw TOC and incomplete JSON in each prompt - Return best-effort results on max retries instead of raising - Add 14 mock-based tests covering all fix scenarios Closes #163 * fix: address review feedback on retry behavior and None guard - Restore explicit Exception on max retries instead of silent warning - Move truncation logic before the retry loop so it only runs once on the initial incomplete response, not on every iteration - Add explicit None guard for physical_index before passing to convert_physical_index_to_int to prevent potential TypeError - Update test to expect Exception on max retries --------- Co-authored-by: Your Name <you@example.com>	2026-07-03 20:20:31 +08:00
dependabot[bot]	076dd07bd7	Bump actions/checkout from 4 to 7 (#338 ) Bumps [actions/checkout](https://github.com/actions/checkout) from 4 to 7. - [Release notes](https://github.com/actions/checkout/releases) - [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md) - [Commits](https://github.com/actions/checkout/compare/v4...v7) --- updated-dependencies: - dependency-name: actions/checkout dependency-version: '7' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-07-03 17:24:52 +08:00
Kylin	27f01e9e7d	fix: bump litellm to 1.84.0 to resolve python-dotenv install conflict (#342 ) litellm 1.83.7 hard-pins python-dotenv==1.0.1, which conflicts with python-dotenv==1.2.2 in requirements.txt and makes a fresh install fail (ResolutionImpossible under both pip 25.2 and uv). Downgrading dotenv to 1.0.1 is not an option: python-dotenv < 1.2.2 is affected by GHSA-mf9w-mj56-hr94 (moderate). litellm 1.84.0 relaxed its pin to python-dotenv<2.0,>=1.0.0, allowing the patched python-dotenv==1.2.2 to remain. Full requirements.txt resolves cleanly under both pip and uv. Closes #286	2026-07-03 16:40:28 +08:00
Mouli	2cf46689f9	Adds missing re import (#281 )	2026-07-03 16:14:58 +08:00
Ray	293730afbd	edit readme (#337 )	2026-06-23 03:56:30 +08:00
Ray	42aa805339	edit readme (#336 )	2026-06-23 01:11:20 +08:00
Ray	54346716bd	edit readme (#335 )	2026-06-22 23:28:08 +08:00
Ray	fe89f246f2	update readme	2026-06-19 07:44:04 +08:00
Ray	5a18553284	update readme	2026-06-06 06:08:19 +08:00
Ray	415288b4b2	update readme	2026-06-05 01:15:23 +08:00
Ray	4d4d14a38a	update readme	2026-06-05 00:47:28 +08:00
Ray	7d2bdb9f28	update readme	2026-06-02 01:16:56 +08:00
Ray	c13eed7d6c	Tighten FinanceBench sentence in README	2026-06-02 00:50:02 +08:00
Ray	f21c90fa2b	Update README: Connect with Us buttons and header tagline - Add Website and Book a Demo buttons; reorder and recolor to brand palette - Replace removed/invalid shields logos (LinkedIn, envelope) and the Website icon with inline white SVGs so all badges show an icon - Reword header tagline (No Vector DB, No Chunking; Context-Aware Retrieval; Human-like) - Rename top-nav Homepage -> Website	2026-06-01 19:19:53 +08:00
Ray	dd064dc39a	Update README (#307 )	2026-05-30 18:51:40 +08:00
Ray	aad68cac7d	Update README (#305 )	2026-05-30 18:38:11 +08:00
Ray	7592163e2a	Update README (#271 ) Some checks failed CodeQL / Analyze (actions) (push) Has been cancelled	2026-05-12 03:28:29 +08:00
Ray	f50e529753	update README (#262 ) Some checks failed CodeQL / Analyze (actions) (push) Has been cancelled	2026-05-08 02:08:33 +08:00
Ray	c1a0f94fd3	update README (#261 )	2026-05-08 01:57:33 +08:00
Ray	e7dfc5e1ff	update README (#259 )	2026-05-07 20:57:14 +08:00
Ray	dcda5656ba	Fix Agentic RAG entry formatting in Updates Some checks failed CodeQL / Analyze (actions) (push) Has been cancelled	2026-05-06 04:53:41 +08:00
Ray	495e8929b5	Trim Cloud Service note in README	2026-05-06 04:40:32 +08:00
Ray	46244aed33	update README Some checks are pending CodeQL / Analyze (actions) (push) Waiting to run	2026-05-06 00:57:25 +08:00
Bukely_	a51d97f63c	Add security CI workflows (#248 ) Some checks failed CodeQL / Analyze (actions) (push) Has been cancelled * Add security CI workflows * Remove duplicate Python CodeQL workflow	2026-04-25 00:46:01 +08:00
dependabot[bot]	40073375ff	Bump the pip group across 1 directory with 2 updates (#247 ) Bumps the pip group with 2 updates in the / directory: [litellm](https://github.com/BerriAI/litellm) and [python-dotenv](https://github.com/theskumar/python-dotenv). Updates `litellm` from 1.83.0 to 1.83.7 - [Release notes](https://github.com/BerriAI/litellm/releases) - [Commits](https://github.com/BerriAI/litellm/commits) Updates `python-dotenv` from 1.1.0 to 1.2.2 - [Release notes](https://github.com/theskumar/python-dotenv/releases) - [Changelog](https://github.com/theskumar/python-dotenv/blob/main/CHANGELOG.md) - [Commits](https://github.com/theskumar/python-dotenv/compare/v1.1.0...v1.2.2) --- updated-dependencies: - dependency-name: litellm dependency-version: 1.83.7 dependency-type: direct:production dependency-group: pip - dependency-name: python-dotenv dependency-version: 1.2.2 dependency-type: direct:production dependency-group: pip ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-25 00:19:47 +08:00
Bukely_	29b240a689	Add Dependabot config for GitHub Actions updates (#241 ) Weekly scan to open PRs upgrading third-party actions used in CI. pip dependencies remain pinned and are covered by Dependabot security updates separately.	2026-04-23 23:59:26 +08:00
Xinyan Zhou	f2dcffc0b7	Merge pull request #214 from VectifyAI/dependabot/pip/pip-480c85e8a1 Bump litellm from 1.82.0 to 1.83.0 in the pip group across 1 directory	2026-04-08 18:19:25 +08:00
dependabot[bot]	eb57d2a10c	Bump litellm from 1.82.0 to 1.83.0 in the pip group across 1 directory Bumps the pip group with 1 update in the / directory: [litellm](https://github.com/BerriAI/litellm). Updates `litellm` from 1.82.0 to 1.83.0 - [Release notes](https://github.com/BerriAI/litellm/releases) - [Commits](https://github.com/BerriAI/litellm/commits) --- updated-dependencies: - dependency-name: litellm dependency-version: 1.83.0 dependency-type: direct:production dependency-group: pip ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-03 22:16:59 +00:00
Ray	8f1ed7783b	Update README	2026-03-30 01:34:45 +08:00
Ray	0ba6206ef0	Update developer links	2026-03-29 20:01:58 +08:00
Ray	28542de889	Polish agent system prompt wording	2026-03-29 05:31:02 +08:00
Ray	54542f03e6	Merge pull request #197 from VectifyAI/polish/demo-docstring-and-pathlib Polish demo docstring and migrate to pathlib	2026-03-29 05:03:50 +08:00
Ray	ce9cbc2ed0	Polish demo docstring and migrate to pathlib	2026-03-29 04:56:27 +08:00
Ray	a108c021ae	Disable agent tracing and auto-add litellm/ prefix for retrieve_model * Disable agent tracing and auto-add litellm/ prefix for retrieve_model * Preserve supported retrieve_model prefixes * Remove temporary retrieve_model tests * Limit tracing disablement to demo execution	2026-03-29 00:55:57 +08:00
Ray	d50c293309	Simplify agentic vectorless RAG demo (#191 ) * Simplify and fix agentic RAG demo * Show labeled reasoning output in RAG demo * Comment out reasoning model settings by default	2026-03-28 09:42:46 +08:00
Ray	4002dc94de	Rename demo script and update README wording	2026-03-28 04:56:05 +08:00
Ray	77722838e1	Restructure examples directory and improve document storage (#189 ) * Consolidate tests/ into examples/documents/ * Add line_count and reorder structure keys * Lazy-load documents with _meta.json index * Update demo script and add pre-shipped workspace * Extract shared helpers for JSON reading and meta entry building	2026-03-28 04:28:59 +08:00
Ray	74e549a23a	Merge pull request #184 from VectifyAI/cleanup/simplify-root-directory Simplify root directory	2026-03-27 16:33:24 +08:00
Ray	a7a9985223	Update README	2026-03-27 03:55:07 +08:00
Ray	e5ac754828	Simplify root directory	2026-03-27 03:30:13 +08:00
Ray	d7d5aed668	Update README	2026-03-27 03:21:20 +08:00
Ray	88ef448d1a	Add agentic vectorless RAG example to README highlights	2026-03-27 02:31:40 +08:00
Ray	9798aaae19	Update demo example paper and polish README	2026-03-27 01:22:03 +08:00
Kylin	5d4491f3bf	Add PageIndexClient with agent-based retrieval via OpenAI Agents SDK (#125 ) * Add PageIndexClient with retrieve, streaming support and litellm integration * Add OpenAI agents demo example * Update README with example agent demo section * Support separate retrieve_model configuration for index and retrieve	2026-03-26 23:19:50 +08:00
Kylin	2403be8f27	Integrate LiteLLM for multi-provider LLM support (#168 ) * Integrate litellm for multi-provider LLM support * recover the default config yaml * Use litellm.acompletion for native async support * fix tob * Rename llm_complete/allm_complete to llm_completion/llm_acompletion, remove unused llm_complete_stream * Pin litellm to version 1.82.0 * resolve comments * args from cli is used to overrides config.yaml * Fix get_page_tokens hardcoded model default Pass opt.model to get_page_tokens so tokenization respects the configured model instead of always using gpt-4o-2024-11-20. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Remove explicit openai dependency from requirements.txt openai is no longer directly imported; it comes in as a transitive dependency of litellm. Pinning it explicitly risks version conflicts. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Restore openai==1.101.0 pin in requirements.txt litellm==1.82.0 and openai-agents have conflicting openai version requirements, but openai==1.101.0 works at runtime for both. The pin is necessary to prevent litellm from pulling in openai>=2.x which would break openai-agents. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Remove explicit openai dependency from requirements.txt openai is not directly used; it comes in as a transitive dependency of litellm. No openai-agents in this branch so no pin needed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix an litellm error log * resolve comments --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 18:47:07 +08:00
Bukely_	4b4b20f9c4	Merge pull request #167 from VectifyAI/fix/list-index-shadowing Fix list_index variable shadowing in fix_incorrect_toc	2026-03-16 14:20:32 +08:00
BukeLy	85f17f9955	Fix list_index variable shadowing in fix_incorrect_toc The loop variable `list_index = page_index - start_index` was overwriting the outer `list_index = incorrect_item['list_index']`, causing results to be written back to wrong index positions. Rename the loop variable to `page_list_idx` to avoid shadowing. Closes #66	2026-03-16 14:19:51 +08:00
Bukely_	599d2ce497	Merge pull request #65 from luojiyin1987/fix/extract-toc-infinite-loop fix: prevent infinite loop in extract_toc_content	2026-03-16 13:34:05 +08:00
Bukely_	b487f9d7c7	Merge pull request #63 from luojiyin1987/fix/api-error-return fix: make ChatGPT_API_with_finish_reason return consistent tuple	2026-03-16 13:24:54 +08:00
Bukely_	959452d3cb	Merge pull request #142 from VectifyAI/fix/allow-all-users-dedupe Allow all users to trigger issue dedup	2026-03-04 10:52:53 +08:00

1 2 3 4 5 ...

300 commits