PageIndex

mirror of https://github.com/VectifyAI/PageIndex.git synced 2026-05-12 16:22:37 +02:00

Author	SHA1	Message	Date
Xinyan Zhou	595895cf28	feat:compatible with Pageindex SDK (#238 ) * feat:compatible with Pageindex SDK * corner cases fixed * fix: mock behavior of old SDK * fix: close streaming response and warn on empty api_key - LegacyCloudAPI: close response in `finally` for both _stream_chat_response variants so abandoned iterators no longer leak the TCP connection. - PageIndexClient: emit a warning instead of silently falling back to local when api_key is the empty string, surfacing typical env-var-unset misconfig. - FakeResponse: add close()/closed to match the real requests.Response API. - Add unit coverage for stream close (both paths) and the empty-api_key warning. - Add scripts/e2e_legacy_sdk.py to smoke-test the legacy SDK contract end-to-end against api.pageindex.ai. * chore: mark legacy SDK methods with @deprecated and docstring pointers - Decorate the 12 PageIndexClient cloud-SDK compat methods with @typing_extensions.deprecated(..., category=PendingDeprecationWarning): - IDE/type-checkers render them with a strikethrough hint - runtime warnings stay silent by default (no spam for existing callers), surfaceable via `python -W default::PendingDeprecationWarning` - Add a one-line docstring on each pointing to the Collection-based equivalent. - Promote typing-extensions to a direct dependency (was transitive via litellm). --------- Co-authored-by: XinyanZhou <xinyanzhou@XinyanZhoudeMacBook-Pro.local> Co-authored-by: saccharin98 <xinyanzhou938@gmail.com> Co-authored-by: mountain <kose2livs@gmail.com>	2026-05-11 21:06:23 +08:00
Ray	6d29886892	chore: bump version to 0.3.0.dev1	2026-04-11 01:18:22 +08:00
Ray	edb203102a	fix: poll status=="completed" in cloud add_document (#226 ) The cloud backend previously polled tree_resp["retrieval_ready"] as the ready signal. Empirically this flag is not a reliable indicator — docs can reach status=="completed" without retrieval_ready flipping, causing col.add() to wait until the 10 min timeout before giving up on otherwise-successful uploads. The cloud API's canonical ready signal is status=="completed"; switch the poll to check that instead.	2026-04-11 01:16:48 +08:00
Ray	f5de9c9dbb	Add dist/ to .gitignore	2026-04-08 20:57:22 +08:00
Ray	27e671eefd	Update pyproject.toml: switch to poetry and bump to 0.3.0.dev0	2026-04-08 20:45:49 +08:00
Kylin	c7fe93bb56	feat: add PageIndex SDK with local/cloud dual-mode support (#207 )	2026-04-08 20:21:58 +08:00
Xinyan Zhou	f2dcffc0b7	Merge pull request #214 from VectifyAI/dependabot/pip/pip-480c85e8a1 Bump litellm from 1.82.0 to 1.83.0 in the pip group across 1 directory	2026-04-08 18:19:25 +08:00
dependabot[bot]	eb57d2a10c	Bump litellm from 1.82.0 to 1.83.0 in the pip group across 1 directory Bumps the pip group with 1 update in the / directory: [litellm](https://github.com/BerriAI/litellm). Updates `litellm` from 1.82.0 to 1.83.0 - [Release notes](https://github.com/BerriAI/litellm/releases) - [Commits](https://github.com/BerriAI/litellm/commits) --- updated-dependencies: - dependency-name: litellm dependency-version: 1.83.0 dependency-type: direct:production dependency-group: pip ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-03 22:16:59 +00:00
Ray	8f1ed7783b	Update README	2026-03-30 01:34:45 +08:00
Ray	0ba6206ef0	Update developer links	2026-03-29 20:01:58 +08:00
Ray	28542de889	Polish agent system prompt wording	2026-03-29 05:31:02 +08:00
Ray	54542f03e6	Merge pull request #197 from VectifyAI/polish/demo-docstring-and-pathlib Polish demo docstring and migrate to pathlib	2026-03-29 05:03:50 +08:00
Ray	ce9cbc2ed0	Polish demo docstring and migrate to pathlib	2026-03-29 04:56:27 +08:00
Ray	a108c021ae	Disable agent tracing and auto-add litellm/ prefix for retrieve_model * Disable agent tracing and auto-add litellm/ prefix for retrieve_model * Preserve supported retrieve_model prefixes * Remove temporary retrieve_model tests * Limit tracing disablement to demo execution	2026-03-29 00:55:57 +08:00
Ray	d50c293309	Simplify agentic vectorless RAG demo (#191 ) * Simplify and fix agentic RAG demo * Show labeled reasoning output in RAG demo * Comment out reasoning model settings by default	2026-03-28 09:42:46 +08:00
Ray	4002dc94de	Rename demo script and update README wording	2026-03-28 04:56:05 +08:00
Ray	77722838e1	Restructure examples directory and improve document storage (#189 ) * Consolidate tests/ into examples/documents/ * Add line_count and reorder structure keys * Lazy-load documents with _meta.json index * Update demo script and add pre-shipped workspace * Extract shared helpers for JSON reading and meta entry building	2026-03-28 04:28:59 +08:00
Ray	74e549a23a	Merge pull request #184 from VectifyAI/cleanup/simplify-root-directory Simplify root directory	2026-03-27 16:33:24 +08:00
Ray	a7a9985223	Update README	2026-03-27 03:55:07 +08:00
Ray	e5ac754828	Simplify root directory	2026-03-27 03:30:13 +08:00
Ray	d7d5aed668	Update README	2026-03-27 03:21:20 +08:00
Ray	88ef448d1a	Add agentic vectorless RAG example to README highlights	2026-03-27 02:31:40 +08:00
Ray	9798aaae19	Update demo example paper and polish README	2026-03-27 01:22:03 +08:00
Kylin	5d4491f3bf	Add PageIndexClient with agent-based retrieval via OpenAI Agents SDK (#125 ) * Add PageIndexClient with retrieve, streaming support and litellm integration * Add OpenAI agents demo example * Update README with example agent demo section * Support separate retrieve_model configuration for index and retrieve	2026-03-26 23:19:50 +08:00
Kylin	2403be8f27	Integrate LiteLLM for multi-provider LLM support (#168 ) * Integrate litellm for multi-provider LLM support * recover the default config yaml * Use litellm.acompletion for native async support * fix tob * Rename llm_complete/allm_complete to llm_completion/llm_acompletion, remove unused llm_complete_stream * Pin litellm to version 1.82.0 * resolve comments * args from cli is used to overrides config.yaml * Fix get_page_tokens hardcoded model default Pass opt.model to get_page_tokens so tokenization respects the configured model instead of always using gpt-4o-2024-11-20. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Remove explicit openai dependency from requirements.txt openai is no longer directly imported; it comes in as a transitive dependency of litellm. Pinning it explicitly risks version conflicts. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Restore openai==1.101.0 pin in requirements.txt litellm==1.82.0 and openai-agents have conflicting openai version requirements, but openai==1.101.0 works at runtime for both. The pin is necessary to prevent litellm from pulling in openai>=2.x which would break openai-agents. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * Remove explicit openai dependency from requirements.txt openai is not directly used; it comes in as a transitive dependency of litellm. No openai-agents in this branch so no pin needed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix an litellm error log * resolve comments --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 18:47:07 +08:00
Bukely_	4b4b20f9c4	Merge pull request #167 from VectifyAI/fix/list-index-shadowing Fix list_index variable shadowing in fix_incorrect_toc	2026-03-16 14:20:32 +08:00
BukeLy	85f17f9955	Fix list_index variable shadowing in fix_incorrect_toc The loop variable `list_index = page_index - start_index` was overwriting the outer `list_index = incorrect_item['list_index']`, causing results to be written back to wrong index positions. Rename the loop variable to `page_list_idx` to avoid shadowing. Closes #66	2026-03-16 14:19:51 +08:00
Bukely_	599d2ce497	Merge pull request #65 from luojiyin1987/fix/extract-toc-infinite-loop fix: prevent infinite loop in extract_toc_content	2026-03-16 13:34:05 +08:00
Bukely_	b487f9d7c7	Merge pull request #63 from luojiyin1987/fix/api-error-return fix: make ChatGPT_API_with_finish_reason return consistent tuple	2026-03-16 13:24:54 +08:00
Bukely_	959452d3cb	Merge pull request #142 from VectifyAI/fix/allow-all-users-dedupe Allow all users to trigger issue dedup	2026-03-04 10:52:53 +08:00
BukeLy	8d36e1d4b6	Allow all users to trigger issue dedup via claude-code-action Issues are opened by external users who don't have write permissions. Add allowed_non_write_users: "*" so claude-code-action runs for all issue authors, not just repo collaborators.	2026-03-04 10:51:15 +08:00
Bukely_	38d130aeca	Merge pull request #133 from VectifyAI/fix/allow-bot-trigger Allow github-actions bot to trigger claude-code-action	2026-03-02 18:42:20 +08:00
BukeLy	813eb3546d	Allow github-actions bot to trigger claude-code-action Backfill workflow triggers issue-dedupe via gh workflow run, which makes the actor github-actions. Add it to allowed_bots so claude-code-action accepts the trigger.	2026-03-02 18:40:03 +08:00
Bukely_	f7d6f62f61	Merge pull request #132 from VectifyAI/fix/backfill-dedupe-pagination Fix backfill-dedupe pagination: replace gh issue list with gh api	2026-03-02 18:31:15 +08:00
BukeLy	3d41a730f1	Fix backfill: replace gh issue list with gh api for pagination gh issue list does not support --page flag. Switch to gh api with temp file to handle JSON containing control characters in issue bodies.	2026-03-02 18:30:45 +08:00
Bukely_	30d7de64d4	Merge pull request #128 from VectifyAI/copilot/add-github-actions-setup Add GitHub Actions automation for issue deduplication and auto-close	2026-03-02 18:06:59 +08:00
BukeLy	e388e1b8b3	Fix backfill pagination: use raw count instead of filtered count The pagination loop was breaking early because it checked the count of jq-filtered results rather than the raw API response count.	2026-03-02 18:01:34 +08:00
BukeLy	5fa180744d	Fix issues from Copilot review: 403 retry, comments pagination, backfill pagination - Only retry 403 when rate-limit headers indicate throttling, not permission errors - Add fetchAllComments() with pagination for issues with 100+ comments - Add pagination loop in backfill workflow to handle repos with 200+ open issues	2026-03-02 17:45:57 +08:00
BukeLy	7df8510bde	Simplify scripts: unify bot detection, remove redundant API calls and TOCTOU checks	2026-03-02 17:23:33 +08:00
BukeLy	fd9330c434	Refactor issue dedup system to use claude-code-action with /dedupe command Replace the copilot-generated inline search logic with a claude-code-action based architecture inspired by anthropic/claude-code's approach: - Add .claude/commands/dedupe.md with 5-parallel-search strategy - Add scripts/comment-on-duplicates.sh with 3-day grace period warning - Rewrite issue-dedupe.yml to use claude-code-action + /dedupe command - Rewrite autoclose script to check bot comments, human activity, and thumbsdown - Rewrite backfill to trigger dedupe workflow per issue with rate limiting - Add concurrency control, timeout, input validation, and rate limit retry - Remove gh.sh (unnecessary), backfill-dedupe.js (replaced by workflow trigger)	2026-03-02 17:05:44 +08:00
copilot-swe-agent[bot]	b3cb9531a4	Add GitHub Actions workflows for issue deduplication and auto-close Co-authored-by: BukeLy <19304666+BukeLy@users.noreply.github.com>	2026-03-02 03:54:18 +00:00
copilot-swe-agent[bot]	f56261cee1	Initial plan	2026-03-02 03:42:51 +00:00
Matias Insaurralde	cf52a678a3	fix: rename tob_extractor_prompt typo to toc_extractor_prompt (#109 ) Signed-off-by: Matías Insaurralde <matias@insaurral.de>	2026-02-27 15:16:19 +08:00
Marcos Gómez	89bcb9240a	Merge pull request #118 from mooncos/patch-1 Fix typo in header for the step: Extract JSON results	2026-02-27 15:15:53 +08:00
Mingtian Zhang	a061d53fa5	Update README.md	2026-02-10 14:31:30 +08:00
Ray	884209e8fd	Update README.md	2026-01-25 22:11:25 +08:00
Ray	8a4959d59c	Update README.md	2026-01-25 17:33:36 +08:00
Ray	f6695c11f7	update link	2026-01-24 14:09:21 +08:00
luojiyin	ac9ceaf2ee	fix: prevent infinite loop in extract_toc_content The while loop exit condition used len(chat_history), but chat_history was rebuilt every iteration with exactly 2 elements, making the check len(chat_history) > 5 never true. Replace with explicit attempt counter and max_attempts limit.	2026-01-19 12:34:39 +08:00
luojiyin	87962b4d42	fix: make ChatGPT_API_with_finish_reason return consistent tuple Signed-off-by: luojiyin <luojiyin@hotmail.com>	2026-01-19 12:27:35 +08:00

1 2 3 4 5 ...

280 commits