Commit graph

261 commits

Author SHA1 Message Date
Ray
a7a9985223 Update README 2026-03-27 03:55:07 +08:00
Ray
d7d5aed668 Update README 2026-03-27 03:21:20 +08:00
Ray
88ef448d1a Add agentic vectorless RAG example to README highlights 2026-03-27 02:31:40 +08:00
Ray
9798aaae19 Update demo example paper and polish README 2026-03-27 01:22:03 +08:00
Kylin
5d4491f3bf
Add PageIndexClient with agent-based retrieval via OpenAI Agents SDK (#125)
* Add PageIndexClient with retrieve, streaming support and litellm integration
* Add OpenAI agents demo example
* Update README with example agent demo section
* Support separate retrieve_model configuration for index and retrieve
2026-03-26 23:19:50 +08:00
Kylin
2403be8f27
Integrate LiteLLM for multi-provider LLM support (#168)
* Integrate litellm for multi-provider LLM support

* recover the default config yaml

* Use litellm.acompletion for native async support

* fix tob

* Rename llm_complete/allm_complete to llm_completion/llm_acompletion, remove unused llm_complete_stream

* Pin litellm to version 1.82.0

* resolve comments

* args from cli is used to overrides config.yaml

* Fix get_page_tokens hardcoded model default

Pass opt.model to get_page_tokens so tokenization respects the
configured model instead of always using gpt-4o-2024-11-20.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* Remove explicit openai dependency from requirements.txt

openai is no longer directly imported; it comes in as a transitive
dependency of litellm. Pinning it explicitly risks version conflicts.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* Restore openai==1.101.0 pin in requirements.txt

litellm==1.82.0 and openai-agents have conflicting openai version
requirements, but openai==1.101.0 works at runtime for both.
The pin is necessary to prevent litellm from pulling in openai>=2.x
which would break openai-agents.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* Remove explicit openai dependency from requirements.txt

openai is not directly used; it comes in as a transitive dependency
of litellm. No openai-agents in this branch so no pin needed.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* fix an litellm error log

* resolve comments

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-20 18:47:07 +08:00
Bukely_
4b4b20f9c4
Merge pull request #167 from VectifyAI/fix/list-index-shadowing
Fix list_index variable shadowing in fix_incorrect_toc
2026-03-16 14:20:32 +08:00
BukeLy
85f17f9955 Fix list_index variable shadowing in fix_incorrect_toc
The loop variable `list_index = page_index - start_index` was
overwriting the outer `list_index = incorrect_item['list_index']`,
causing results to be written back to wrong index positions.

Rename the loop variable to `page_list_idx` to avoid shadowing.

Closes #66
2026-03-16 14:19:51 +08:00
Bukely_
599d2ce497
Merge pull request #65 from luojiyin1987/fix/extract-toc-infinite-loop
fix: prevent infinite loop in extract_toc_content
2026-03-16 13:34:05 +08:00
Bukely_
b487f9d7c7
Merge pull request #63 from luojiyin1987/fix/api-error-return
fix: make ChatGPT_API_with_finish_reason return consistent tuple
2026-03-16 13:24:54 +08:00
Bukely_
959452d3cb
Merge pull request #142 from VectifyAI/fix/allow-all-users-dedupe
Allow all users to trigger issue dedup
2026-03-04 10:52:53 +08:00
BukeLy
8d36e1d4b6 Allow all users to trigger issue dedup via claude-code-action
Issues are opened by external users who don't have write permissions.
Add allowed_non_write_users: "*" so claude-code-action runs for all
issue authors, not just repo collaborators.
2026-03-04 10:51:15 +08:00
Bukely_
38d130aeca
Merge pull request #133 from VectifyAI/fix/allow-bot-trigger
Allow github-actions bot to trigger claude-code-action
2026-03-02 18:42:20 +08:00
BukeLy
813eb3546d Allow github-actions bot to trigger claude-code-action
Backfill workflow triggers issue-dedupe via gh workflow run, which
makes the actor github-actions. Add it to allowed_bots so
claude-code-action accepts the trigger.
2026-03-02 18:40:03 +08:00
Bukely_
f7d6f62f61
Merge pull request #132 from VectifyAI/fix/backfill-dedupe-pagination
Fix backfill-dedupe pagination: replace gh issue list with gh api
2026-03-02 18:31:15 +08:00
BukeLy
3d41a730f1 Fix backfill: replace gh issue list with gh api for pagination
gh issue list does not support --page flag. Switch to gh api with
temp file to handle JSON containing control characters in issue bodies.
2026-03-02 18:30:45 +08:00
Bukely_
30d7de64d4
Merge pull request #128 from VectifyAI/copilot/add-github-actions-setup
Add GitHub Actions automation for issue deduplication and auto-close
2026-03-02 18:06:59 +08:00
BukeLy
e388e1b8b3 Fix backfill pagination: use raw count instead of filtered count
The pagination loop was breaking early because it checked the count
of jq-filtered results rather than the raw API response count.
2026-03-02 18:01:34 +08:00
BukeLy
5fa180744d Fix issues from Copilot review: 403 retry, comments pagination, backfill pagination
- Only retry 403 when rate-limit headers indicate throttling, not permission errors
- Add fetchAllComments() with pagination for issues with 100+ comments
- Add pagination loop in backfill workflow to handle repos with 200+ open issues
2026-03-02 17:45:57 +08:00
BukeLy
7df8510bde Simplify scripts: unify bot detection, remove redundant API calls and TOCTOU checks 2026-03-02 17:23:33 +08:00
BukeLy
fd9330c434 Refactor issue dedup system to use claude-code-action with /dedupe command
Replace the copilot-generated inline search logic with a claude-code-action
based architecture inspired by anthropic/claude-code's approach:

- Add .claude/commands/dedupe.md with 5-parallel-search strategy
- Add scripts/comment-on-duplicates.sh with 3-day grace period warning
- Rewrite issue-dedupe.yml to use claude-code-action + /dedupe command
- Rewrite autoclose script to check bot comments, human activity, and thumbsdown
- Rewrite backfill to trigger dedupe workflow per issue with rate limiting
- Add concurrency control, timeout, input validation, and rate limit retry
- Remove gh.sh (unnecessary), backfill-dedupe.js (replaced by workflow trigger)
2026-03-02 17:05:44 +08:00
copilot-swe-agent[bot]
b3cb9531a4 Add GitHub Actions workflows for issue deduplication and auto-close
Co-authored-by: BukeLy <19304666+BukeLy@users.noreply.github.com>
2026-03-02 03:54:18 +00:00
copilot-swe-agent[bot]
f56261cee1 Initial plan 2026-03-02 03:42:51 +00:00
Matias Insaurralde
cf52a678a3
fix: rename tob_extractor_prompt typo to toc_extractor_prompt (#109)
Signed-off-by: Matías Insaurralde <matias@insaurral.de>
2026-02-27 15:16:19 +08:00
Marcos Gómez
89bcb9240a
Merge pull request #118 from mooncos/patch-1
Fix typo in header for the step: Extract JSON results
2026-02-27 15:15:53 +08:00
Mingtian Zhang
a061d53fa5
Update README.md 2026-02-10 14:31:30 +08:00
Ray
884209e8fd
Update README.md 2026-01-25 22:11:25 +08:00
Ray
8a4959d59c
Update README.md 2026-01-25 17:33:36 +08:00
Ray
f6695c11f7 update link 2026-01-24 14:09:21 +08:00
luojiyin
ac9ceaf2ee
fix: prevent infinite loop in extract_toc_content
The while loop exit condition used len(chat_history), but chat_history
was rebuilt every iteration with exactly 2 elements, making the check
len(chat_history) > 5 never true.

Replace with explicit attempt counter and max_attempts limit.
2026-01-19 12:34:39 +08:00
luojiyin
87962b4d42
fix: make ChatGPT_API_with_finish_reason return consistent tuple
Signed-off-by: luojiyin <luojiyin@hotmail.com>
2026-01-19 12:27:35 +08:00
Ray
823c11e642 Ignore notebooks for language stats 2026-01-08 14:05:16 +08:00
Ray
14a05acba1
Update README.md 2025-12-22 08:26:04 +08:00
Ray
db6c58d741
Update README.md 2025-12-21 01:15:33 +08:00
Ray
f869aba81c
Update README.md 2025-12-19 12:09:06 +08:00
Ray
dbdff103df
Update README.md 2025-12-19 11:53:31 +08:00
Ray
b365d6dcd2
Update README.md 2025-12-19 10:05:35 +08:00
Ray
e59f04a6b3
Update README.md 2025-12-19 05:06:46 +08:00
Ray
3161aee250
Update README.md 2025-12-19 03:50:44 +08:00
Ray
2ee7446e52
Update README.md 2025-12-06 03:18:04 +08:00
Ray
a7e97be6a7 Update README.md 2025-11-21 01:30:22 +08:00
Ray
1ae1a84d2a
Update README.md 2025-11-21 01:25:46 +08:00
Mingtian Zhang
4b2e74277b
Add cookbook file 2025-11-20 15:20:13 +08:00
Mingtian Zhang
4d8e019c47
Revise cloud service links and clean up README
Updated cloud service references and removed redundant sections.
2025-11-19 23:55:24 +08:00
Mingtian Zhang
d8b9f6350b
Revise README content and links
Updated links and descriptions in the README for clarity and accuracy.
2025-11-19 23:50:45 +08:00
Mingtian Zhang
8ed58e225e
Update recent releases in README.md 2025-11-19 21:48:49 +08:00
Ray
03bb108aab fix notebook 2025-11-18 00:51:43 +08:00
Ray
9c48a61040 add notebook 2025-11-17 21:43:25 +08:00
Ray
4baa4255a3
Update README.md 2025-11-13 03:08:26 +08:00
Ray
9dfcdae571
Update README.md 2025-11-13 03:01:09 +08:00