didi
|
56d0af1e9e
|
pre-commit modify
|
2024-10-22 12:37:08 +08:00 |
|
didi
|
462b7d9fd9
|
Update README.md
|
2024-10-22 11:45:40 +08:00 |
|
didi
|
2ccee333db
|
Update Readme
|
2024-10-22 11:44:14 +08:00 |
|
didi
|
5775b20024
|
Update Readme
|
2024-10-22 11:41:59 +08:00 |
|
didi
|
e575b629a1
|
Resolve comment and modify readme
|
2024-10-22 11:41:45 +08:00 |
|
didi
|
5aa62b76ce
|
Update
|
2024-10-22 11:08:08 +08:00 |
|
didi
|
fcc5e19160
|
mv aflow from example to ext
|
2024-10-22 10:54:06 +08:00 |
|
didi
|
0b69ffe198
|
Update download data.py and rm json files
|
2024-10-22 10:41:14 +08:00 |
|
didi
|
66b523953d
|
Update llm.py & handle exception
|
2024-10-22 10:28:23 +08:00 |
|
didi
|
828d187681
|
change context_fill to xml_fill
|
2024-10-22 09:34:02 +08:00 |
|
didi
|
35acb98d7a
|
Update optimizer.py
|
2024-10-21 23:26:26 +08:00 |
|
didi
|
d8c7174fc0
|
Update HotpotQA's init round
|
2024-10-21 23:13:29 +08:00 |
|
didi
|
23eec00b00
|
Update operator.py
|
2024-10-21 23:11:48 +08:00 |
|
didi
|
2d1d7ca219
|
Update Operator & Benchmark
|
2024-10-21 23:08:51 +08:00 |
|
Zhaoyang Yu
|
fe3fca514a
|
Create download_data.py
Need to run the script in the aflow/data path
|
2024-10-21 16:34:48 +08:00 |
|
didi
|
c194415b35
|
Update mbpp & math's eval
|
2024-10-21 12:50:17 +08:00 |
|
didi
|
efa00f8bbb
|
Update
|
2024-10-21 12:46:17 +08:00 |
|
didi
|
ade10684b7
|
Update Operator's code
|
2024-10-21 11:27:50 +08:00 |
|
Zhaoyang Yu
|
6ebf3c47c2
|
Update drop.py
Change comments into English, fix the in/out params' type.
fix too many values to unpack in line 140.
Unify the quotes.
Remove "if" at line 148
|
2024-10-19 11:40:27 +08:00 |
|
didi
|
17f3cd4955
|
Refactor Evaluator
|
2024-10-19 07:41:59 +08:00 |
|
didi
|
5d6fa7a68f
|
Update readme.md
|
2024-10-19 07:35:44 +08:00 |
|
didi
|
ebcacdd648
|
Update print error
|
2024-10-18 13:57:01 +08:00 |
|
didi
|
2b788b21f6
|
Update Annotation to English, And Update Operator.json
|
2024-10-18 13:50:51 +08:00 |
|
better629
|
d99054ab5e
|
Merge branch 'main' into main
|
2024-10-17 16:25:31 +08:00 |
|
didi
|
6aedc4a068
|
Update AFlow
|
2024-10-17 15:47:09 +08:00 |
|
Zhaoyang Yu
|
cea3473002
|
Update evaluator.py
change the data path of DROP
|
2024-10-16 20:19:19 +08:00 |
|
Zhaoyang Yu
|
859ee3d2e3
|
fix test()
|
2024-10-16 20:15:15 +08:00 |
|
Zhaoyang Yu
|
390b65fda3
|
Update HotpotQA
|
2024-10-16 19:55:10 +08:00 |
|
didi
|
eea94865ad
|
Update Eval
|
2024-10-16 12:06:34 +08:00 |
|
didi
|
bb229f2319
|
Update AFlolw
|
2024-10-16 11:49:18 +08:00 |
|
didi
|
eae351466f
|
Update AFlow
|
2024-10-16 11:44:01 +08:00 |
|
femto
|
a7efa27ce0
|
rm
|
2024-10-11 16:48:08 +08:00 |
|
didi
|
040a7324eb
|
Update test_curve.py
|
2024-09-26 21:56:54 +08:00 |
|
didi
|
e8f6186a56
|
update
|
2024-09-26 20:06:57 +08:00 |
|
didi
|
f14830b16a
|
Update optimizer.py
|
2024-09-25 16:47:12 +08:00 |
|
didi
|
8dfe2de34c
|
Update
|
2024-09-25 16:46:20 +08:00 |
|
didi
|
6a84a9d49b
|
Update HumanEval Eval
|
2024-09-24 19:28:03 +08:00 |
|
didi
|
c7f44e956d
|
Update
|
2024-09-22 21:45:53 +08:00 |
|
didi
|
99a9f7b6e9
|
update humaneval data path and add baseline data
|
2024-09-22 19:35:06 +08:00 |
|
didi
|
e3bcedc298
|
Update Benchmark's data
|
2024-09-22 15:54:32 +08:00 |
|
didi
|
22e8f9d7fc
|
Update baseline and benchmark; update evaluator
|
2024-09-22 15:46:50 +08:00 |
|
didi
|
63f3f884c9
|
Update for fengwei
|
2024-09-16 18:13:30 +08:00 |
|
didi
|
53890a5f86
|
更新了HotpotQA BenchMark 代码与对应的Self Consistency 实现
|
2024-09-13 12:56:18 +08:00 |
|
didi
|
0704f341de
|
更新了eval索引的入口
|
2024-09-11 17:53:52 +08:00 |
|
didi
|
b805da0bbe
|
更新了eval BUG,同时更新了新的baseline
|
2024-09-11 17:00:14 +08:00 |
|
didi
|
b9a2d94da2
|
更新了xml-compile方法,更新了剩余Baseline
|
2024-09-11 15:21:33 +08:00 |
|
Zhaoyang Yu
|
f691c5f439
|
Update QA
|
2024-09-10 21:51:38 +08:00 |
|
didi
|
bdf865eb0d
|
Merge branch 'main' of https://github.com/didiforgithub/MetaGPT-MathAI
|
2024-09-10 19:06:17 +08:00 |
|
didi
|
7f45ef6231
|
Merge branch 'main' of https://github.com/didiforgithub/MetaGPT-MathAI
|
2024-09-10 19:04:05 +08:00 |
|
Zhaoyang Yu
|
445a2e6048
|
Update QA
|
2024-09-10 18:59:13 +08:00 |
|