Commit graph

4373 commits

Author SHA1 Message Date
didi
bdf865eb0d Merge branch 'main' of https://github.com/didiforgithub/MetaGPT-MathAI 2024-09-10 19:06:17 +08:00
didi
7f45ef6231 Merge branch 'main' of https://github.com/didiforgithub/MetaGPT-MathAI 2024-09-10 19:04:05 +08:00
Zhaoyang Yu
445a2e6048 Update QA 2024-09-10 18:59:13 +08:00
didi
ab112462a5 添加了cost计算 2024-09-10 18:51:18 +08:00
Zhaoyang Yu
257b994409 Update drop.py 2024-09-10 18:45:54 +08:00
Zhaoyang Yu
68e87da378 Update Hotpotqa 2024-09-10 18:27:20 +08:00
Zhaoyang Yu
4ce18d7f48 Update baselines 2024-09-10 16:51:26 +08:00
didi
0b0a49d772 更新 hotpotqa Baseline 2024-09-10 12:48:28 +08:00
didi
c7c34cda7d Update human Eval 2024-09-10 10:58:39 +08:00
didi
62ffa730e0 update humaneval baseline & hotpotqa baseline 2024-09-10 10:57:21 +08:00
didi
7ffe68b499 重构了Evaluator 2024-09-09 18:19:22 +08:00
didi
4e0a896bdc 提交baseline例子;修改context-fill 格式识别方式 2024-09-09 17:17:15 +08:00
didi
ca560a844f 合入Eval与Optimize
当前问题
1. eval 存在部分参数差异(path,csv测试)
2. optimize 尝试新流程(优化后的optimize曲线);optimize 模版书写
3. optimize 在各个数据集上跑通
4. 创建baseline folder
5. 创建experiment data收集方法
6. 从ags中移出
2024-09-08 23:04:01 +08:00
didi
c3903412b4 Update Operator Optimize Method. 2024-09-02 16:47:03 +08:00
didi
d97f90f9c7 Update 2024-08-26 08:40:10 +08:00
didi
7c2501e08b Update 2024-08-26 08:30:24 +08:00
didi
6a01a679ce test for multi llm 2024-08-25 22:17:27 +08:00
didi
1593e98c45 Update Multi LLM Config & Basic Evaluator 2024-08-25 22:16:39 +08:00
didi
a3ff25430e Update ConText Fill 2024-08-23 20:43:29 +08:00
didi
02c7c4ea47 Update 混乱版本 2024-08-23 14:21:04 +08:00
didi
2937d9a13a Update 粗糙版本 优化 2024-08-23 13:57:35 +08:00
Zhaoyang Yu
c9989c069e Update llm.py 2024-08-15 20:21:07 +08:00
didi
008c5f0f1f Update 2024-08-06 10:53:12 +08:00
didi
47470fb74c Update 2024-08-01 17:00:04 +08:00
didi
bdfa6eb512 Update 2024-08-01 14:56:42 +08:00
didi
3fc3d217a8 Update 2024-07-29 22:00:07 +08:00
didi
eac4b6c3e6 Update GitNore 2024-07-27 01:57:06 +08:00
didi
772d2aea56 Update 2024-07-25 10:47:17 +08:00
didi
ca1c8f8c5c Update 2024-07-22 15:27:07 +08:00
didi
89b0c4ce30 Update 2024-07-17 23:08:41 +08:00
didi
e0955c5bf9 Update Sota baseline 2024-07-16 10:15:35 +08:00
didi
8a241054c7 Update 2024-07-14 09:12:33 +08:00
didi
7fa68d5649 Update operator.py 2024-07-11 19:59:21 +08:00
didi
eb97b54a20 Update 2024-07-11 16:18:34 +08:00
didi
4af2315c77 Update humaneval 2024-07-10 16:23:38 +08:00
didi
86033a1037 Update 2024-07-09 14:51:27 +08:00
didi
aeac3fe3f9 Update AGS 2024-07-04 14:31:13 +08:00
didi
4d376649cc update ensemble example 2024-07-01 16:56:47 +08:00
didi
f1ce1330d7 Update he 2024-07-01 14:16:35 +08:00
didi
fd432fa132 Update 2024-06-29 16:42:24 +08:00
didi
8f1cf58af2 Update AGS 2024-06-29 16:33:07 +08:00
better629
9f8f0a27fd
Merge pull request #1324 from usamimeri/for_config
optimize error message when validating api_key failed
2024-06-22 14:59:01 +08:00
usamimeri_renko
76b009513a
Update llm_config.py 2024-06-22 14:57:47 +08:00
better629
116d3e96f5
Merge pull request #1360 from usamimeri/for_qianfan
update qianfan and dashscope version
2024-06-22 14:47:05 +08:00
better629
5fbc5b7688
Merge pull request #1284 from usamimeri/update_qwen
update qwen token count
2024-06-22 14:39:54 +08:00
usamimeri_renko
cb89295db8
Update requirements.txt 2024-06-20 17:05:23 +08:00
usamimeri_renko
4797c91168
Update latest qwen price and max token 2024-06-20 16:33:54 +08:00
usamimeri_renko
b9c6c91cfe update qianfan version 2024-06-20 10:28:26 +08:00
Alexander Wu
38cea1daf2
Merge pull request #1338 from Stitch-z/main
fix: 修复父类role代码改动而影响到子类role-教程助手功能的问题
2024-06-12 15:16:40 +08:00
Stitch-z
c9e45c6e88
Merge pull request #17 from Stitch-z/feature-tutorial-assistant
fix: 修复父类role代码改动而影响到子类role-教程助手功能的问题
2024-06-12 15:08:17 +08:00