MetaGPT/expo
Yizhou Chi aea524b4ea 1. update readme
2. fix duration
2024-09-04 14:49:18 +08:00
..
evaluation update prompt (specify whether each set has target label) 2024-09-03 16:31:03 +08:00
experimenter 1. update readme 2024-09-04 14:49:18 +08:00
insights Rename insight generate to instruction generator 2024-09-02 09:59:47 +08:00
results add expo 2024-08-30 16:26:05 +08:00
data.yaml update readme 2024-09-03 13:40:23 +08:00
dataset.py update prompt (specify whether each set has target label) 2024-09-03 16:31:03 +08:00
datasets.yaml update readme 2024-09-03 13:40:23 +08:00
MCTS.py ensure experimenter not evaluating csv from other experiments 2024-09-04 10:41:10 +08:00
README.md 1. update readme 2024-09-04 14:49:18 +08:00
requirements.txt 1. 暂时在expo文件夹里单独放一个requirements.txt 2024-09-02 20:23:45 +08:00
research_assistant.py remove deprecated comments 2024-09-02 16:51:33 +08:00
run_exp_augmentation.py 1. Rewrite logger message 2024-09-02 10:21:30 +08:00
run_experiment.py add custom experimenter 2024-09-02 20:00:46 +08:00
run_mcts.py fix bug - make rollout more consistent 2024-08-30 20:35:17 +08:00
utils.py add experimenter 2024-08-30 19:55:40 +08:00

Expo

1. Data Preparation

2. Configs

Data Config

datasets.yaml 提供数据集对应的指标和基础提示词

data.yaml 继承了datasets.yaml以及一些路径信息,需要将datasets_dir指到数据集合集的根目录下

LLM Config

llm:
  api_type: 'openai'
  model: deepseek-coder
  base_url: "https://oneapi.deepwisdom.ai/v1"
  api_key: sk-xxx
  temperature: 0.5

Budget

实验轮次 k = 10, 20

提示词使用

通过执行dataset.py中的generate_task_requirement函数获取提示词

3. Evaluation

运行各个框架运行后框架需要提供Dev和Test的dev_predictions.csvtest_predictions.csv column name为target

  • 使用CustomExperimenter
experimenter = CustomExperimenter(task="titanic")
score_dict = experimenter.evaluate_pred_files(dev_pred_path, test_pred_path)

4. Baselines

DS Agent

提供github链接并说明使用的命令以及参数设置

AIDE

提供github链接并说明使用的命令以及参数设置

Autogluon

Setup

pip install -U pip
pip install -U setuptools wheel

CPU version of pytorch has smaller footprint - see installation instructions in
pytorch documentation - https://pytorch.org/get-started/locally/
pip install torch==2.3.1 torchvision==0.18.1 --index-url https://download.pytorch.org/whl/cpu

pip install autogluon

提供github链接并说明使用的命令以及参数设置

Base DI

For setup, check 5.

  • python run_experiment.py --exp_mode base --task titanic

DI RandomSearch

For setup, check 5.

  • Single insight python run_experiment.py --exp_mode aug --task titanic --aug_mode single

  • Set insight python run_experiment.py --exp_mode aug --task titanic --aug_mode set

5. DI MCTS

Run DI MCTS

Setup

In the root directory,

pip install -e .

cd expo

pip install -r requirements.txt

Run

  • python run_experiment.py --exp_mode mcts --task titanic --rollout 5

If the dataset has reg metric, remember to use --low_is_better:

  • python run_experiment.py --exp_mode mcts --task househouse_prices --rollout 5 --low_is_better