From 1cdffc3d8550538195fce30cea4d913a24be9c04 Mon Sep 17 00:00:00 2001
From: Yizhou Chi
Date: Sat, 14 Sep 2024 20:49:49 +0800
Subject: [PATCH] FE prompt: FE usually adds new features rather than making
 changes; DI prompt: require the final prediction results to be consistent,
 and provide an example
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 expo/data/dataset.py         | 1 +
 metagpt/prompts/task_type.py | 2 +-
 2 files changed, 2 insertions(+), 1 deletion(-)

diff --git a/expo/data/dataset.py b/expo/data/dataset.py
index 9748cb8c2..28bd26d2e 100644
--- a/expo/data/dataset.py
+++ b/expo/data/dataset.py
@@ -40,6 +40,7 @@ DI_INSTRUCTION = """
 1. Save the prediction results of BOTH the dev set and test set in `dev_predictions.csv` and `test_predictions.csv` respectively in the output directory.
 - Both files should contain a single column named `target` with the predicted values.
 2. Make sure the prediction results are in the same format as the target column in the training set.
+- For instance, if the target column is categorical, the prediction results should be categorical as well.
 
 ## Output Performance
 Print the train and dev set performance in the last step.
diff --git a/metagpt/prompts/task_type.py b/metagpt/prompts/task_type.py
index 6b230fc9e..599d437c5 100644
--- a/metagpt/prompts/task_type.py
+++ b/metagpt/prompts/task_type.py
@@ -26,7 +26,7 @@ The current task is about feature engineering. when performing it, please adhere
 - Avoid creating redundant or excessively numerous features in one step.
 - Exclude ID columns from feature generation and remove them.
 - Each feature engineering operation performed on the train set must also applies to the dev/test separately at the same time.
-- **ATTENTION** Do NOT use the label column to create features or make any changes to the label column, except for cat encoding.
+- **ATTENTION** Do NOT use the label column to create features, except for cat encoding.
 - Use the data from previous task result if exist, do not mock or reload data yourself.
 - Always copy the DataFrame before processing it and use the copy to process.
 """