From 1cdffc3d8550538195fce30cea4d913a24be9c04 Mon Sep 17 00:00:00 2001
From: Yizhou Chi <chiyizhou@fuzhi.ai>
Date: Sat, 14 Sep 2024 20:49:49 +0800
Subject: [PATCH] =?UTF-8?q?FE=20prompt:=20FE=E9=80=9A=E5=B8=B8=E4=B8=8D?=
 =?UTF-8?q?=E4=BC=9Amake=20changes=E8=80=8C=E6=98=AF=E5=8A=A0=E6=96=B0?=
 =?UTF-8?q?=E7=9A=84=E7=89=B9=E5=BE=81=20DI=20prompt:=20=E8=A6=81=E6=B1=82?=
 =?UTF-8?q?=E8=AE=A9predictions=E6=9C=80=E7=BB=88=E7=BB=93=E6=9E=9C?=
 =?UTF-8?q?=E4=B8=80=E8=87=B4=EF=BC=8C=E5=B9=B6=E6=8F=90=E4=BE=9B=E4=BE=8B?=
 =?UTF-8?q?=E5=AD=90?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 expo/data/dataset.py         | 1 +
 metagpt/prompts/task_type.py | 2 +-
 2 files changed, 2 insertions(+), 1 deletion(-)

diff --git a/expo/data/dataset.py b/expo/data/dataset.py
index 9748cb8c2..28bd26d2e 100644
--- a/expo/data/dataset.py
+++ b/expo/data/dataset.py
@@ -40,6 +40,7 @@ DI_INSTRUCTION = """
 1. Save the prediction results of BOTH the dev set and test set in `dev_predictions.csv` and `test_predictions.csv` respectively in the output directory. 
 - Both files should contain a single column named `target` with the predicted values.
 2. Make sure the prediction results are in the same format as the target column in the training set. 
+- For instance, if the target column is categorical, the prediction results should be categorical as well.
 
 ## Output Performance
 Print the train and dev set performance in the last step.
diff --git a/metagpt/prompts/task_type.py b/metagpt/prompts/task_type.py
index 6b230fc9e..599d437c5 100644
--- a/metagpt/prompts/task_type.py
+++ b/metagpt/prompts/task_type.py
@@ -26,7 +26,7 @@ The current task is about feature engineering. when performing it, please adhere
 - Avoid creating redundant or excessively numerous features in one step.
 - Exclude ID columns from feature generation and remove them.
 - Each feature engineering operation performed on the train set must also applies to the dev/test separately at the same time.
-- **ATTENTION** Do NOT use the label column to create features or make any changes to the label column, except for cat encoding.
+- **ATTENTION** Do NOT use the label column to create features, except for cat encoding.
 - Use the data from previous task result if exist, do not mock or reload data yourself.
 - Always copy the DataFrame before processing it and use the copy to process.
 """