2. add support for datasets that have both train and test splits
3. create data folder
4. fix new instruction bug