Merge pull request #1044 from Yevanchen/swebench_di

Add sklearn68 dataset
Sirui Hong 2024-03-19 19:26:52 +08:00 committed by GitHub
commit f964e31fbc
2 changed files with 14 additions and 0 deletions

@@ -3,6 +3,12 @@ # Dataset Description
The sub_swebench index is a subset of swebench with two columns, each containing 50 instance IDs (id_instance).
These instance IDs form a balanced sample of CognitionAI's pass and fail results on swebench.
The scikit-learn-68 index is another subset of CognitionAI's swebench results (all scikit-learn tasks), also with two columns:
- pass: 12
- fail: 56
Sampling list: https://github.com/CognitionAI/devin-swebench-results/tree/main/
Original dataset: https://huggingface.co/datasets/princeton-nlp/SWE-bench/
@@ -69,3 +75,11 @@ ### After balanced sampling:
- Xarray (pydata): 2
- Matplotlib: 2
- Sphinx: 1
## scikit-learn-68 dataset Description
instance_id_pass: 12
instance_id_fail: 56
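
A minimal sketch of how a two-column index like this might be read. The column names `instance_id_pass` and `instance_id_fail` come from the description above. The CSV layout with a header row, the `read_index` helper, and the sample IDs are assumptions for illustration, since the index file itself is not shown in this diff.

```python
import csv
import io


def read_index(text):
    """Split a two-column index into pass and fail instance-ID lists."""
    passed, failed = [], []
    for row in csv.DictReader(io.StringIO(text)):
        # The two columns can differ in length (12 vs 56), so skip blank cells.
        if row.get("instance_id_pass"):
            passed.append(row["instance_id_pass"])
        if row.get("instance_id_fail"):
            failed.append(row["instance_id_fail"])
    return passed, failed


# Hypothetical placeholder IDs, not real SWE-bench instances.
sample = (
    "instance_id_pass,instance_id_fail\n"
    "example__repo-1,example__repo-2\n"
    ",example__repo-3\n"
)

passed, failed = read_index(sample)
print(len(passed), len(failed))  # prints: 1 2
```

Run against the real index, the two lists would be expected to have 12 and 56 entries respectively, which is a quick sanity check on the file.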

Binary file not shown.