Merge pull request #1038 from Yevanchen/swebench_di

Swebench di
This commit is contained in:
Sirui Hong 2024-03-19 15:52:08 +08:00 committed by GitHub
commit f7ca991979
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
2 changed files with 71 additions and 0 deletions

View file

@ -0,0 +1,71 @@
# Dataset Description
The index of sub_swebench is a subset of swebench, with two columns in total, each column containing 50 id_instance.
The id_instance is a balanced subset of pass and fail samples for CognitionAI on swebench.
Sampling list:https://github.com/CognitionAI/devin-swebench-results/tree/main/
Original datasethttps://huggingface.co/datasets/princeton-nlp/SWE-bench/
## fail dataset Description
There are a total of 491 txt files listed.
In the original dataset, the distribution of pass case categories is:
- astropy: 24
- django: 160
- matplotlib: 42
- mwaskom: 4
- pallets: 3
- psf: 9
- pydata: 29
- pylint-dev: 13
- pytest-dev: 20
- scikit-learn: 56
- sphinx-doc: 46
- sympy: 85
### After balanced sampling:
There are a total of 50 txt files listed.
- Django: 16
- Scikit-Learn: 6
- Sympy: 10
- sphinx-doc:5
- matplotlib: 4
- pydata: 3
- astropy: 2
- pytest-dev: 2
- psf: 1
- pylint-dev: 1
## pass dataset Description
There are a total of 79 txt files listed.
In the original dataset, the distribution of pass case categories is:
- astropy: 4
- django: 38
- matplotlib: 3
- pydata: 3
- pytest-dev: 6
- scikit-learn: 12
- sphinx-doc: 2
- sympy: 11
### After balanced sampling:
There are a total of 50 txt files listed.
- Django: 23
- Scikit-Learn: 8
- Sympy: 7
- Pytest: 4
- Astropy: 3
- Xarray (pydata): 2
- Matplotlib: 2
- Sphinx: 1

Binary file not shown.
Can't render this file because it has a wrong number of fields in line 2.