mirror of
https://github.com/FoundationAgents/MetaGPT.git
synced 2026-06-17 15:35:21 +02:00
commit
f7ca991979
2 changed files with 71 additions and 0 deletions
71
sub_swebench_dataset/readme.md
Normal file
@@ -0,0 +1,71 @@
# Dataset Description

The index of sub_swebench is a subset of SWE-bench. It has two columns in total, each containing 50 instance IDs (`id_instance`).

The instance IDs are a balanced subset of the samples that CognitionAI passed and failed on SWE-bench.

- Sampling list: https://github.com/CognitionAI/devin-swebench-results/tree/main/
- Original dataset: https://huggingface.co/datasets/princeton-nlp/SWE-bench/

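As a rough sketch of how the index might be consumed, the snippet below reads a two-column CSV and returns the IDs found in each column. The column names `pass` and `fail` and the sample rows are assumptions for illustration only; the actual header of `sub_swebench.csv` is not shown in this README, so adjust the names to match the file.

```python
import csv
import io


def load_index(csv_text):
    """Return {column_name: [instance IDs]} from CSV text with a header row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = {name: [] for name in (reader.fieldnames or [])}
    for row in reader:
        for name in columns:
            value = (row.get(name) or "").strip()
            if value:  # skip empty cells so uneven columns still load
                columns[name].append(value)
    return columns


# Hypothetical sample with placeholder IDs; the real file has 50 IDs per column.
sample = (
    "pass,fail\n"
    "django__django-0001,sympy__sympy-0002\n"
    "astropy__astropy-0003,\n"
)
index = load_index(sample)
print({name: len(ids) for name, ids in index.items()})
```

Skipping empty cells is a deliberate choice here: it lets the loader tolerate rows where one column is shorter than the other.
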
## Fail dataset description:

There are a total of 491 txt files listed.
In the original dataset, the distribution of fail case categories is:

- astropy: 24
- django: 160
- matplotlib: 42
- mwaskom: 4
- pallets: 3
- psf: 9
- pydata: 29
- pylint-dev: 13
- pytest-dev: 20
- scikit-learn: 56
- sphinx-doc: 46
- sympy: 85

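The category names in this list appear to correspond to the GitHub organisation prefix of each SWE-bench instance ID, which has the form `<org>__<repo>-<number>` (for example, an ID starting with `pydata__xarray-`). Under that assumption, a per-category tally could be produced with a short sketch like the following. The IDs are made-up placeholders, not entries from the dataset.

```python
from collections import Counter


def category_of(instance_id):
    """Return the organisation prefix, i.e. the text before the first '__'."""
    return instance_id.split("__", 1)[0]


def count_categories(instance_ids):
    """Tally instance IDs per organisation prefix."""
    return Counter(category_of(i) for i in instance_ids)


# Placeholder IDs for illustration only.
ids = [
    "django__django-0001",
    "django__django-0002",
    "pydata__xarray-0003",
    "scikit-learn__scikit-learn-0004",
]
counts = count_categories(ids)
for name, n in sorted(counts.items()):
    print(f"- {name}: {n}")
```
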
### After balanced sampling:

There are a total of 50 txt files listed.

- django: 16
- scikit-learn: 6
- sympy: 10
- sphinx-doc: 5
- matplotlib: 4
- pydata: 3
- astropy: 2
- pytest-dev: 2
- psf: 1
- pylint-dev: 1

## Pass dataset description:

There are a total of 79 txt files listed.
In the original dataset, the distribution of pass case categories is:

- astropy: 4
- django: 38
- matplotlib: 3
- pydata: 3
- pytest-dev: 6
- scikit-learn: 12
- sphinx-doc: 2
- sympy: 11

### After balanced sampling:

There are a total of 50 txt files listed.

- django: 23
- scikit-learn: 8
- sympy: 7
- pytest-dev: 4
- astropy: 3
- pydata (xarray): 2
- matplotlib: 2
- sphinx-doc: 1

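The counts in this README can be checked for internal consistency with a few lines of arithmetic. The sketch below copies the figures from the four lists (with category names written as organisation names, which is an editorial assumption) and confirms that each list sums to its stated total and that no category is sampled more often than it occurs. It does not reproduce the sampling procedure itself, which the README does not describe.

```python
# Counts copied from the README lists; keys are organisation names.
fail_all = {"astropy": 24, "django": 160, "matplotlib": 42, "mwaskom": 4,
            "pallets": 3, "psf": 9, "pydata": 29, "pylint-dev": 13,
            "pytest-dev": 20, "scikit-learn": 56, "sphinx-doc": 46, "sympy": 85}
fail_sub = {"django": 16, "scikit-learn": 6, "sympy": 10, "sphinx-doc": 5,
            "matplotlib": 4, "pydata": 3, "astropy": 2, "pytest-dev": 2,
            "psf": 1, "pylint-dev": 1}
pass_all = {"astropy": 4, "django": 38, "matplotlib": 3, "pydata": 3,
            "pytest-dev": 6, "scikit-learn": 12, "sphinx-doc": 2, "sympy": 11}
pass_sub = {"django": 23, "scikit-learn": 8, "sympy": 7, "pytest-dev": 4,
            "astropy": 3, "pydata": 2, "matplotlib": 2, "sphinx-doc": 1}


def check(full, subset, full_total, subset_total):
    """Verify totals and that the subset never exceeds the full count."""
    assert sum(full.values()) == full_total
    assert sum(subset.values()) == subset_total
    assert all(subset[k] <= full[k] for k in subset)
    # Fraction of each category that was kept in the subset.
    return {k: round(subset.get(k, 0) / full[k], 2) for k in full}


fail_rates = check(fail_all, fail_sub, 491, 50)
pass_rates = check(pass_all, pass_sub, 79, 50)
print(fail_rates["django"], pass_rates["django"])  # 0.1 0.61
```

The much higher retention rate on the pass side follows directly from the totals: 50 of 79 pass cases are kept, compared with 50 of 491 fail cases.
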
BIN
sub_swebench_dataset/sub_swebench.csv
Normal file
Binary file not shown.