nyx/tests/eval_corpus
2026-06-03 07:35:57 -05:00
..
ground_truth feat(eval-corpus): add Track R.2 polyglot corpora (RailsGoat, DVWA, DVPWA, gosec, RustSec) with curated manifests, negative controls, and CI validation 2026-06-01 10:04:38 -05:00
budget.toml feat(eval-corpus): add Track R.2 polyglot corpora (RailsGoat, DVWA, DVPWA, gosec, RustSec) with curated manifests, negative controls, and CI validation 2026-06-01 10:04:38 -05:00
check_surface.sh [pitboss/grind] deferred session-0010 (20260517T044708Z-e058) 2026-05-17 03:44:24 -05:00
manifest_gt_convert.py feat(eval-corpus): add Track R.2 polyglot corpora (RailsGoat, DVWA, DVPWA, gosec, RustSec) with curated manifests, negative controls, and CI validation 2026-06-01 10:04:38 -05:00
owasp_gt_convert.py feat(eval-corpus): implement OWASP Benchmark v1.2 acceptance with precision/recall floors, confirmed-rate tracking, and per-(cap,lang) budget enforcement 2026-05-29 15:39:27 -05:00
report.py feat(dynamic, eval): enhance hardening validation, CI budget tuning, and source-keyed target-dir isolation 2026-06-03 07:35:57 -05:00
run.sh feat(eval-corpus): add Track R.2 polyglot corpora (RailsGoat, DVWA, DVPWA, gosec, RustSec) with curated manifests, negative controls, and CI validation 2026-06-01 10:04:38 -05:00
run_full.sh feat(eval-corpus): add Track R.2 polyglot corpora (RailsGoat, DVWA, DVPWA, gosec, RustSec) with curated manifests, negative controls, and CI validation 2026-06-01 10:04:38 -05:00
sard_gt_convert.py introduce ground-truth converters for OWASP and SARD datasets 2026-05-12 16:16:26 -04:00
tabulate.py feat(dynamic, eval): enhance hardening validation, CI budget tuning, and source-keyed target-dir isolation 2026-06-03 07:35:57 -05:00
test_manifest_gt_convert.py feat(eval-corpus): add Track R.2 polyglot corpora (RailsGoat, DVWA, DVPWA, gosec, RustSec) with curated manifests, negative controls, and CI validation 2026-06-01 10:04:38 -05:00
test_tabulate_regression.py feat(ssa): optimize branch condition handling via constant folding, enhance precision for taint analysis, and expand OWASP Benchmark support 2026-06-02 13:41:45 -05:00