JECS aggregates per-model conformal p-values via their maximum and reconstructs a conservative envelope of the max-p null distribution to select benchmarks with global contamination rate control.
Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=
5 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 5roles
background 1polarities
background 1representative citing papers
A generalization of the Benjamini-Hochberg procedure controls the FDR curve below any specified level in location families, and the standard procedure simultaneously controls the entire curve for free.
PRADAS derives a Bayes-optimal mirror statistic for any splitting scheme, establishes asymptotic FDR control under weak dependence, and optimizes the split ratio as a stopping time to improve power over standard equal-split methods.
Domino guarantees k-bFDR control under arbitrary dependence via the closure principle, extending boundary FDR methods to general settings for both p-values and e-values.
The weighted Holm procedure (WHP) based on ordered weighted p-values is uniformly more powerful than the weighted alternative Holm procedure (WAP) based on ordered raw p-values, with stronger optimality properties under FWER control.
citing papers explorer
-
Provable Joint Decontamination for Benchmarking Multiple Large Language Models
JECS aggregates per-model conformal p-values via their maximum and reconstructs a conservative envelope of the max-p null distribution to select benchmarks with global contamination rate control.
-
Simultaneous false discovery rate control in location families
A generalization of the Benjamini-Hochberg procedure controls the FDR curve below any specified level in location families, and the standard procedure simultaneously controls the entire curve for free.
-
PRADAS: PRior-Assisted DAta Splitting for False Discovery Rate Control
PRADAS derives a Bayes-optimal mirror statistic for any splitting scheme, establishes asymptotic FDR control under weak dependence, and optimizes the split ratio as a stopping time to improve power over standard equal-split methods.
-
Generalized Boundary FDR Control under Arbitrary Dependence: An Approach on Closure Principle
Domino guarantees k-bFDR control under arbitrary dependence via the closure principle, extending boundary FDR methods to general settings for both p-values and e-values.
-
Weighted Holm Procedures: Theory, Properties, and Recommendations
The weighted Holm procedure (WHP) based on ordered weighted p-values is uniformly more powerful than the weighted alternative Holm procedure (WAP) based on ordered raw p-values, with stronger optimality properties under FWER control.