hub

Optimal Inference After Model Selection

William Fithian, Dennis Sun, Jonathan Taylor · 2014 · math.ST · arXiv 1410.2597

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

open full Pith review browse 12 citing papers arXiv PDF

abstract

To perform inference after model selection, we propose controlling the selective type I error; i.e., the error rate of a test given that it was performed. By doing so, we recover long-run frequency properties among selected hypotheses analogous to those that apply in the classical (non-adaptive) context. Our proposal is closely related to data splitting and has a similar intuitive justification, but is more powerful. Exploiting the classical theory of Lehmann and Scheff\'e (1955), we derive most powerful unbiased selective tests and confidence intervals for inference in exponential family models after arbitrary selection procedures. For linear regression, we derive new selective z-tests that generalize recent proposals for inference after model selection and improve on their power, and new selective t-tests that do not require knowledge of the error variance.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 1 other 1

citation-polarity summary

background 1 unclear 1

representative citing papers

Post-ADC Inference: Valid Inference After Active Data Collection

stat.ML · 2026-05-12 · unverdicted · novelty 7.0

Post-ADC inference supplies valid p-values and confidence intervals for data-dependent targets after active data collection by extending selective inference to correct for both adaptive sampling bias and post-hoc target selection, relying only on noise assumptions.

In-Sample Evaluation of Subgroups Identified by Generic Machine Learning

stat.ME · 2026-05-04 · unverdicted · novelty 7.0

A conditional adaptive perturbation approach enables valid in-sample inference for machine learning-identified subgroups with nonregular boundaries via triple robustness.

Integrating Diagnostic Checks into Estimation

econ.EM · 2026-04-17 · unverdicted · novelty 7.0

Residualizing estimators against diagnostic check statistics eliminates selective reporting distortions, reduces variance when the model is correct, and minimizes worst-case bias under local misspecification.

$\phi$-Table: A Statistical Explanation for Global SHAP

stat.ML · 2025-12-08 · unverdicted · novelty 6.0

The φ-table extends SHAP rankings into a statistical table by fitting standardized linear surrogates to the model response and reporting direction, uncertainty, fidelity, and stability.

Improving Power by Conditioning on Less in Post-selection Inference for Changepoints

stat.ME · 2023-01-13 · unverdicted · novelty 6.0

Monte Carlo approximation of selective p-values in changepoint post-selection inference that conditions on less to improve power while remaining valid for any sample size.

Selective Inference via Marginal Screening for High Dimensional Classification

stat.ME · 2019-06-26 · unverdicted · novelty 6.0

Derives asymptotic selective inference for high-dimensional logistic regression post marginal screening to enable valid hypothesis testing.

Towards Reliable LLM Evaluation: Correcting the Winner's Curse in Adaptive Benchmarking

stat.ML · 2026-05-07 · unverdicted · novelty 6.0

SIREN corrects winner's curse bias in adaptive LLM benchmarking via selection-aware repeated splits and bootstrap for valid procedure-level confidence intervals.

A Leakage Bound for Confidence Sets after Black-Box Selection

math.ST · 2026-04-29 · unverdicted · novelty 6.0

Selected-target noncoverage after black-box selection is bounded by nominal fixed-target noncoverage plus average total variation distance between marginal and conditional laws of the inferential data.

Post-Screening Portfolio Selection

q-fin.PM · 2026-04-19 · unverdicted · novelty 6.0

A Lasso-based screening step followed by low-dimensional mean-variance optimization on the selected assets improves high-dimensional portfolio construction, with a defactoring extension for strong factors.

Statistical Test for Diffusion-Based Anomaly Localization via Selective Inference

stat.ML · 2024-02-19 · unverdicted · novelty 5.0

A selective inference framework is proposed to provide p-values controlling false positive rates for diffusion-based anomaly localization in images.

Weighted Holm Procedures: Theory, Properties, and Recommendations

stat.ME · 2026-04-21 · conditional · novelty 5.0

The weighted Holm procedure (WHP) based on ordered weighted p-values is uniformly more powerful than the weighted alternative Holm procedure (WAP) based on ordered raw p-values, with stronger optimality properties under FWER control.

Inference conditional on selection: a review

stat.ME · 2026-04-10 · unverdicted · novelty 2.0

The review covers selective inference techniques that provide conditional guarantees for inference after data-dependent selection, demonstrated with examples from winner inference, regression trees, clustering, and single-cell RNA sequencing.

citing papers explorer

Showing 12 of 12 citing papers.

Post-ADC Inference: Valid Inference After Active Data Collection stat.ML · 2026-05-12 · unverdicted · none · ref 7
Post-ADC inference supplies valid p-values and confidence intervals for data-dependent targets after active data collection by extending selective inference to correct for both adaptive sampling bias and post-hoc target selection, relying only on noise assumptions.
In-Sample Evaluation of Subgroups Identified by Generic Machine Learning stat.ME · 2026-05-04 · unverdicted · none · ref 13
A conditional adaptive perturbation approach enables valid in-sample inference for machine learning-identified subgroups with nonregular boundaries via triple robustness.
Integrating Diagnostic Checks into Estimation econ.EM · 2026-04-17 · unverdicted · none · ref 2
Residualizing estimators against diagnostic check statistics eliminates selective reporting distortions, reduces variance when the model is correct, and minimizes worst-case bias under local misspecification.
$\phi$-Table: A Statistical Explanation for Global SHAP stat.ML · 2025-12-08 · unverdicted · none · ref 2 · internal anchor
The φ-table extends SHAP rankings into a statistical table by fitting standardized linear surrogates to the model response and reporting direction, uncertainty, fidelity, and stability.
Improving Power by Conditioning on Less in Post-selection Inference for Changepoints stat.ME · 2023-01-13 · unverdicted · none · ref 18 · internal anchor
Monte Carlo approximation of selective p-values in changepoint post-selection inference that conditions on less to improve power while remaining valid for any sample size.
Selective Inference via Marginal Screening for High Dimensional Classification stat.ME · 2019-06-26 · unverdicted · none · ref 3 · internal anchor
Derives asymptotic selective inference for high-dimensional logistic regression post marginal screening to enable valid hypothesis testing.
Towards Reliable LLM Evaluation: Correcting the Winner's Curse in Adaptive Benchmarking stat.ML · 2026-05-07 · unverdicted · none · ref 12
SIREN corrects winner's curse bias in adaptive LLM benchmarking via selection-aware repeated splits and bootstrap for valid procedure-level confidence intervals.
A Leakage Bound for Confidence Sets after Black-Box Selection math.ST · 2026-04-29 · unverdicted · none · ref 5
Selected-target noncoverage after black-box selection is bounded by nominal fixed-target noncoverage plus average total variation distance between marginal and conditional laws of the inferential data.
Post-Screening Portfolio Selection q-fin.PM · 2026-04-19 · unverdicted · none · ref 40
A Lasso-based screening step followed by low-dimensional mean-variance optimization on the selected assets improves high-dimensional portfolio construction, with a defactoring extension for strong factors.
Statistical Test for Diffusion-Based Anomaly Localization via Selective Inference stat.ML · 2024-02-19 · unverdicted · none · ref 10 · internal anchor
A selective inference framework is proposed to provide p-values controlling false positive rates for diffusion-based anomaly localization in images.
Weighted Holm Procedures: Theory, Properties, and Recommendations stat.ME · 2026-04-21 · conditional · none · ref 108
The weighted Holm procedure (WHP) based on ordered weighted p-values is uniformly more powerful than the weighted alternative Holm procedure (WAP) based on ordered raw p-values, with stronger optimality properties under FWER control.
Inference conditional on selection: a review stat.ME · 2026-04-10 · unverdicted · none · ref 1
The review covers selective inference techniques that provide conditional guarantees for inference after data-dependent selection, demonstrated with examples from winner inference, regression trees, clustering, and single-cell RNA sequencing.

Optimal Inference After Model Selection

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer