Prediction-powered inference with imputed covariates and nonuniform sampling. arXiv preprint arXiv:2501.18577

Dan M Kluger, Kerri Lu, Tijana Zrnic, Sherrie Wang, Stephen Bates · 2025 · arXiv 2501.18577

11 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Valid Inference with Synthetic Data via Task Exchangeability

stat.ME · 2026-06-11 · unverdicted · novelty 6.0

Proposes task exchangeability as a condition for valid inference when using synthetic data in scientific research, with methods and extensions demonstrated on surveys and AI evaluations.

Divide-and-shrink: An efficient and heterogeneity-agnostic approach for transfer estimation using summary statistics

stat.ME · 2026-06-08 · unverdicted · novelty 6.0

dShrink is a model-free transfer estimator using summary statistics that is guaranteed to have lower expected quadratic error than the target-only estimator under arbitrary population heterogeneity.

On prediction-powered inference for quantile regression via convolution smoothing

stat.ME · 2026-06-02 · unverdicted · novelty 6.0

Introduces convolution smoothing of the check-loss for prediction-powered quantile regression, derives asymptotics under misspecification, and proposes an ensemble estimator.

Optimized Labeling Resource Allocation for Prediction-Assisted Inference via OPAL

stat.ME · 2026-06-02 · unverdicted · novelty 6.0

OPAL learns optimal smooth labeling policies from ML uncertainty scores to enable low-variance prediction-assisted inference with finite-sample coverage guarantees.

Learning U-Statistics with Active Inference

stat.ML · 2026-05-12 · unverdicted · novelty 6.0

Active inference framework for U-statistics using augmented IPW to optimize label queries and minimize variance under budget constraints.

Empirical Bayes Rebiasing

stat.ME · 2026-05-08 · unverdicted · novelty 6.0

Empirical Bayes rebiasing learns the bias distribution from paired noisy estimates to produce shorter calibrated intervals than full debiasing while maintaining coverage.

Augmented transfer regression learning for completely missing covariates

stat.ME · 2026-05-06 · unverdicted · novelty 6.0

A doubly robust, asymptotically normal estimator for regression with completely missing covariates across populations, combining importance weighting and moment imputation under a sub-population shift assumption.

Estimate Level Adjustment For Inference With Proxies Under Random Distribution Shifts

stat.ME · 2026-05-07 · unverdicted · novelty 5.0

A framework models proxy-primary outcome discrepancies as random effects at the parameter level, estimated from aggregated historical observations to calibrate inferences under distribution shifts.

Active Hypothesis Testing under Computational Budgets with Applications to GWAS and LLM

stat.ME · 2025-12-01 · unverdicted · novelty 5.0

Active hypothesis testing framework uses auxiliary statistics for data-adaptive budget allocation to produce valid p-values or e-values with optimality under independence and admissibility under dependence.

Semiparametric semi-supervised learning for general targets under distribution shift and decaying overlap

math.ST · 2025-05-09 · unverdicted · novelty 5.0

Introduces D2S3, a semiparametric framework that extends AIPW estimators to semi-supervised settings with MAR labeling, distribution shift, and decaying overlap, supplying corrected asymptotic rates instead of root-n convergence.

Industrializing Prediction-Powered Inference: The GLIDE Library for Reliable GenAI and Agentic Systems Evaluation

cs.AI · 2026-05-29 · unverdicted · novelty 3.0

GLIDE is a Python library that packages multiple PPI estimators and samplers for reliable GenAI evaluation and reports annotation savings in an agentic case study.

citing papers explorer

Showing 11 of 11 citing papers after filters.

Valid Inference with Synthetic Data via Task Exchangeability · stat.ME · 2026-06-11 · unverdicted · none · ref 22
Proposes task exchangeability as a condition for valid inference when using synthetic data in scientific research, with methods and extensions demonstrated on surveys and AI evaluations.
Divide-and-shrink: An efficient and heterogeneity-agnostic approach for transfer estimation using summary statistics · stat.ME · 2026-06-08 · unverdicted · none · ref 12
dShrink is a model-free transfer estimator using summary statistics that is guaranteed to have lower expected quadratic error than the target-only estimator under arbitrary population heterogeneity.
On prediction-powered inference for quantile regression via convolution smoothing · stat.ME · 2026-06-02 · unverdicted · none · ref 7
Introduces convolution smoothing of the check-loss for prediction-powered quantile regression, derives asymptotics under misspecification, and proposes an ensemble estimator.
Optimized Labeling Resource Allocation for Prediction-Assisted Inference via OPAL · stat.ME · 2026-06-02 · unverdicted · none · ref 34
OPAL learns optimal smooth labeling policies from ML uncertainty scores to enable low-variance prediction-assisted inference with finite-sample coverage guarantees.
Learning U-Statistics with Active Inference · stat.ML · 2026-05-12 · unverdicted · none · ref 36
Active inference framework for U-statistics using augmented IPW to optimize label queries and minimize variance under budget constraints.
Empirical Bayes Rebiasing · stat.ME · 2026-05-08 · unverdicted · none · ref 3
Empirical Bayes rebiasing learns the bias distribution from paired noisy estimates to produce shorter calibrated intervals than full debiasing while maintaining coverage.
Augmented transfer regression learning for completely missing covariates · stat.ME · 2026-05-06 · unverdicted · none · ref 3
A doubly robust, asymptotically normal estimator for regression with completely missing covariates across populations, combining importance weighting and moment imputation under a sub-population shift assumption.
Estimate Level Adjustment For Inference With Proxies Under Random Distribution Shifts · stat.ME · 2026-05-07 · unverdicted · none · ref 22
A framework models proxy-primary outcome discrepancies as random effects at the parameter level, estimated from aggregated historical observations to calibrate inferences under distribution shifts.
Active Hypothesis Testing under Computational Budgets with Applications to GWAS and LLM · stat.ME · 2025-12-01 · unverdicted · none · ref 21
Active hypothesis testing framework uses auxiliary statistics for data-adaptive budget allocation to produce valid p-values or e-values with optimality under independence and admissibility under dependence.
Semiparametric semi-supervised learning for general targets under distribution shift and decaying overlap · math.ST · 2025-05-09 · unverdicted · none · ref 7
Introduces D2S3, a semiparametric framework that extends AIPW estimators to semi-supervised settings with MAR labeling, distribution shift, and decaying overlap, supplying corrected asymptotic rates instead of root-n convergence.
Industrializing Prediction-Powered Inference: The GLIDE Library for Reliable GenAI and Agentic Systems Evaluation · cs.AI · 2026-05-29 · unverdicted · none · ref 9
GLIDE is a Python library that packages multiple PPI estimators and samplers for reliable GenAI evaluation and reports annotation savings in an agentic case study.
