Statistical Optimality of Prediction-Powered Inference

Jae Kwang Kim; Se Yoon Lee

arxiv: 2606.08730 · v1 · pith:OGGFJLOVnew · submitted 2026-06-07 · 🧮 math.ST · stat.TH


Pith reviewed 2026-06-27 17:38 UTC · model grok-4.3

classification 🧮 math.ST stat.TH

keywords prediction-powered inference · semiparametric efficiency · M-estimation · semi-supervised inference · efficient influence function · asymptotic normality · cross-fitting


The pith

Prediction-powered inference reaches the semiparametric efficiency bound when its predictor is score-calibrated.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper reframes prediction-powered inference (PPI) as an M-estimation problem, linking the bias-corrected estimating equation directly to the ideal full-data equation. This framing yields consistency and asymptotic normality under simple random sampling without replacement. The work then identifies the efficient influence function and proves that the PPI estimator attains the semiparametric efficiency lower bound when the predictor is score-calibrated, that is, when its output equals the true conditional expectation of the estimating function. For learned predictors, the paper develops asymptotic theory for cross-fitting and for a variance-corrected single-fit procedure in the special case of semiparametric mean estimation.

Core claim

PPI can attain the semiparametric efficiency lower bound when the predictor is score-calibrated, that is, when the predictor's output aligns with the true conditional expectation of the estimating function. Framing PPI as M-estimation reveals that the bias-corrected PPI estimating equation matches the ideal full-data estimating equation, delivering consistency and asymptotic normality under simple random sampling without replacement.
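The link between the two estimating equations can be written out explicitly. The block below is a sketch in notation assumed here, not taken from the paper: U(θ; x, y) is the estimating function, Û_i(θ) is the predictor's output for unit i, S is a simple random sample of n units drawn without replacement from N units, and m(x; θ) is the conditional expectation of the estimating function.

```latex
\begin{align*}
% Ideal full-data estimating equation (labels observed for all N units)
\frac{1}{N}\sum_{i=1}^{N} U(\theta; x_i, y_i) &= 0, \\
% Bias-corrected PPI estimating equation (labels observed only for i in S)
\frac{1}{N}\sum_{i=1}^{N} \hat U_i(\theta)
  + \frac{1}{n}\sum_{i \in S}\bigl\{ U(\theta; x_i, y_i) - \hat U_i(\theta) \bigr\} &= 0, \\
% Score calibration: the prediction equals the conditional mean of U
\hat U_i(\theta) &= m(x_i;\theta) := E\{ U(\theta; X, Y) \mid X = x_i \}.
\end{align*}
```

Under simple random sampling without replacement, the sample average of the correction term has design expectation equal to its average over all N units, so the expected PPI equation coincides with the full-data equation. For mean estimation with a predictor f, taking U = y - θ and Û_i = f(x_i) - θ gives the familiar estimator: the average prediction over all N units plus the average labeled residual y_i - f(x_i).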

What carries the argument

The score-calibrated predictor inside the bias-corrected PPI estimating equation, which aligns the estimator's influence function with the efficient influence function.

If this is right

The PPI estimator is consistent and asymptotically normal under simple random sampling without replacement.
Cross-fitting produces valid asymptotic theory when the prediction rule is learned from data.
A single-fit variant with variance correction attains efficiency in the special case of semiparametric mean estimation.
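The cross-fitting idea can be illustrated for mean estimation. The sketch below is not the paper's procedure: it assumes a separate unlabeled sample, a plain least-squares predictor, and K-fold splitting of the labeled data, and the names `fit_linear` and `cross_fit_ppi_mean` are hypothetical.

```python
import numpy as np

def fit_linear(x, y):
    """Least-squares fit of y on [1, x]; returns a prediction function."""
    design = np.column_stack([np.ones(len(x)), x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return lambda x_new: np.column_stack([np.ones(len(x_new)), x_new]) @ coef

def cross_fit_ppi_mean(x_lab, y_lab, x_unlab, n_folds=5, seed=0):
    """Cross-fitted PPI estimate of E[Y] (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y_lab)), n_folds)
    resid = np.empty(len(y_lab))
    pred_unlab = np.zeros(len(x_unlab))
    for held_out in folds:
        # Train on every labeled point outside the held-out fold.
        train = np.setdiff1d(np.arange(len(y_lab)), held_out)
        predict = fit_linear(x_lab[train], y_lab[train])
        # Residuals come from a model that never saw these labels.
        resid[held_out] = y_lab[held_out] - predict(x_lab[held_out])
        # Average the fold models' predictions on the unlabeled data.
        pred_unlab += predict(x_unlab) / n_folds
    # Prediction average plus the bias correction from labeled residuals.
    return pred_unlab.mean() + resid.mean()

# Toy data (assumed model): Y = 1 + 2 X + noise, so E[Y] = 1.
rng = np.random.default_rng(1)
x_unlab = rng.normal(size=5000)
x_lab = rng.normal(size=200)
y_lab = 1.0 + 2.0 * x_lab + rng.normal(size=200)
theta_hat = cross_fit_ppi_mean(x_lab, y_lab, x_unlab)
print(theta_hat)
```

Each labeled residual is computed from a model trained without that observation, which is what keeps the bias-correction term from being contaminated by overfitting.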

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

The same efficiency result may apply to other M-estimators once the calibration condition is met.
Practical performance of PPI will degrade smoothly as the predictor deviates from exact score calibration.
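A short calculation suggests why smooth degradation is plausible, at least for mean estimation with a fixed (not learned) predictor f. It is a sketch under assumptions made here and not a result quoted from the paper: population units are i.i.d., n of the N units are labeled by simple random sampling without replacement, and m(x) = E(Y | X = x).

```latex
\begin{align*}
\hat\theta &= \frac{1}{N}\sum_{i=1}^{N} f(x_i)
  + \frac{1}{n}\sum_{i\in S}\{y_i - f(x_i)\}, \\
\mathrm{Var}(\hat\theta) &= \frac{\mathrm{Var}(Y)}{N}
  + \Bigl(\frac{1}{n}-\frac{1}{N}\Bigr)\,\mathrm{Var}\{Y - f(X)\}, \\
\mathrm{Var}\{Y - f(X)\} &= E\{\mathrm{Var}(Y\mid X)\}
  + \mathrm{Var}\{m(X) - f(X)\}.
\end{align*}
```

The excess over the calibrated case f = m is (1/n - 1/N) Var{m(X) - f(X)}, which shrinks continuously to zero as f approaches m. Adding a constant to f leaves the variance unchanged, so only the non-constant part of the miscalibration matters in this special case.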

Load-bearing premise

The machine learning predictor exactly equals the conditional expectation of the estimating function given the covariates.

What would settle it

A calculation or simulation in which the predictor is set away from the conditional expectation and the asymptotic variance of the PPI estimator is observed to exceed the semiparametric efficiency bound.
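A simulation of this kind might look like the following. It is a minimal sketch for mean estimation under an assumed toy model (Y = 2X + noise with standard normal X and noise), not the paper's experiment. The miscalibrated predictor halves the true conditional mean, and the reference value `bound` is the usual efficiency bound for a mean under this toy model, Var(m(X))/N + E[Var(Y|X)]/n, stated here as an assumption.

```python
import numpy as np

N_POP, N_LAB = 2000, 100  # population size and labeled-sample size (toy values)

def ppi_mean(y_lab, f_lab, f_all):
    """Bias-corrected PPI mean: prediction average plus labeled-residual average."""
    return f_all.mean() + (y_lab - f_lab).mean()

def simulate(scale, reps=2000, seed=0):
    """Monte Carlo variance of the PPI mean when the predictor is scale * E[Y|X]."""
    rng = np.random.default_rng(seed)
    est = np.empty(reps)
    for r in range(reps):
        x = rng.normal(size=N_POP)
        y = 2.0 * x + rng.normal(size=N_POP)   # E[Y|X] = 2X, Var(Y|X) = 1
        f = scale * 2.0 * x                    # scale = 1 is score-calibrated
        # Simple random sampling without replacement of the labeled units.
        idx = rng.choice(N_POP, size=N_LAB, replace=False)
        est[r] = ppi_mean(y[idx], f[idx], f)
    return est.var(ddof=1)

var_cal = simulate(scale=1.0)   # predictor equals the conditional expectation
var_mis = simulate(scale=0.5)   # predictor set away from it
bound = 4.0 / N_POP + 1.0 / N_LAB  # Var(2X)/N + Var(noise)/n for this toy model
print(var_cal, var_mis, bound)
```

If the claim holds, `var_cal` should sit near `bound` while `var_mis` is visibly larger. A pure constant shift of the predictor would not serve as a test here, because the mean estimator's variance is invariant to it.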

Original abstract

The prediction-powered inference (PPI) proposed by Angelopoulos et al. (2023) is a popular method that leverages a small number of labeled samples and machine learning predictions for semi-supervised inference. While several variants of PPI have appeared in the literature, its rigorous statistical theory has not been fully developed. In this paper, we study the statistical optimality of PPI. Our contributions span both foundational theory and new methodology. First, we frame PPI as an M-estimation problem, revealing a link between the bias-corrected PPI estimating equation and the ideal full-data estimating equation. This connection leads to the consistency and asymptotic normality of the PPI estimator under simple random sampling without replacement. Next, we identify the efficient influence function and prove that PPI can attain the semiparametric efficiency lower bound when the predictor is score-calibrated, that is, when the predictor's output aligns with the true conditional expectation of the estimating function. Finally, for learned prediction rules, we develop asymptotic theory for cross-fitting and for a single-fit variant with variance correction in the special case of semiparametric mean estimation. Simulation experiments and a real-data application support these findings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note · private letter to a colleague

PPI reaches the semiparametric efficiency bound under explicit score calibration, with the M-estimation framing supplying the main technical link.


The paper supplies the missing optimality theory for prediction-powered inference. It shows that the bias-corrected PPI estimator attains the semiparametric efficiency bound once the predictor is score-calibrated, meaning its output equals the conditional expectation of the estimating function.

They frame the method as an M-estimation problem. This reveals that the PPI estimating equation matches the ideal full-data one, which immediately gives consistency and asymptotic normality under simple random sampling. They then derive the efficient influence function and verify that the PPI influence function coincides with it under the calibration condition.

For cases with learned predictors, they provide theory for cross-fitting and a single-fit version with variance correction, at least for mean estimation. The simulations and real-data example line up with the asymptotics.

The calibration condition is the main assumption, and it is stated clearly rather than hidden. It is not trivial to satisfy in practice, but the paper treats it as a requirement for the efficiency result instead of claiming it always holds. No circularity or internal gaps appear in the argument.

This work is for statisticians and machine learning researchers who use or study semi-supervised inference methods. Anyone wanting rigorous efficiency guarantees for PPI will find the M-estimation connection and the EIF match useful. It deserves a serious referee because the new asymptotic results and the efficiency attainment are concrete contributions beyond the original PPI paper.

I would send it to peer review.

Referee Report

0 major / 1 minor

Summary. The manuscript frames prediction-powered inference (PPI) as an M-estimation problem whose bias-corrected estimating equation matches the ideal full-data equation, establishes consistency and asymptotic normality under simple random sampling without replacement, identifies the efficient influence function, and proves that the PPI estimator attains the semiparametric efficiency lower bound when the predictor is score-calibrated (i.e., equals the conditional expectation of the estimating function). For learned prediction rules, it further develops asymptotic theory for cross-fitting and for a variance-corrected single-fit variant in the special case of semiparametric mean estimation, with supporting simulation experiments and a real-data application.

Significance. If the derivations hold, the work supplies a rigorous semiparametric-efficiency justification for PPI, explicitly conditioning the efficiency result on the score-calibration assumption and thereby clarifying when the method is optimal. This strengthens the theoretical foundation for a widely used semi-supervised inference technique and provides practical extensions for learned predictors.

minor comments (1)

The abstract and introduction would benefit from a brief explicit statement of the sampling scheme (simple random sampling without replacement) when first introducing the consistency result, to avoid any ambiguity for readers unfamiliar with the PPI literature.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive assessment of the manuscript, recognition of its contributions to the semiparametric theory of prediction-powered inference, and recommendation to accept.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper frames PPI as M-estimation to link its bias-corrected estimating equation to the ideal full-data equation, then invokes standard semiparametric efficiency theory to identify the EIF and shows coincidence under the explicitly stated score-calibration assumption (predictor equals conditional expectation of the estimating function). This is a conditional result on an external modeling assumption, not a reduction of the efficiency claim to a quantity fitted or defined inside the paper. No self-citation load-bearing steps, no fitted inputs renamed as predictions, and no uniqueness theorems imported from the authors' prior work appear in the derivation chain. The argument is self-contained against external semiparametric benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Review is based on abstract only; the ledger is therefore minimal and reflects only the conditions explicitly named in the abstract. The score-calibration condition is treated as a domain assumption rather than a free parameter.

axioms (2)

standard math · Standard M-estimation theory and influence function arguments apply directly to the bias-corrected PPI estimating equation under simple random sampling without replacement.
The abstract states that framing PPI as M-estimation leads to consistency and asymptotic normality.
domain assumption · The predictor can be made score-calibrated, i.e., its output equals the true conditional expectation of the estimating function.
This condition is required for the efficiency bound result and is presented as a key hypothesis.


Reference graph

Works this paper leans on

300 extracted references · 1 canonical work pages

[1] Baker, R., Brick, J. M., Bates, N. A., Battaglia, M., Couper, M. P., Dever, J. A., Gile, K. J., and Tourangeau, R.
[2] Sampling for web surveys.
[3] Compositional Model Inference.
[4] Perils and potentials of self-selected entry to epidemiological studies and surveys. Journal of the Royal Statistical Society: Series A (Statistics in Society), 2016.
[5] Chen, S. and Kim, J. K. Population empirical likelihood for nonparametric inference in survey sampling.
[6] Morikawa, K., Terada, Y., and Kim, J. K. Semiparametric adaptive estimation under informative sampling.
[7] Tripathi, G. 1999.
[8] Combining survey data with other data sources. Statistical Science, 2017.
[9] An imputation approach for handling mixed-mode surveys. Annals of Applied Statistics, 2016.
[10] A measurement error model for survey data integration: combining information from two surveys. 2017.
[11] Inference and missing data. Biometrika, 1976.
[12] Dever, J. A. and Valliant, R. Journal of Survey Statistics and Methodology, 4, 2016.
[13] Valliant, R. and Dever, J. A. Sociological Methods and Research, 40, 2011.
[14] A unified framework for semiparametrically efficient semi-supervised learning. arXiv preprint arXiv:2502.17741.
[15] Lee, Se Yoon and Kim, Jae Kwang.
[16] On Making Valid Inferences by Integrating Data from Surveys and Other Sources. 2020.
[17] Inference for nonprobability samples. Statistical Science.
[18] Doubly robust and locally efficient estimation with missing outcomes. Statistica Sinica.
[19] Prediction Powered Inference. 2023.
[20] Debiased calibration estimation using generalized entropy in survey sampling. Journal of the American Statistical Association, 2025.
[21] Breidt, F. Jay, McVey, Anita, and Fuller, Wayne A. Journal of the Indian Society of Agricultural Statistics.
[22] Biased-sample empirical likelihood weighting for missing data problems: an alternative to inverse probability weighting. Journal of the Royal Statistical Society Series B: Statistical Methodology, 2023.
[23] Morikawa, K., Terada, Y., and Kim, J. K. Semiparametric adaptive estimation under informative sampling.
[24] Parametric fractional imputation for missing data analysis. Biometrika.
[25] Kim, Jae Kwang and Rao, Jon N. K. Biometrika.
[26] Predictive mean matching imputation in survey sampling. 2020.
[27] The 2006 Cooperative Congressional Election Study. Journal of Elections, Public Opinion and Parties.
[28] Estimation for Volunteer Panel Web Surveys Using Propensity Score Adjustment and Calibration Adjustment. Sociological Methods and Research.
[29] Sampling Statistic.
[30] The Science of Web Surveys.
[31] Imputation under informative sampling. Journal of Survey Statistics and Methodology, 2016.
[32] Rao, J. N. K. and Wu, C. F. J. 1988.
[33] Zhang, L.-C. and Chambers, R. L. Analysis of Integrated Data.
[34] Solving the nonresponse problem with sample matching? Social Science Computer Review.
[35] Combining Household Surveys Using Mass Imputation to Estimate Population Totals. Australian & New Zealand Journal of Statistics.
[36] Doubly Robust Inference with Non-probability Survey Samples. Journal of the American Statistical Association.
[37] Bacharoglou, Athanassia. Approximation of probability distributions by convex mixtures of
[38] Sliced inverse regression for dimension reduction. Journal of the American Statistical Association, 1991.
[39] On the interpretation of regression plots. Journal of the American Statistical Association, 1994.
[40] Regression graphics: Ideas for studying regressions through graphics. 2009.
[41] Dimensionality reduction for supervised learning with reproducing kernel Hilbert spaces. Journal of Machine Learning Research.
[42] On directional regression for dimension reduction. Journal of the American Statistical Association, 2007.
[43] Doubly robust inference when combining probability and non-probability samples with high dimensional data.
[44] Yang, Shu and Kim, J. K. Survey Data Integration: A review.
[45] On the asymptotic normality of statistics with estimated parameters.
[46] R. L. Anderson. Maximum likelihood estimates for the multivariate normal distribution when some observations are missing.
[47] J. F. Beaumont. Calibrated imputation in surveys under a quasi-model-assisted approach.
[48] T. Chang and P. S. Kott. Using calibration weighting to adjust for nonresponse under a plausible model. Biometrika.
[49] J. F. Beaumont and C. Bocci. Variance estimation when donor imputation is used to fill in missing values.
[50] J. G. Booth and J. P. Hobert. Maximizing generalized linear models with an automated Monte Carlo EM algorithm.
[51] W. Cao, A. A. Tsiatis, and M. Davidian. Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data.
[52] S. X. Chen, D. Y. H. Leung, and J. Qin. Improved Semiparametric Estimation Using Surrogate Data.
[53] K. Chen. Parametric models for response-biased sampling.
[54] J. Chen and J. Shao. Jackknife variance estimation for nearest neighbor imputation.
[55] D. Clayton, D. Spiegelhalter, G. Dunn, and A. Pickles. Analysis of longitudinal binary data from multiphase sampling.
[56] A. P. Dawid. Conditional Independence in Statistical Theory.
[57] A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm.
[58] B. Efron and D. V. Hinkley. Assessing the accuracy of the maximum likelihood estimator: Observed versus expected Fisher Information. Biometrika.
[59] R. E. Fay. When are inferences from multiple imputation valid? Proc. Survey Res. Meth. Sect.
[60] Patterson, H. D. and Thompson, R. Recovery of inter-block information when block sizes are unequal.
[61] Harville, D. Maximum likelihood approaches to variance component estimation. Journal of the American Statistical Association.
[62] S. Geman and D. Geman. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images.
[63] A. Gelman, X. L. Meng, and H. Stern. Posterior predictive assessment of model fitness via realized discrepancies (with discussion).
[64] W. R. Gilks and P. Wild. Adaptive rejection sampling for Gibbs sampling. Applied Statistics.
[65] V. P. Godambe and V. M. Joshi. Admissibility and Bayes estimation in sampling finite populations - V. Annals of Mathematical Statistics.
[66] V. P. Godambe and M. E. Thompson. Parameters of superpopulation and survey population: their relationships and estimation.
[67] Whitney K. Newey and Daniel McFadden. Handbook of Econometrics, 1994.
[68] A. W. van der Vaart. 1998.
[69] F. R. Hampel. The Influence Curve and its Role in Robust Estimation.
[70] Cross-prediction-powered inference. Proceedings of the National Academy of Sciences, 2024.
[71] W. K. Hastings. Monte Carlo sampling methods using Markov chains and their applications.
[72] M. Henmi and S. Eguchi. A paradox concerning nuisance parameters and projected estimating functions.
[73] M. Henmi, R. Yoshida, and S. Eguchi. Importance sampling via the estimated sampler. Biometrika.
[74] J. G. Ibrahim. Incomplete data in generalized linear models.
[75] G. Kalton and L. Kish. Some efficient random imputation methods. Communications in Statistics: Series A.
[76] J. K. Kim. A note on approximate Bayesian bootstrap imputation.
[77] Super learner. Statistical Applications in Genetics and Molecular Biology, 2007.
[78] J. K. Kim, M. J. Brick, W. A. Fuller, and G. Kalton. On the bias of the multiple imputation variance estimator in survey sampling.
[79] J. K. Kim. Finite sample properties of multiple imputation estimators.
[80] J. K. Kim and W. A. Fuller. Fractional hot deck imputation. Biometrika.

Showing first 80 references.
