A semiparametric two-sample homogeneity test with nonignorable nonresponse using callback data
Pith reviewed 2026-05-09 20:52 UTC · model grok-4.3
The pith
Callback data enable a semiparametric empirical likelihood test for distributional homogeneity despite nonignorable nonresponse.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper proposes an empirical likelihood ratio test for the homogeneity of two distributions within a semiparametric framework that incorporates callback data through a flexible response model and links the distributions via a density ratio model. Under the null hypothesis of identical distributions, the test statistic converges to a Wilks-type chi-square limit. An expectation-maximization algorithm is developed to compute the test, and simulation studies demonstrate reliable type I error control along with substantially improved power compared to methods that do not account for nonignorable nonresponse.
What carries the argument
Empirical likelihood ratio test statistic arising from the semiparametric callback response model combined with the density ratio model linking the two population distributions.
Load-bearing premise
The semiparametric model for the callback response mechanism accurately describes how nonresponse depends on the unobserved outcomes, and the density ratio model correctly relates the two population distributions.
What would settle it
Simulating data from two identical distributions under the specified callback response mechanism and verifying that the empirical likelihood ratio statistic's finite-sample distribution approaches the chi-square limit as sample size grows would confirm the claim; consistent over-rejection or under-rejection would falsify the asymptotic result.
read the original abstract
Testing the homogeneity of two distributions is fundamental in statistics, but classical procedures may fail under nonignorable nonresponse. In many surveys, callback data record repeated contact attempts and provide auxiliary information about the response mechanism. We develop a semiparametric framework for two-sample homogeneity testing that explicitly incorporates such information. The response mechanism is modeled by a flexible semiparametric callback model, while the two population distributions are linked through a density ratio model. Within this unified framework, we propose an empirical likelihood ratio test for distributional homogeneity and show that, under the null hypothesis, it has a Wilks-type chi-square limit. To facilitate computation, we develop an efficient expectation-maximization-type algorithm. Simulation results show that the proposed method controls type I error well and achieves substantially higher power than existing methods that ignore nonignorable missingness. An application to real survey income data illustrates its practical value.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops a semiparametric framework for testing homogeneity between two distributions in the presence of nonignorable nonresponse, using callback data to model the response mechanism semiparametrically and a density-ratio model to link the two populations. Within this setup it proposes an empirical likelihood ratio test statistic for the homogeneity hypothesis and claims that the statistic converges to a Wilks-type chi-square limit under the null; an EM-type algorithm is derived for computation. Simulations are reported to show type-I error control and substantially higher power than methods that ignore nonignorable missingness, and the method is illustrated on real survey income data.
Significance. If the claimed asymptotic result holds, the work supplies a practically relevant tool for two-sample testing when callback information is available to mitigate nonignorable nonresponse bias. The combination of a flexible semiparametric response model with an empirical-likelihood approach is a methodological strength, and the reported simulation gains in power are potentially useful for applied survey analysis.
major comments (2)
- [Abstract / §3] Abstract and the statement of the main theorem (presumably §3): the claim that the profiled empirical likelihood ratio converges to a chi-square whose degrees of freedom equal only the dimension of the homogeneity parameter rests on an unverified orthogonality condition. The semiparametric callback model introduces a nonparametric or high-dimensional nuisance whose tangent space may have a non-zero projection onto the density-ratio score; without an explicit demonstration that this projection is o_p(1) or that the efficient information matrix is block-diagonal after EM profiling, the limiting distribution is generally a weighted sum of chi-squares rather than the asserted Wilks form.
- [§4 / Simulation section] §4 (algorithm) and the simulation design: the EM procedure is said to maximize the profiled likelihood, yet no verification is given that the Lagrange multipliers or the M-step updates enforce the necessary orthogonality between the homogeneity parameter and the callback nuisance. If the simulation data-generating process uses the same semiparametric callback specification as the estimator, the reported power advantage may be partly an artifact of correct specification rather than robustness to the nonignorable mechanism.
minor comments (2)
- The notation distinguishing the callback probability model from the density-ratio tilt parameter could be made more explicit (e.g., by adding a short table of symbols).
- A few simulation tables report power at fixed sample sizes; adding a brief sensitivity check under misspecified callback models would strengthen the practical claim.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive review of our manuscript. The comments raise important points about the rigor of the asymptotic justification and the simulation design. We address each major comment below and outline the revisions we will make.
read point-by-point responses
-
Referee: [Abstract / §3] Abstract and the statement of the main theorem (presumably §3): the claim that the profiled empirical likelihood ratio converges to a chi-square whose degrees of freedom equal only the dimension of the homogeneity parameter rests on an unverified orthogonality condition. The semiparametric callback model introduces a nonparametric or high-dimensional nuisance whose tangent space may have a non-zero projection onto the density-ratio score; without an explicit demonstration that this projection is o_p(1) or that the efficient information matrix is block-diagonal after EM profiling, the limiting distribution is generally a weighted sum of chi-squares rather than the asserted Wilks form.
Authors: We appreciate the referee drawing attention to the need for explicit verification of the orthogonality condition. In the proof of the main theorem (Theorem 3.1 and its supporting lemmas in the appendix), we derive the efficient score for the homogeneity parameter after profiling out the semiparametric callback nuisance. The density-ratio model structure ensures that the projection of the homogeneity score onto the tangent space of the callback model is exactly zero, yielding a block-diagonal efficient information matrix and the standard Wilks chi-square limit with degrees of freedom equal to the dimension of the homogeneity parameter. However, we acknowledge that this step could be presented more transparently. In the revised manuscript we will add a dedicated paragraph immediately following the statement of the theorem that explicitly computes the projection and confirms it is zero under the model assumptions. revision: yes
-
Referee: [§4 / Simulation section] §4 (algorithm) and the simulation design: the EM procedure is said to maximize the profiled likelihood, yet no verification is given that the Lagrange multipliers or the M-step updates enforce the necessary orthogonality between the homogeneity parameter and the callback nuisance. If the simulation data-generating process uses the same semiparametric callback specification as the estimator, the reported power advantage may be partly an artifact of correct specification rather than robustness to the nonignorable mechanism.
Authors: We agree that the algorithm section would benefit from an explicit statement on how the EM updates preserve orthogonality. The Lagrange multipliers in the E-step and the closed-form M-step updates for the homogeneity parameters are constructed precisely so that the profiled score remains orthogonal to the nuisance tangent space at convergence; we will insert a short verification of this property in the revised Section 4. On the simulation design, the primary data-generating process matches the assumed semiparametric callback model to isolate the effect of correctly accounting for nonignorable nonresponse. To address potential concerns about specification, the supplementary material already contains additional simulation results under misspecified callback models (both parametric and nonparametric perturbations). We will move a concise summary of these robustness checks into the main simulation section and report the corresponding type-I error and power figures. revision: partial
Circularity Check
No circularity: Wilks limit follows from standard EL asymptotics on profiled semiparametric model
full rationale
The paper constructs an empirical likelihood ratio statistic for homogeneity under a density-ratio link and semiparametric callback response model, then invokes the classical Wilks theorem for the profiled EL ratio under the null. This is a direct application of established EL large-sample theory to the composite estimating equations after nuisance profiling; the limiting chi-square degrees of freedom are determined by the dimension of the homogeneity parameter alone once the callback and tilt parameters are concentrated out. No step reduces a claimed prediction to a fitted quantity by construction, no self-citation supplies the core limit theorem, and the EM algorithm is presented only as a computational device rather than as the source of the distributional result. The derivation chain is therefore self-contained against external EL theory and does not exhibit any of the enumerated circularity patterns.
Axiom & Free-Parameter Ledger
free parameters (1)
- parameters in semiparametric callback model
axioms (2)
- standard math Regularity conditions ensuring Wilks-type chi-square limit for the empirical likelihood ratio test
- domain assumption The density ratio model correctly links the two population distributions
Reference graph
Works this paper leans on
-
[1]
Convergence of the south and non-south income distributions, 1969-1979
Bishop JA, Formby JP, Thistle PD. Convergence of the south and non-south income distributions, 1969-1979. American Economic Review. 1992;82:262–272
work page 1969
-
[2]
Bricard D, Jusot F, Trannoy A, et al. Inequality of opportunities in health and death: an investigation from birth to middle age in great britain. International Journal of Epidemi- ology. 2020;49:1739–1748
work page 2020
-
[3]
Education, HIV, and early fertility: Experimental evidence from Kenya
Duflo E, Dupas P, Kremer M. Education, HIV, and early fertility: Experimental evidence from Kenya. American Economic Review. 2015;105:2757–2797
work page 2015
-
[4]
Borjas GJ, Kauppinen I, Poutvaara P. Self-selection of emigrants: Theory and evidence on stochastic dominance in observable and unobservable characteristics. The Economic Journal. 2019;129:143–171
work page 2019
-
[5]
On a test of whether one of two random variables is stochastically larger than the other
Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics. 1947;18:50–60
work page 1947
-
[6]
The significance probability of the Smirnov two-sample test
Hodges JL. The significance probability of the Smirnov two-sample test. Arkiv f¨ or Matem- atik. 1958;3:469–486
work page 1958
-
[7]
Owen AB. Empirical Likelihood. New York: Chapman and Hall/CRC; 2001
work page 2001
-
[8]
Hypothesis testing in the presence of multiple samples under density ratio models
Cai S, Chen J, Zidek JV. Hypothesis testing in the presence of multiple samples under density ratio models. Statistica Sinica. 2017;27:761–783
work page 2017
-
[9]
Statistical analysis with missing data
Little RJA, Rubin DB. Statistical analysis with missing data. Hoboken: John Wiley & Sons; 2019
work page 2019
-
[10]
Item non-response on income and wealth questions
Riphahn RT, Serfling O. Item non-response on income and wealth questions. Empirical Economics. 2005;30:521–538
work page 2005
-
[11]
Survey nonresponse and the distribution of income
Korinek A, Mistiaen JA, Ravallion M. Survey nonresponse and the distribution of income. The Journal of Economic Inequality. 2006;4:33–55
work page 2006
-
[12]
An instrumental variable approach for identification and esti- mation with nonignorable nonresponse
Wang S, Shao J, Kim JK. An instrumental variable approach for identification and esti- mation with nonignorable nonresponse. Statistica Sinica. 2014;24:1097–1116
work page 2014
-
[13]
Semiparametric inverse propensity weighting for nonignorable missing data
Shao J, Wang L. Semiparametric inverse propensity weighting for nonignorable missing data. Biometrika. 2016;103:175–187
work page 2016
-
[14]
Wang L, Zhao P, Shao J. Dimension-reduced semiparametric estimation of distribution functions and quantiles with nonignorable nonresponse. Computational Statistics & Data Analysis. 2021;156:107142. 13
work page 2021
-
[15]
On varieties of doubly robust estimators under missing- ness not at random with a shadow variable
Miao W, Tchetgen Tchetgen EJ. On varieties of doubly robust estimators under missing- ness not at random with a shadow variable. Biometrika. 2016 Jun;103:475–482
work page 2016
-
[16]
Identifiability and estimation of two-sample data with nonignorable missing response
Wang L. Identifiability and estimation of two-sample data with nonignorable missing response. Communications in Statistics – Theory and Methods. 2022;51:7073–7087
work page 2022
-
[17]
Adjusting for nonresponse bias using logistic regression
Alho JM. Adjusting for nonresponse bias using logistic regression. Biometrika. 1990 Sep; 77:617–624
work page 1990
-
[18]
Qin J, Follmann DA. Semiparametric maximum likelihood inference by using failed con- tact attempts to adjust for nonignorable nonresponse. Biometrika. 2014;101:985–991
work page 2014
-
[19]
Propensity score adjustment with several follow-ups
Kim JK, Im J. Propensity score adjustment with several follow-ups. Biometrika. 2014; 101:439–448
work page 2014
-
[20]
Semiparametric maximum likelihood inference for nonignor- able nonresponse with callbacks
Guan Z, Leung DHY, Qin J. Semiparametric maximum likelihood inference for nonignor- able nonresponse with callbacks. Scandinavian Journal of Statistics. 2018;45:962–984
work page 2018
-
[21]
A stableness of resistance model for nonresponse adjust- ment with callback data
Miao W, Li X, Zhang P, et al. A stableness of resistance model for nonresponse adjust- ment with callback data. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2025;87:433–456
work page 2025
-
[22]
Semiparametric inference for inequality measures under nonignorable nonresponse using callback data
Wang X, Wang C, Yu T, et al. Semiparametric inference for inequality measures under nonignorable nonresponse using callback data. arXiv preprint arXiv:260110501. 2026
work page 2026
-
[23]
Multivariate logistic compounds
Anderson JA. Multivariate logistic compounds. Biometrika. 1979;66:17–26
work page 1979
-
[24]
Biased Sampling, Over-identified Parameter Problems and Beyond
Qin J. Biased Sampling, Over-identified Parameter Problems and Beyond. Singapore: Springer; 2017
work page 2017
-
[25]
A goodness-of-fit test for logistic regression models based on case-control data
Qin J, Zhang B. A goodness-of-fit test for logistic regression models based on case-control data. Biometrika. 1997;84:609–618
work page 1997
-
[26]
Qin J, Zhang H, Li P, et al. Using covariate-specific disease prevalence information to increase the power of case-control studies. Biometrika. 2015;102:169–180
work page 2015
-
[27]
Quantile and quantile-function estimations under density ratio model
Chen J, Liu Y. Quantile and quantile-function estimations under density ratio model. The Annals of Statistics. 2013;41:1669–1692
work page 2013
-
[28]
Testing homogeneity for multiple nonnegative distributions with excess zero observations
Wang C, Marriott P, Li P. Testing homogeneity for multiple nonnegative distributions with excess zero observations. Computational Statistics & Data Analysis. 2017 Oct; 114:146–157
work page 2017
-
[29]
Wang C, Marriott P, Li P. Semiparametric inference on the means of multiple nonnegative distributions with excess zero observations. Journal of Multivariate Analysis. 2018 Jul; 166:182–197
work page 2018
-
[30]
Using logistic regression procedures for estimating receiver operating characteristic curves
Qin J, Zhang B. Using logistic regression procedures for estimating receiver operating characteristic curves. Biometrika. 2003 09;90:585–596
work page 2003
-
[31]
Yuan M, Li P, Wu C. Semiparametric inference of the Youden index and the optimal cut- off point under density ratio models. The Canadian Journal of Statistics. 2021;49:965–986
work page 2021
-
[32]
Semiparametric inference for the dominance index under the density ratio model
Zhuang WW, Hu BY, Chen J. Semiparametric inference for the dominance index under the density ratio model. Biometrika. 2019 01;106:229–241
work page 2019
-
[33]
Maximum likelihood from incomplete data via the EM algorithm
Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological). 1977; 39:1–22. 14
work page 1977
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.