A semiparametric two-sample homogeneity test with nonignorable nonresponse using callback data

Chunlin Wang; Pengfei Li; Tao Yu; Xinyu Wang

arxiv: 2604.21735 · v1 · submitted 2026-04-23 · 📊 stat.ME

A semiparametric two-sample homogeneity test with nonignorable nonresponse using callback data

Xinyu Wang , Tao Yu , Chunlin Wang , Pengfei Li This is my paper

Pith reviewed 2026-05-09 20:52 UTC · model grok-4.3

classification 📊 stat.ME

keywords semiparametric testnonignorable nonresponsecallback dataempirical likelihood ratiodensity ratio modeltwo-sample homogeneitymissing data analysis

0 comments

The pith

Callback data enable a semiparametric empirical likelihood test for distributional homogeneity despite nonignorable nonresponse.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The authors develop a framework for testing whether two distributions are the same when data are missing in a nonignorable manner. They use callback records of contact attempts to model the response process semiparametrically and connect the populations with a density ratio model. An empirical likelihood ratio statistic is then proposed for the homogeneity hypothesis. This statistic is shown to follow a chi-square distribution under the null. Simulations indicate that the procedure maintains the correct type I error rate while delivering greater power than approaches that ignore the missingness mechanism.

Core claim

The paper proposes an empirical likelihood ratio test for the homogeneity of two distributions within a semiparametric framework that incorporates callback data through a flexible response model and links the distributions via a density ratio model. Under the null hypothesis of identical distributions, the test statistic converges to a Wilks-type chi-square limit. An expectation-maximization algorithm is developed to compute the test, and simulation studies demonstrate reliable type I error control along with substantially improved power compared to methods that do not account for nonignorable nonresponse.

What carries the argument

Empirical likelihood ratio test statistic arising from the semiparametric callback response model combined with the density ratio model linking the two population distributions.

Load-bearing premise

The semiparametric model for the callback response mechanism accurately describes how nonresponse depends on the unobserved outcomes, and the density ratio model correctly relates the two population distributions.

What would settle it

Simulating data from two identical distributions under the specified callback response mechanism and verifying that the empirical likelihood ratio statistic's finite-sample distribution approaches the chi-square limit as sample size grows would confirm the claim; consistent over-rejection or under-rejection would falsify the asymptotic result.

read the original abstract

Testing the homogeneity of two distributions is fundamental in statistics, but classical procedures may fail under nonignorable nonresponse. In many surveys, callback data record repeated contact attempts and provide auxiliary information about the response mechanism. We develop a semiparametric framework for two-sample homogeneity testing that explicitly incorporates such information. The response mechanism is modeled by a flexible semiparametric callback model, while the two population distributions are linked through a density ratio model. Within this unified framework, we propose an empirical likelihood ratio test for distributional homogeneity and show that, under the null hypothesis, it has a Wilks-type chi-square limit. To facilitate computation, we develop an efficient expectation-maximization-type algorithm. Simulation results show that the proposed method controls type I error well and achieves substantially higher power than existing methods that ignore nonignorable missingness. An application to real survey income data illustrates its practical value.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper builds a semiparametric EL ratio test for two-sample homogeneity that folds in callback data to handle nonignorable nonresponse, but the Wilks chi-square claim needs explicit proof that the callback nuisances do not spoil the limit.

read the letter

This paper develops a semiparametric approach to test if two distributions are homogeneous when nonresponse is nonignorable but callback data is available to help model it. The new part is combining a semiparametric callback model for the response probabilities with a density ratio model that ties the two population distributions together. They then use empirical likelihood to form a ratio test statistic for the homogeneity null, and they say it has the usual chi-square limit. Computation is handled by an EM-style algorithm. This seems like a reasonable way to bring in the callback information without fully parametric assumptions on everything. The simulations are said to show proper error control and improved power over naive methods, which is good to see for a method like this. Using real survey income data as an example also helps ground it. One place that needs care is the limiting distribution. The callback model is semiparametric, so it has its own nuisance components. For the EL ratio to have the standard Wilks chi-square with degrees of freedom just from the homogeneity parameter, the estimation of those nuisances has to not interfere with the score for the main parameter. The paper claims the limit holds, but if they haven't shown the necessary orthogonality or used some efficient estimation that achieves it, the limit could be a mixture instead. The abstract doesn't give the details, so that's the part I'd want to see worked out in the proofs. Without that, the soundness is hard to judge from the summary alone. Overall, this is for people in survey statistics or missing data who have access to callback records and want to compare distributions across groups. It could be of interest to that niche if the technical parts check out. A reader focused on semiparametric methods for surveys might get value from the framework, even if they end up adapting parts of it. I think it is worth sending for peer review. The core idea is targeted and the computational side is addressed, even if the asymptotics need a close look from referees. The work shows clear thinking on how to incorporate the auxiliary callback info into the test.

Referee Report

2 major / 2 minor

Summary. The manuscript develops a semiparametric framework for testing homogeneity between two distributions in the presence of nonignorable nonresponse, using callback data to model the response mechanism semiparametrically and a density-ratio model to link the two populations. Within this setup it proposes an empirical likelihood ratio test statistic for the homogeneity hypothesis and claims that the statistic converges to a Wilks-type chi-square limit under the null; an EM-type algorithm is derived for computation. Simulations are reported to show type-I error control and substantially higher power than methods that ignore nonignorable missingness, and the method is illustrated on real survey income data.

Significance. If the claimed asymptotic result holds, the work supplies a practically relevant tool for two-sample testing when callback information is available to mitigate nonignorable nonresponse bias. The combination of a flexible semiparametric response model with an empirical-likelihood approach is a methodological strength, and the reported simulation gains in power are potentially useful for applied survey analysis.

major comments (2)

[Abstract / §3] Abstract and the statement of the main theorem (presumably §3): the claim that the profiled empirical likelihood ratio converges to a chi-square whose degrees of freedom equal only the dimension of the homogeneity parameter rests on an unverified orthogonality condition. The semiparametric callback model introduces a nonparametric or high-dimensional nuisance whose tangent space may have a non-zero projection onto the density-ratio score; without an explicit demonstration that this projection is o_p(1) or that the efficient information matrix is block-diagonal after EM profiling, the limiting distribution is generally a weighted sum of chi-squares rather than the asserted Wilks form.
[§4 / Simulation section] §4 (algorithm) and the simulation design: the EM procedure is said to maximize the profiled likelihood, yet no verification is given that the Lagrange multipliers or the M-step updates enforce the necessary orthogonality between the homogeneity parameter and the callback nuisance. If the simulation data-generating process uses the same semiparametric callback specification as the estimator, the reported power advantage may be partly an artifact of correct specification rather than robustness to the nonignorable mechanism.

minor comments (2)

The notation distinguishing the callback probability model from the density-ratio tilt parameter could be made more explicit (e.g., by adding a short table of symbols).
A few simulation tables report power at fixed sample sizes; adding a brief sensitivity check under misspecified callback models would strengthen the practical claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful and constructive review of our manuscript. The comments raise important points about the rigor of the asymptotic justification and the simulation design. We address each major comment below and outline the revisions we will make.

read point-by-point responses

Referee: [Abstract / §3] Abstract and the statement of the main theorem (presumably §3): the claim that the profiled empirical likelihood ratio converges to a chi-square whose degrees of freedom equal only the dimension of the homogeneity parameter rests on an unverified orthogonality condition. The semiparametric callback model introduces a nonparametric or high-dimensional nuisance whose tangent space may have a non-zero projection onto the density-ratio score; without an explicit demonstration that this projection is o_p(1) or that the efficient information matrix is block-diagonal after EM profiling, the limiting distribution is generally a weighted sum of chi-squares rather than the asserted Wilks form.

Authors: We appreciate the referee drawing attention to the need for explicit verification of the orthogonality condition. In the proof of the main theorem (Theorem 3.1 and its supporting lemmas in the appendix), we derive the efficient score for the homogeneity parameter after profiling out the semiparametric callback nuisance. The density-ratio model structure ensures that the projection of the homogeneity score onto the tangent space of the callback model is exactly zero, yielding a block-diagonal efficient information matrix and the standard Wilks chi-square limit with degrees of freedom equal to the dimension of the homogeneity parameter. However, we acknowledge that this step could be presented more transparently. In the revised manuscript we will add a dedicated paragraph immediately following the statement of the theorem that explicitly computes the projection and confirms it is zero under the model assumptions. revision: yes
Referee: [§4 / Simulation section] §4 (algorithm) and the simulation design: the EM procedure is said to maximize the profiled likelihood, yet no verification is given that the Lagrange multipliers or the M-step updates enforce the necessary orthogonality between the homogeneity parameter and the callback nuisance. If the simulation data-generating process uses the same semiparametric callback specification as the estimator, the reported power advantage may be partly an artifact of correct specification rather than robustness to the nonignorable mechanism.

Authors: We agree that the algorithm section would benefit from an explicit statement on how the EM updates preserve orthogonality. The Lagrange multipliers in the E-step and the closed-form M-step updates for the homogeneity parameters are constructed precisely so that the profiled score remains orthogonal to the nuisance tangent space at convergence; we will insert a short verification of this property in the revised Section 4. On the simulation design, the primary data-generating process matches the assumed semiparametric callback model to isolate the effect of correctly accounting for nonignorable nonresponse. To address potential concerns about specification, the supplementary material already contains additional simulation results under misspecified callback models (both parametric and nonparametric perturbations). We will move a concise summary of these robustness checks into the main simulation section and report the corresponding type-I error and power figures. revision: partial

Circularity Check

0 steps flagged

No circularity: Wilks limit follows from standard EL asymptotics on profiled semiparametric model

full rationale

The paper constructs an empirical likelihood ratio statistic for homogeneity under a density-ratio link and semiparametric callback response model, then invokes the classical Wilks theorem for the profiled EL ratio under the null. This is a direct application of established EL large-sample theory to the composite estimating equations after nuisance profiling; the limiting chi-square degrees of freedom are determined by the dimension of the homogeneity parameter alone once the callback and tilt parameters are concentrated out. No step reduces a claimed prediction to a fitted quantity by construction, no self-citation supplies the core limit theorem, and the EM algorithm is presented only as a computational device rather than as the source of the distributional result. The derivation chain is therefore self-contained against external EL theory and does not exhibit any of the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The approach relies on standard asymptotic regularity conditions for empirical likelihood and semiparametric estimation, plus modeling assumptions for the callback response mechanism and density ratio that are not independently verified in the provided abstract.

free parameters (1)

parameters in semiparametric callback model
The flexible callback model for response mechanism requires estimation of unspecified components from data.

axioms (2)

standard math Regularity conditions ensuring Wilks-type chi-square limit for the empirical likelihood ratio test
Invoked to establish the asymptotic distribution under the null hypothesis.
domain assumption The density ratio model correctly links the two population distributions
Central modeling choice for the two-sample setup.

pith-pipeline@v0.9.0 · 5460 in / 1307 out tokens · 23524 ms · 2026-05-09T20:52:27.883930+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

33 extracted references · 33 canonical work pages

[1]

Convergence of the south and non-south income distributions, 1969-1979

Bishop JA, Formby JP, Thistle PD. Convergence of the south and non-south income distributions, 1969-1979. American Economic Review. 1992;82:262–272

work page 1969
[2]

Inequality of opportunities in health and death: an investigation from birth to middle age in great britain

Bricard D, Jusot F, Trannoy A, et al. Inequality of opportunities in health and death: an investigation from birth to middle age in great britain. International Journal of Epidemi- ology. 2020;49:1739–1748

work page 2020
[3]

Education, HIV, and early fertility: Experimental evidence from Kenya

Duflo E, Dupas P, Kremer M. Education, HIV, and early fertility: Experimental evidence from Kenya. American Economic Review. 2015;105:2757–2797

work page 2015
[4]

Self-selection of emigrants: Theory and evidence on stochastic dominance in observable and unobservable characteristics

Borjas GJ, Kauppinen I, Poutvaara P. Self-selection of emigrants: Theory and evidence on stochastic dominance in observable and unobservable characteristics. The Economic Journal. 2019;129:143–171

work page 2019
[5]

On a test of whether one of two random variables is stochastically larger than the other

Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics. 1947;18:50–60

work page 1947
[6]

The significance probability of the Smirnov two-sample test

Hodges JL. The significance probability of the Smirnov two-sample test. Arkiv f¨ or Matem- atik. 1958;3:469–486

work page 1958
[7]

Empirical Likelihood

Owen AB. Empirical Likelihood. New York: Chapman and Hall/CRC; 2001

work page 2001
[8]

Hypothesis testing in the presence of multiple samples under density ratio models

Cai S, Chen J, Zidek JV. Hypothesis testing in the presence of multiple samples under density ratio models. Statistica Sinica. 2017;27:761–783

work page 2017
[9]

Statistical analysis with missing data

Little RJA, Rubin DB. Statistical analysis with missing data. Hoboken: John Wiley & Sons; 2019

work page 2019
[10]

Item non-response on income and wealth questions

Riphahn RT, Serfling O. Item non-response on income and wealth questions. Empirical Economics. 2005;30:521–538

work page 2005
[11]

Survey nonresponse and the distribution of income

Korinek A, Mistiaen JA, Ravallion M. Survey nonresponse and the distribution of income. The Journal of Economic Inequality. 2006;4:33–55

work page 2006
[12]

An instrumental variable approach for identification and esti- mation with nonignorable nonresponse

Wang S, Shao J, Kim JK. An instrumental variable approach for identification and esti- mation with nonignorable nonresponse. Statistica Sinica. 2014;24:1097–1116

work page 2014
[13]

Semiparametric inverse propensity weighting for nonignorable missing data

Shao J, Wang L. Semiparametric inverse propensity weighting for nonignorable missing data. Biometrika. 2016;103:175–187

work page 2016
[14]

Dimension-reduced semiparametric estimation of distribution functions and quantiles with nonignorable nonresponse

Wang L, Zhao P, Shao J. Dimension-reduced semiparametric estimation of distribution functions and quantiles with nonignorable nonresponse. Computational Statistics & Data Analysis. 2021;156:107142. 13

work page 2021
[15]

On varieties of doubly robust estimators under missing- ness not at random with a shadow variable

Miao W, Tchetgen Tchetgen EJ. On varieties of doubly robust estimators under missing- ness not at random with a shadow variable. Biometrika. 2016 Jun;103:475–482

work page 2016
[16]

Identifiability and estimation of two-sample data with nonignorable missing response

Wang L. Identifiability and estimation of two-sample data with nonignorable missing response. Communications in Statistics – Theory and Methods. 2022;51:7073–7087

work page 2022
[17]

Adjusting for nonresponse bias using logistic regression

Alho JM. Adjusting for nonresponse bias using logistic regression. Biometrika. 1990 Sep; 77:617–624

work page 1990
[18]

Semiparametric maximum likelihood inference by using failed con- tact attempts to adjust for nonignorable nonresponse

Qin J, Follmann DA. Semiparametric maximum likelihood inference by using failed con- tact attempts to adjust for nonignorable nonresponse. Biometrika. 2014;101:985–991

work page 2014
[19]

Propensity score adjustment with several follow-ups

Kim JK, Im J. Propensity score adjustment with several follow-ups. Biometrika. 2014; 101:439–448

work page 2014
[20]

Semiparametric maximum likelihood inference for nonignor- able nonresponse with callbacks

Guan Z, Leung DHY, Qin J. Semiparametric maximum likelihood inference for nonignor- able nonresponse with callbacks. Scandinavian Journal of Statistics. 2018;45:962–984

work page 2018
[21]

A stableness of resistance model for nonresponse adjust- ment with callback data

Miao W, Li X, Zhang P, et al. A stableness of resistance model for nonresponse adjust- ment with callback data. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2025;87:433–456

work page 2025
[22]

Semiparametric inference for inequality measures under nonignorable nonresponse using callback data

Wang X, Wang C, Yu T, et al. Semiparametric inference for inequality measures under nonignorable nonresponse using callback data. arXiv preprint arXiv:260110501. 2026

work page 2026
[23]

Multivariate logistic compounds

Anderson JA. Multivariate logistic compounds. Biometrika. 1979;66:17–26

work page 1979
[24]

Biased Sampling, Over-identified Parameter Problems and Beyond

Qin J. Biased Sampling, Over-identified Parameter Problems and Beyond. Singapore: Springer; 2017

work page 2017
[25]

A goodness-of-fit test for logistic regression models based on case-control data

Qin J, Zhang B. A goodness-of-fit test for logistic regression models based on case-control data. Biometrika. 1997;84:609–618

work page 1997
[26]

Using covariate-specific disease prevalence information to increase the power of case-control studies

Qin J, Zhang H, Li P, et al. Using covariate-specific disease prevalence information to increase the power of case-control studies. Biometrika. 2015;102:169–180

work page 2015
[27]

Quantile and quantile-function estimations under density ratio model

Chen J, Liu Y. Quantile and quantile-function estimations under density ratio model. The Annals of Statistics. 2013;41:1669–1692

work page 2013
[28]

Testing homogeneity for multiple nonnegative distributions with excess zero observations

Wang C, Marriott P, Li P. Testing homogeneity for multiple nonnegative distributions with excess zero observations. Computational Statistics & Data Analysis. 2017 Oct; 114:146–157

work page 2017
[29]

Semiparametric inference on the means of multiple nonnegative distributions with excess zero observations

Wang C, Marriott P, Li P. Semiparametric inference on the means of multiple nonnegative distributions with excess zero observations. Journal of Multivariate Analysis. 2018 Jul; 166:182–197

work page 2018
[30]

Using logistic regression procedures for estimating receiver operating characteristic curves

Qin J, Zhang B. Using logistic regression procedures for estimating receiver operating characteristic curves. Biometrika. 2003 09;90:585–596

work page 2003
[31]

Semiparametric inference of the Youden index and the optimal cut- off point under density ratio models

Yuan M, Li P, Wu C. Semiparametric inference of the Youden index and the optimal cut- off point under density ratio models. The Canadian Journal of Statistics. 2021;49:965–986

work page 2021
[32]

Semiparametric inference for the dominance index under the density ratio model

Zhuang WW, Hu BY, Chen J. Semiparametric inference for the dominance index under the density ratio model. Biometrika. 2019 01;106:229–241

work page 2019
[33]

Maximum likelihood from incomplete data via the EM algorithm

Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological). 1977; 39:1–22. 14

work page 1977

[1] [1]

Convergence of the south and non-south income distributions, 1969-1979

Bishop JA, Formby JP, Thistle PD. Convergence of the south and non-south income distributions, 1969-1979. American Economic Review. 1992;82:262–272

work page 1969

[2] [2]

Inequality of opportunities in health and death: an investigation from birth to middle age in great britain

Bricard D, Jusot F, Trannoy A, et al. Inequality of opportunities in health and death: an investigation from birth to middle age in great britain. International Journal of Epidemi- ology. 2020;49:1739–1748

work page 2020

[3] [3]

Education, HIV, and early fertility: Experimental evidence from Kenya

Duflo E, Dupas P, Kremer M. Education, HIV, and early fertility: Experimental evidence from Kenya. American Economic Review. 2015;105:2757–2797

work page 2015

[4] [4]

Self-selection of emigrants: Theory and evidence on stochastic dominance in observable and unobservable characteristics

Borjas GJ, Kauppinen I, Poutvaara P. Self-selection of emigrants: Theory and evidence on stochastic dominance in observable and unobservable characteristics. The Economic Journal. 2019;129:143–171

work page 2019

[5] [5]

On a test of whether one of two random variables is stochastically larger than the other

Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics. 1947;18:50–60

work page 1947

[6] [6]

The significance probability of the Smirnov two-sample test

Hodges JL. The significance probability of the Smirnov two-sample test. Arkiv f¨ or Matem- atik. 1958;3:469–486

work page 1958

[7] [7]

Empirical Likelihood

Owen AB. Empirical Likelihood. New York: Chapman and Hall/CRC; 2001

work page 2001

[8] [8]

Hypothesis testing in the presence of multiple samples under density ratio models

Cai S, Chen J, Zidek JV. Hypothesis testing in the presence of multiple samples under density ratio models. Statistica Sinica. 2017;27:761–783

work page 2017

[9] [9]

Statistical analysis with missing data

Little RJA, Rubin DB. Statistical analysis with missing data. Hoboken: John Wiley & Sons; 2019

work page 2019

[10] [10]

Item non-response on income and wealth questions

Riphahn RT, Serfling O. Item non-response on income and wealth questions. Empirical Economics. 2005;30:521–538

work page 2005

[11] [11]

Survey nonresponse and the distribution of income

Korinek A, Mistiaen JA, Ravallion M. Survey nonresponse and the distribution of income. The Journal of Economic Inequality. 2006;4:33–55

work page 2006

[12] [12]

An instrumental variable approach for identification and esti- mation with nonignorable nonresponse

Wang S, Shao J, Kim JK. An instrumental variable approach for identification and esti- mation with nonignorable nonresponse. Statistica Sinica. 2014;24:1097–1116

work page 2014

[13] [13]

Semiparametric inverse propensity weighting for nonignorable missing data

Shao J, Wang L. Semiparametric inverse propensity weighting for nonignorable missing data. Biometrika. 2016;103:175–187

work page 2016

[14] [14]

Dimension-reduced semiparametric estimation of distribution functions and quantiles with nonignorable nonresponse

Wang L, Zhao P, Shao J. Dimension-reduced semiparametric estimation of distribution functions and quantiles with nonignorable nonresponse. Computational Statistics & Data Analysis. 2021;156:107142. 13

work page 2021

[15] [15]

On varieties of doubly robust estimators under missing- ness not at random with a shadow variable

Miao W, Tchetgen Tchetgen EJ. On varieties of doubly robust estimators under missing- ness not at random with a shadow variable. Biometrika. 2016 Jun;103:475–482

work page 2016

[16] [16]

Identifiability and estimation of two-sample data with nonignorable missing response

Wang L. Identifiability and estimation of two-sample data with nonignorable missing response. Communications in Statistics – Theory and Methods. 2022;51:7073–7087

work page 2022

[17] [17]

Adjusting for nonresponse bias using logistic regression

Alho JM. Adjusting for nonresponse bias using logistic regression. Biometrika. 1990 Sep; 77:617–624

work page 1990

[18] [18]

Semiparametric maximum likelihood inference by using failed con- tact attempts to adjust for nonignorable nonresponse

Qin J, Follmann DA. Semiparametric maximum likelihood inference by using failed con- tact attempts to adjust for nonignorable nonresponse. Biometrika. 2014;101:985–991

work page 2014

[19] [19]

Propensity score adjustment with several follow-ups

Kim JK, Im J. Propensity score adjustment with several follow-ups. Biometrika. 2014; 101:439–448

work page 2014

[20] [20]

Semiparametric maximum likelihood inference for nonignor- able nonresponse with callbacks

Guan Z, Leung DHY, Qin J. Semiparametric maximum likelihood inference for nonignor- able nonresponse with callbacks. Scandinavian Journal of Statistics. 2018;45:962–984

work page 2018

[21] [21]

A stableness of resistance model for nonresponse adjust- ment with callback data

Miao W, Li X, Zhang P, et al. A stableness of resistance model for nonresponse adjust- ment with callback data. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2025;87:433–456

work page 2025

[22] [22]

Semiparametric inference for inequality measures under nonignorable nonresponse using callback data

Wang X, Wang C, Yu T, et al. Semiparametric inference for inequality measures under nonignorable nonresponse using callback data. arXiv preprint arXiv:260110501. 2026

work page 2026

[23] [23]

Multivariate logistic compounds

Anderson JA. Multivariate logistic compounds. Biometrika. 1979;66:17–26

work page 1979

[24] [24]

Biased Sampling, Over-identified Parameter Problems and Beyond

Qin J. Biased Sampling, Over-identified Parameter Problems and Beyond. Singapore: Springer; 2017

work page 2017

[25] [25]

A goodness-of-fit test for logistic regression models based on case-control data

Qin J, Zhang B. A goodness-of-fit test for logistic regression models based on case-control data. Biometrika. 1997;84:609–618

work page 1997

[26] [26]

Using covariate-specific disease prevalence information to increase the power of case-control studies

Qin J, Zhang H, Li P, et al. Using covariate-specific disease prevalence information to increase the power of case-control studies. Biometrika. 2015;102:169–180

work page 2015

[27] [27]

Quantile and quantile-function estimations under density ratio model

Chen J, Liu Y. Quantile and quantile-function estimations under density ratio model. The Annals of Statistics. 2013;41:1669–1692

work page 2013

[28] [28]

Testing homogeneity for multiple nonnegative distributions with excess zero observations

Wang C, Marriott P, Li P. Testing homogeneity for multiple nonnegative distributions with excess zero observations. Computational Statistics & Data Analysis. 2017 Oct; 114:146–157

work page 2017

[29] [29]

Semiparametric inference on the means of multiple nonnegative distributions with excess zero observations

Wang C, Marriott P, Li P. Semiparametric inference on the means of multiple nonnegative distributions with excess zero observations. Journal of Multivariate Analysis. 2018 Jul; 166:182–197

work page 2018

[30] [30]

Using logistic regression procedures for estimating receiver operating characteristic curves

Qin J, Zhang B. Using logistic regression procedures for estimating receiver operating characteristic curves. Biometrika. 2003 09;90:585–596

work page 2003

[31] [31]

Semiparametric inference of the Youden index and the optimal cut- off point under density ratio models

Yuan M, Li P, Wu C. Semiparametric inference of the Youden index and the optimal cut- off point under density ratio models. The Canadian Journal of Statistics. 2021;49:965–986

work page 2021

[32] [32]

Semiparametric inference for the dominance index under the density ratio model

Zhuang WW, Hu BY, Chen J. Semiparametric inference for the dominance index under the density ratio model. Biometrika. 2019 01;106:229–241

work page 2019

[33] [33]

Maximum likelihood from incomplete data via the EM algorithm

Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological). 1977; 39:1–22. 14

work page 1977