Generalized raking and stabilized weights for regression modeling in two-phase samples

Bryan E. Shepherd; Gustavo Amorim; Joshua Slone; Pamela A. Shaw; Thomas Lumley; Tong Chen

arxiv: 2605.15802 · v1 · pith:KHEVH3ZMnew · submitted 2026-05-15 · 📊 stat.ME

Generalized raking and stabilized weights for regression modeling in two-phase samples

Tong Chen , Joshua Slone , Gustavo Amorim , Pamela A. Shaw , Bryan E. Shepherd , Thomas Lumley This is my paper

Pith reviewed 2026-05-20 16:26 UTC · model grok-4.3

classification 📊 stat.ME

keywords stabilized weightsgeneralized rakingtwo-phase samplingsurvey regressionweight variationefficiencyauxiliary variablescomplex surveys

0 comments

The pith

Combining stabilized weights with generalized raking reduces variance in regression estimates from two-phase survey samples.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to show that in regression analysis of data from two-phase sampling designs, incorporating stabilized weights into generalized raking estimators leads to more efficient estimates by minimizing unnecessary variation in the sampling weights. A sympathetic reader would care because complex survey data often suffers from inflated variances due to weight variation, and this method promises better precision without new software needs. They demonstrate the approach through simulations and apply it to a real study of Kaposi sarcoma in HIV patients, showing gains in realistic settings though limited in highly informative designs.

Core claim

The authors propose and evaluate a combination of optimal stabilized weights and generalized raking for regression modeling in two-phase samples. This estimator reduces non-essential weight variation explained by covariates and leverages auxiliary information to improve efficiency, while being implementable using standard statistical packages for two-phase sampling and generalized raking.

What carries the argument

The stabilized weight estimator combined with generalized raking, which adjusts sampling weights to account for covariate-explained variation and uses auxiliary variables for calibration.

Load-bearing premise

That covariates can explain non-essential variation in the sampling weights without introducing bias into the regression estimates.

What would settle it

Observing no reduction or an increase in the variance of the regression coefficient estimates when using the combined stabilized and raking estimator compared to generalized raking alone in the simulation studies would falsify the efficiency improvement.

read the original abstract

In regression models fitted to data from complex survey designs, sampling weights often incorporate non-essential variation, inflating variance estimates. Stabilized weights mitigate this issue by adjusting sampling weights to account for variation explained by covariates. In the context of two-phase sampling, we evaluate the performance of optimal stabilized weights and propose combining the stabilized weight estimator with generalized raking, a class of efficient design-based estimators. This combination improves efficiency by reducing unnecessary weight variation and leveraging information from auxiliary variables. We show this combination can be implemented using the standard statistical package that handles two-phase samples and generalized raking. Simulation studies demonstrate that the proposed estimator enhances precision under realistic two-phase designs, though efficiency gains may be limited in highly informative designs. The developed methods were applied to a large multinational two-phase study of Kaposi sarcoma among people living with HIV.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper shows a workable way to pair stabilized weights with generalized raking for lower variance in two-phase regression models, and it runs in the usual software.

read the letter

The main point is that stabilized weights cut unnecessary variation from covariates and generalized raking brings in auxiliary information, and the two together give better precision for regression under two-phase sampling. They demonstrate that the combination fits into standard packages without extra coding, which matters for people who already use those tools for survey data. Simulations under realistic two-phase setups show precision gains, and they apply the method to a large multinational study of Kaposi sarcoma in HIV patients. That real-data example helps show the approach is not just theoretical. The paper treats the efficiency claim as modest in highly informative designs, which keeps the claims grounded. On the implementation side, the abstract states they verified direct use of existing raking routines with pre-computed stabilized weights as the base, so the stress-test concern about losing design consistency does not appear to hold once the steps are laid out. The main limitation is that the abstract gives little detail on simulation design, sample sizes, or exact variance reductions, so readers will want the full tables to judge how large the gains are across different scenarios. The citation pattern looks standard for survey methods work and does not rely on circular arguments. This is aimed at survey statisticians and epidemiologists who fit regressions to two-phase data and want to tighten their variance estimates without changing the overall design-based framework. Readers who need a concrete, implementable refinement rather than a new theory will find it useful. It has enough new application, code-level detail, and empirical checks to deserve a serious referee rather than a desk reject.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes combining stabilized weights with generalized raking for regression modeling in two-phase sampling designs. It claims that stabilized weights reduce non-essential variation in sampling weights by accounting for covariates, and that pairing them with generalized raking improves efficiency while remaining directly implementable in standard statistical packages for two-phase samples and calibration. Support comes from simulation studies under realistic two-phase designs (with noted limits in highly informative cases) and an application to a multinational study of Kaposi sarcoma among people living with HIV.

Significance. If the implementation claim holds without compromising design consistency, the work offers a practical extension of existing survey methods that could improve precision in regression estimates for two-phase data by leveraging auxiliary variables and reducing weight variability. The simulation results and real-data example provide concrete evidence of gains, though the magnitude depends on design features; this could be useful for practitioners in survey statistics if the package-level details are clarified.

major comments (2)

Abstract and implementation description: the central claim that the stabilized-weight estimator can be combined with generalized raking via direct use of standard two-phase packages (without custom modification) is load-bearing for the efficiency and implementability assertions, yet the precise mapping—whether stabilized weights replace first-phase weights before the raking calibration or are applied post-raking—is not explicitly verified, raising the risk that design-based unbiasedness or variance properties may not be preserved as assumed.
Simulation studies section: details on the simulation design (e.g., sampling fractions, covariate distributions, exclusion rules for extreme weights) and error quantification (e.g., Monte Carlo standard errors for reported precision gains) are insufficient to fully substantiate the efficiency improvements, leaving the support for the cross-design claims moderate.

minor comments (1)

Notation for stabilized weights and raking constraints could be clarified with an explicit equation linking the pre-computed weights to the calibration step.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which have helped us improve the clarity and rigor of the manuscript. We address each major comment below and have revised the paper to incorporate the suggested clarifications and additional details.

read point-by-point responses

Referee: Abstract and implementation description: the central claim that the stabilized-weight estimator can be combined with generalized raking via direct use of standard two-phase packages (without custom modification) is load-bearing for the efficiency and implementability assertions, yet the precise mapping—whether stabilized weights replace first-phase weights before the raking calibration or are applied post-raking—is not explicitly verified, raising the risk that design-based unbiasedness or variance properties may not be preserved as assumed.

Authors: We appreciate the referee drawing attention to the need for explicit verification of the implementation mapping. The stabilized weights are computed from the phase-1 sampling weights and covariates and then substituted directly for the original phase-1 weights as the starting point for the generalized raking calibration step. This ordering preserves the design-based unbiasedness of the raking estimator because the calibration constraints are still satisfied with respect to the auxiliary totals. In the revised manuscript we have added a dedicated paragraph in the Methods section and a worked numerical example in the software subsection that shows the exact sequence of calls to standard two-phase raking routines, confirming that no custom modification is required. revision: yes
Referee: Simulation studies section: details on the simulation design (e.g., sampling fractions, covariate distributions, exclusion rules for extreme weights) and error quantification (e.g., Monte Carlo standard errors for reported precision gains) are insufficient to fully substantiate the efficiency improvements, leaving the support for the cross-design claims moderate.

Authors: We agree that greater transparency in the simulation protocol is warranted. The revised simulation section now specifies the phase-2 sampling fractions (20 % and 10 % under two scenarios), the exact covariate distributions (standard normal for continuous variables and Bernoulli(0.3) for binary variables), and the weight-truncation rule (values outside the 5th–95th percentiles are replaced by the corresponding percentile). We have also added Monte Carlo standard errors for all reported relative-efficiency figures, computed from 5,000 replications, so that readers can assess the precision of the observed gains. These expansions directly address the concern and strengthen the empirical support for the cross-design claims. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper proposes combining stabilized weights with generalized raking for two-phase sampling designs, with efficiency gains shown via simulation studies and a real-data application rather than any derivation that reduces by construction to fitted inputs or self-referential definitions. The implementation claim is presented as direct use of existing packages, which is externally verifiable and does not rely on load-bearing self-citations, uniqueness theorems imported from the authors' prior work, or ansatzes smuggled via citation. The central result is an empirical and methodological combination whose performance is tested against external benchmarks, making the derivation self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard survey sampling assumptions such as known inclusion probabilities and availability of auxiliary variables for stabilization and raking. No new free parameters, invented entities, or ad-hoc axioms are introduced based on the abstract.

axioms (1)

domain assumption Sampling probabilities are known and correctly specified; auxiliary variables are available and related to weight variation.
Implicit foundation for all weighting and raking methods in complex surveys.

pith-pipeline@v0.9.0 · 5680 in / 1140 out tokens · 85855 ms · 2026-05-20T16:26:28.105708+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

stabilised weights ... q(Xi, Zi)∝ E(e²i |Xi,Zi) / E(di e²i |Xi,Zi)
IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

two-phase sampling ... generalised raking ... calibration constraints

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

41 extracted references · 41 canonical work pages

[1]

Newey , journal =

Whitney K. Newey , journal =. The Asymptotic Variance of Semiparametric Estimators , urldate =

work page
[2]

1998 , journal =

Improving survey‐weighted least squares regression , author =. 1998 , journal =

work page 1998
[3]

Sankhya: The Indian Journal of Statistics, Series B , volume=

Parametric and semi-parametric estimation of regression models fitted to survey data , author=. Sankhya: The Indian Journal of Statistics, Series B , volume=

work page
[4]

Epidemiology , volume=

Marginal Structural Models and Causal Inference in Epidemiology , author=. Epidemiology , volume=. 2000 , publisher=

work page 2000
[5]

Fitting Regression Models to Survey Data , urldate =

Thomas Lumley and Alastair Scott , journal =. Fitting Regression Models to Survey Data , urldate =

work page
[6]

Weighting in the regression analysis of survey data with a cross-national application , urldate =

Chris Skinner and Ben Mason , journal =. Weighting in the regression analysis of survey data with a cross-national application , urldate =

work page
[7]

Journal of the Royal Statistical Society Series A: Statistics in Society , volume =

Chambers, Ray and Ranjbar, Setareh and Salvati, Nicola and Pacini, Barbara , title = ". Journal of the Royal Statistical Society Series A: Statistics in Society , volume =

work page
[8]

International Statistical Review , volume=

Connections between survey calibration estimators and semiparametric models for incomplete data , author=. International Statistical Review , volume=. 2011 , publisher=

work page 2011
[9]

Journal of the American Statistical Association , volume=

Estimation of regression coefficients when some regressors are not always observed , author=. Journal of the American Statistical Association , volume=. 1994 , publisher=

work page 1994
[10]

Journal of the American Statistical Association , volume=

Calibration estimators in survey sampling , author=. Journal of the American Statistical Association , volume=. 1992 , publisher=

work page 1992
[11]

Data validation in multinational observational studies with error-prone data: applying an optimal validation sampling strategy in a study of

Amorim, Gustavo and Slone, Joshua and Semeere, Aggrey and Diero, Lameck and Otero, Larissa and Crabtree-Ramirez, Brenda and Tao, Ran and Duda, Stephany N and Musick, Beverly and Yiannoutsos, Constantin and Lumley, Thomas and Shaw, Pamela A and Shepherd, Bryan E , journal=. Data validation in multinational observational studies with error-prone data: apply...

work page
[12]

Analysis approaches to combine error-prone data with a subset of validated data: an application to a multinational study of

Slone, Joshua and Amorim, Gustavo and Semeere, Aggrey and Diero, Lameck and Otero, Larissa and Crabtree-Ramirez, Brenda and Tao, Ran and Duda, Stephany N and Musick, Beverly and Yiannoutsos, Constantin and Lumley, Thomas and Shaw, Pamela A and Shepherd, Bryan E , journal=. Analysis approaches to combine error-prone data with a subset of validated data: an...

work page
[13]

American Journal of Epidemiology , volume=

Using the whole cohort in the analysis of case-cohort data , author=. American Journal of Epidemiology , volume=. 2009 , publisher=

work page 2009
[14]

Statistics in Biosciences , volume=

Using the whole cohort in the analysis of case-control data , author=. Statistics in Biosciences , volume=. 2013 , publisher=

work page 2013
[15]

and Han, Kyunghee and Chen, Tong and Bian, Aihua and Pugh, Shannon and Duda, Stephany N

Shepherd, Bryan E. and Han, Kyunghee and Chen, Tong and Bian, Aihua and Pugh, Shannon and Duda, Stephany N. and Lumley, Thomas and Heerman, William J. and Shaw, Pamela A. , title = ". Biometrics , volume =

work page
[16]

Case-control studies with complex sampling , volume =

Alastair Scott and Chris Wild , journal =. Case-control studies with complex sampling , volume =

work page
[17]

Roderick J. A. Little , journal =. Models for nonresponse in sample surveys , volume =

work page
[18]

Statistical Science , number =

Andrew Gelman , title =. Statistical Science , number =

work page
[19]

Neyman , title =

J. Neyman , title =. Journal of the American Statistical Association , volume =

work page
[20]

D. G. Horvitz and D. J. Thompson , title =. Journal of the American Statistical Association , volume =

work page
[21]

Generalized raking procedures in survey sampling , volume =

Jean-Claude Deville and Carl-Erik Särndal and Olivier Sautory , journal =. Generalized raking procedures in survey sampling , volume =

work page
[22]

and Wild, C

Scott, A. and Wild, C. , title = ". Biometrika , volume =

work page
[23]

Journal of the American Statistical Association , volume =

Ran Tao and Donglin Zeng and Dan-Yu Lin , title =. Journal of the American Statistical Association , volume =

work page
[24]

Holt and T

D. Holt and T. M. F. Smith and P. D. Winter , journal =. Regression analysis of data from complex surveys , volume =

work page
[25]

Scandinavian Journal of Statistics , volume =

Samuelsen, Sven Ove and Ånestad, Hallvard and Skrondal, Anders , title =. Scandinavian Journal of Statistics , volume =

work page
[26]

American Journal of Epidemiology , volume =

Ganna, Andrea and Reilly, Marie and de Faire, Ulf and Pedersen, Nancy and Magnusson, Patrik and Ingelsson, Erik , title =. American Journal of Epidemiology , volume =

work page
[27]

Statistical Methods in Medical Research , volume =

Hisashi Noma and Shiro Tanaka , title =. Statistical Methods in Medical Research , volume =

work page
[28]

Statistics in Medicine , volume =

Chen, Tong and Lumley, Thomas , title =. Statistics in Medicine , volume =

work page
[29]

Y Lin , title =

Michal Kulich and D. Y Lin , title =. Journal of the American Statistical Association , volume =

work page
[30]

Statistics in Medicine , volume =

Metcalf, Patricia and Scott, Alastair , title =. Statistics in Medicine , volume =

work page
[31]

J. A. Anderson , journal =. Separate sample logistic discrimination , volume =

work page
[32]

R. L. Prentice and R. Pyke , journal =. Logistic disease incidence models and case-control studies , volume =

work page
[33]

2010 , author =

Complex Surveys: A Guide to Analysis Using R: A Guide to Analysis Using R , publisher =. 2010 , author =

work page 2010
[34]

2024 , url =

R: A Language and Environment for Statistical Computing , author =. 2024 , url =

work page 2024
[35]

and Lumley, Thomas and Shepherd, Bryan E

Amorim, Gustavo and Tao, Ran and Lotspeich, Sarah and Shaw, Pamela A. and Lumley, Thomas and Shepherd, Bryan E. , title =. Journal of the Royal Statistical Society Series A: Statistics in Society , volume =

work page
[36]

and Cook, Richard J

McIsaac, Michael A. and Cook, Richard J. , title =. Statistics in Medicine , volume =

work page
[37]

Statistical Science , year =

Hastie, Trevor and Tibshirani, Robert , title =. Statistical Science , year =

work page
[38]

Friedman, J. H. , title =. 1984 , number =

work page 1984
[39]

2009 , publisher =

Sampling Statistics , author =. 2009 , publisher =

work page 2009
[40]

and Lumley, Thomas , title =

Han, Kyunghee and Shaw, Pamela A. and Lumley, Thomas , title =. Statistics in Medicine , volume =

work page
[41]

Breslow, N. E. and Cain, K. C. , title =. Biometrika , volume =

work page

[1] [1]

Newey , journal =

Whitney K. Newey , journal =. The Asymptotic Variance of Semiparametric Estimators , urldate =

work page

[2] [2]

1998 , journal =

Improving survey‐weighted least squares regression , author =. 1998 , journal =

work page 1998

[3] [3]

Sankhya: The Indian Journal of Statistics, Series B , volume=

Parametric and semi-parametric estimation of regression models fitted to survey data , author=. Sankhya: The Indian Journal of Statistics, Series B , volume=

work page

[4] [4]

Epidemiology , volume=

Marginal Structural Models and Causal Inference in Epidemiology , author=. Epidemiology , volume=. 2000 , publisher=

work page 2000

[5] [5]

Fitting Regression Models to Survey Data , urldate =

Thomas Lumley and Alastair Scott , journal =. Fitting Regression Models to Survey Data , urldate =

work page

[6] [6]

Weighting in the regression analysis of survey data with a cross-national application , urldate =

Chris Skinner and Ben Mason , journal =. Weighting in the regression analysis of survey data with a cross-national application , urldate =

work page

[7] [7]

Journal of the Royal Statistical Society Series A: Statistics in Society , volume =

Chambers, Ray and Ranjbar, Setareh and Salvati, Nicola and Pacini, Barbara , title = ". Journal of the Royal Statistical Society Series A: Statistics in Society , volume =

work page

[8] [8]

International Statistical Review , volume=

Connections between survey calibration estimators and semiparametric models for incomplete data , author=. International Statistical Review , volume=. 2011 , publisher=

work page 2011

[9] [9]

Journal of the American Statistical Association , volume=

Estimation of regression coefficients when some regressors are not always observed , author=. Journal of the American Statistical Association , volume=. 1994 , publisher=

work page 1994

[10] [10]

Journal of the American Statistical Association , volume=

Calibration estimators in survey sampling , author=. Journal of the American Statistical Association , volume=. 1992 , publisher=

work page 1992

[11] [11]

Data validation in multinational observational studies with error-prone data: applying an optimal validation sampling strategy in a study of

Amorim, Gustavo and Slone, Joshua and Semeere, Aggrey and Diero, Lameck and Otero, Larissa and Crabtree-Ramirez, Brenda and Tao, Ran and Duda, Stephany N and Musick, Beverly and Yiannoutsos, Constantin and Lumley, Thomas and Shaw, Pamela A and Shepherd, Bryan E , journal=. Data validation in multinational observational studies with error-prone data: apply...

work page

[12] [12]

Analysis approaches to combine error-prone data with a subset of validated data: an application to a multinational study of

Slone, Joshua and Amorim, Gustavo and Semeere, Aggrey and Diero, Lameck and Otero, Larissa and Crabtree-Ramirez, Brenda and Tao, Ran and Duda, Stephany N and Musick, Beverly and Yiannoutsos, Constantin and Lumley, Thomas and Shaw, Pamela A and Shepherd, Bryan E , journal=. Analysis approaches to combine error-prone data with a subset of validated data: an...

work page

[13] [13]

American Journal of Epidemiology , volume=

Using the whole cohort in the analysis of case-cohort data , author=. American Journal of Epidemiology , volume=. 2009 , publisher=

work page 2009

[14] [14]

Statistics in Biosciences , volume=

Using the whole cohort in the analysis of case-control data , author=. Statistics in Biosciences , volume=. 2013 , publisher=

work page 2013

[15] [15]

and Han, Kyunghee and Chen, Tong and Bian, Aihua and Pugh, Shannon and Duda, Stephany N

Shepherd, Bryan E. and Han, Kyunghee and Chen, Tong and Bian, Aihua and Pugh, Shannon and Duda, Stephany N. and Lumley, Thomas and Heerman, William J. and Shaw, Pamela A. , title = ". Biometrics , volume =

work page

[16] [16]

Case-control studies with complex sampling , volume =

Alastair Scott and Chris Wild , journal =. Case-control studies with complex sampling , volume =

work page

[17] [17]

Roderick J. A. Little , journal =. Models for nonresponse in sample surveys , volume =

work page

[18] [18]

Statistical Science , number =

Andrew Gelman , title =. Statistical Science , number =

work page

[19] [19]

Neyman , title =

J. Neyman , title =. Journal of the American Statistical Association , volume =

work page

[20] [20]

D. G. Horvitz and D. J. Thompson , title =. Journal of the American Statistical Association , volume =

work page

[21] [21]

Generalized raking procedures in survey sampling , volume =

Jean-Claude Deville and Carl-Erik Särndal and Olivier Sautory , journal =. Generalized raking procedures in survey sampling , volume =

work page

[22] [22]

and Wild, C

Scott, A. and Wild, C. , title = ". Biometrika , volume =

work page

[23] [23]

Journal of the American Statistical Association , volume =

Ran Tao and Donglin Zeng and Dan-Yu Lin , title =. Journal of the American Statistical Association , volume =

work page

[24] [24]

Holt and T

D. Holt and T. M. F. Smith and P. D. Winter , journal =. Regression analysis of data from complex surveys , volume =

work page

[25] [25]

Scandinavian Journal of Statistics , volume =

Samuelsen, Sven Ove and Ånestad, Hallvard and Skrondal, Anders , title =. Scandinavian Journal of Statistics , volume =

work page

[26] [26]

American Journal of Epidemiology , volume =

Ganna, Andrea and Reilly, Marie and de Faire, Ulf and Pedersen, Nancy and Magnusson, Patrik and Ingelsson, Erik , title =. American Journal of Epidemiology , volume =

work page

[27] [27]

Statistical Methods in Medical Research , volume =

Hisashi Noma and Shiro Tanaka , title =. Statistical Methods in Medical Research , volume =

work page

[28] [28]

Statistics in Medicine , volume =

Chen, Tong and Lumley, Thomas , title =. Statistics in Medicine , volume =

work page

[29] [29]

Y Lin , title =

Michal Kulich and D. Y Lin , title =. Journal of the American Statistical Association , volume =

work page

[30] [30]

Statistics in Medicine , volume =

Metcalf, Patricia and Scott, Alastair , title =. Statistics in Medicine , volume =

work page

[31] [31]

J. A. Anderson , journal =. Separate sample logistic discrimination , volume =

work page

[32] [32]

R. L. Prentice and R. Pyke , journal =. Logistic disease incidence models and case-control studies , volume =

work page

[33] [33]

2010 , author =

Complex Surveys: A Guide to Analysis Using R: A Guide to Analysis Using R , publisher =. 2010 , author =

work page 2010

[34] [34]

2024 , url =

R: A Language and Environment for Statistical Computing , author =. 2024 , url =

work page 2024

[35] [35]

and Lumley, Thomas and Shepherd, Bryan E

Amorim, Gustavo and Tao, Ran and Lotspeich, Sarah and Shaw, Pamela A. and Lumley, Thomas and Shepherd, Bryan E. , title =. Journal of the Royal Statistical Society Series A: Statistics in Society , volume =

work page

[36] [36]

and Cook, Richard J

McIsaac, Michael A. and Cook, Richard J. , title =. Statistics in Medicine , volume =

work page

[37] [37]

Statistical Science , year =

Hastie, Trevor and Tibshirani, Robert , title =. Statistical Science , year =

work page

[38] [38]

Friedman, J. H. , title =. 1984 , number =

work page 1984

[39] [39]

2009 , publisher =

Sampling Statistics , author =. 2009 , publisher =

work page 2009

[40] [40]

and Lumley, Thomas , title =

Han, Kyunghee and Shaw, Pamela A. and Lumley, Thomas , title =. Statistics in Medicine , volume =

work page

[41] [41]

Breslow, N. E. and Cain, K. C. , title =. Biometrika , volume =

work page