A Model-Robust G-Computation Method for Analyzing Hybrid Control Studies Without Assuming Exchangeability

Peisong Han; Wei Zhang; Zhiwei Zhang

arxiv: 2602.02809 · v2 · submitted 2026-02-02 · 📊 stat.ME

A Model-Robust G-Computation Method for Analyzing Hybrid Control Studies Without Assuming Exchangeability

Zhiwei Zhang , Peisong Han , Wei Zhang This is my paper

Pith reviewed 2026-05-16 07:56 UTC · model grok-4.3

classification 📊 stat.ME

keywords hybrid control designg-computationmodel robustnessvariable selectionexternal controlsexchangeabilitycausal inferenceoutcome regression

0 comments

The pith

A g-computation method with variable selection stays consistent for hybrid control studies even if the outcome model is misspecified and without assuming exchangeability between internal and external controls.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a model-robust g-computation approach for hybrid control designs that combine a randomized trial with external control data. It identifies that one specific version of g-computation, which uses variable selection in the outcome regression, protects against bias from outcome model misspecification. This yields a simple estimator that is consistent and asymptotically normal under minimal assumptions while gaining efficiency from similarities across the two control groups. A sympathetic reader would care because hybrid designs promise smaller or faster trials but introduce bias risks that standard methods cannot handle without strong exchangeability assumptions.

Core claim

A particular version of the g-computation method with variable selection is protected against misspecification of the outcome regression model. This observation produces a model-robust g-computation estimator that is remarkably simple to implement, consistent and asymptotically normal under minimal assumptions, and able to improve efficiency by exploiting similarities between the internal and external control groups.

What carries the argument

The g-computation estimator that applies variable selection when fitting the outcome regression model to adjust for baseline covariates related to the control outcome.

If this is right

The method requires no assumption that internal and external control outcomes are exchangeable after conditioning on measured covariates.
The estimator remains consistent and asymptotically normal under minimal assumptions.
Efficiency gains occur by borrowing strength from similarities between the internal and external control groups.
The procedure is simple enough to implement with standard software.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could be applied directly to other settings where external data are fused with trials but full exchangeability is doubtful.
It suggests that variable selection itself can serve as a built-in robustness device in causal estimators that average predicted outcomes.
Regulatory analyses of hybrid designs may adopt the method to reduce reliance on unverifiable exchangeability claims.

Load-bearing premise

The protection against outcome model misspecification holds only for one particular version of g-computation that includes variable selection.

What would settle it

A simulation in which the outcome regression is deliberately misspecified, exchangeability between internal and external controls is violated, and the proposed estimator nevertheless converges to the true treatment effect as sample size grows.

read the original abstract

There is growing interest in a hybrid control design for treatment evaluation, where a randomized controlled trial is augmented with external control data from a previous trial or a real world data source. The hybrid control design has the potential to improve efficiency but also carries the risk of introducing bias. The potential bias in a hybrid control study can be mitigated by adjusting for baseline covariates that are related to the control outcome. Existing methods that serve this purpose commonly assume that the internal and external control outcomes are exchangeable upon conditioning on a set of measured covariates. Possible violations of the exchangeability assumption can be addressed using a g-computation method with variable selection under a correctly specified outcome regression model. In this article, we note that a particular version of this g-computation method is protected against misspecification of the outcome regression model. This observation leads to a model-robust g-computation method that is remarkably simple and easy to implement, consistent and asymptotically normal under minimal assumptions, and able to improve efficiency by exploiting similarities between the internal and external control groups. The method is evaluated in a simulation study and illustrated using real data from HIV treatment trials.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper claims a g-computation estimator for hybrid controls stays consistent without exchangeability by using one specific variable-selection version that protects against outcome-model misspecification, but the exact conditions for that protection are not visible in the abstract.

read the letter

The key takeaway is that this paper claims a simple g-computation tweak with variable selection gives model robustness for hybrid controls without needing exchangeability between internal and external groups. It builds on existing g-computation ideas but points out that one version stays consistent even if the outcome model is misspecified. That is the main novelty, and they demonstrate it through simulations and a real-data example from HIV trials. The method looks straightforward to code up and can borrow information from external controls to gain efficiency when the groups are similar enough. The simulations seem to support the efficiency claims under various scenarios. The real data illustration helps show practical applicability. The potential weak point is whether the variable selection step reliably delivers the promised protection. The abstract mentions minimal assumptions but does not detail the selection algorithm or prove why consistency holds when the model is wrong. If the selection depends on tuning parameters that might miss important covariates, the robustness may not be as automatic as stated. The full paper needs to lay out the exact conditions clearly. This matches the stress-test concern that the central claim rests on unstated details of the procedure. This work is for people in biostatistics who handle hybrid trial designs or external control borrowing. It deserves a serious referee because the problem is timely and the proposed fix is concrete, even if some technical details will likely need expansion during review.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes a model-robust g-computation method for hybrid control studies, where an RCT is augmented with external control data. It claims that a particular version of g-computation combined with variable selection remains consistent for the average treatment effect even under outcome regression misspecification, without requiring the exchangeability assumption between internal and external controls. The resulting estimator is asserted to be consistent and asymptotically normal under minimal assumptions, simple to implement, and capable of efficiency gains by pooling similar control groups. The method is evaluated in simulations and illustrated with HIV treatment trial data.

Significance. If the claimed robustness property holds, the approach would provide a practical tool for incorporating external controls in hybrid designs while reducing bias risk from non-exchangeability, potentially increasing statistical efficiency in settings with limited internal controls. This could be particularly useful in medical statistics for real-world evidence integration, provided the minimal assumptions are weaker than standard exchangeability.

major comments (2)

[Abstract and §3] Abstract and §3 (Methods): The central claim that a specific g-computation-plus-variable-selection procedure is protected against outcome model misspecification (and thus consistent without exchangeability) is load-bearing for the entire contribution, yet the abstract supplies no equation, algorithm, or regularity conditions for the variable selection step (e.g., penalty, tuning, or post-selection inference). Without these, it is impossible to verify whether the protection is automatic or requires correct selection with probability approaching 1, as noted in the stress-test concern.
[§4] §4 (Asymptotics): The assertion of asymptotic normality under 'minimal assumptions' requires an explicit statement of those assumptions and at least a proof sketch or key expansion steps; the current presentation leaves the reader unable to confirm whether the expansion implicitly relies on correct model selection or other unstated conditions that would undermine the 'model-robust' label.

minor comments (2)

[§5] §5 (Simulations): The simulation design should explicitly report the variable selection method used (e.g., LASSO with CV) and the exact covariate sets in the data-generating process to allow readers to assess whether the reported efficiency gains are robust to different selection behaviors.
[Notation] Notation throughout: Define all symbols (e.g., the precise form of the g-computation functional and the selected covariate set) at first use to improve readability for readers unfamiliar with hybrid-control extensions of g-computation.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We appreciate the referee's detailed review of our manuscript on the model-robust g-computation method for hybrid control studies. The comments highlight important areas for clarification regarding the variable selection procedure and asymptotic theory. We address each point below and will incorporate revisions to enhance the manuscript's clarity and rigor.

read point-by-point responses

Referee: [Abstract and §3] Abstract and §3 (Methods): The central claim that a specific g-computation-plus-variable-selection procedure is protected against outcome model misspecification (and thus consistent without exchangeability) is load-bearing for the entire contribution, yet the abstract supplies no equation, algorithm, or regularity conditions for the variable selection step (e.g., penalty, tuning, or post-selection inference). Without these, it is impossible to verify whether the protection is automatic or requires correct selection with probability approaching 1, as noted in the stress-test concern.

Authors: We thank the referee for pointing this out. The protection against misspecification arises because the g-computation estimator, when combined with variable selection that includes all necessary covariates for the control outcome, targets the correct marginal mean even under misspecification of the functional form. In the revised version, we will expand the abstract to include a brief description of the variable selection step (using L1-penalized regression with cross-validation) and add equations in §3 detailing the algorithm. We will also specify the regularity conditions, such as the selection consistency rate, to clarify that the robustness does not require perfect recovery of the true model but rather sufficient covariate inclusion. This addresses the concern about whether it is automatic. revision: yes
Referee: [§4] §4 (Asymptotics): The assertion of asymptotic normality under 'minimal assumptions' requires an explicit statement of those assumptions and at least a proof sketch or key expansion steps; the current presentation leaves the reader unable to confirm whether the expansion implicitly relies on correct model selection or other unstated conditions that would undermine the 'model-robust' label.

Authors: We agree that more detail is needed here. The minimal assumptions include standard regularity conditions for M-estimators (e.g., differentiability, bounded variance) plus conditions on the variable selection ensuring that the selected model spans the necessary space for unbiased g-computation. In the revision, we will explicitly list these assumptions in §4 and provide a proof sketch using Taylor expansion of the estimating equations, demonstrating that the bias term vanishes due to the robustness property without requiring the outcome model to be correctly specified. This will confirm that the asymptotic normality holds under the stated minimal assumptions without hidden reliance on correct selection. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation builds on standard g-computation without reducing to self-fit or self-citation

full rationale

The paper observes that a particular version of g-computation with variable selection is protected against outcome-model misspecification and uses this to motivate a model-robust estimator for hybrid controls. No equation or step equates the target parameter to a fitted quantity by construction, nor does the central claim rest on a self-citation chain whose validity is presupposed. The method is presented as consistent under minimal assumptions, with simulation and real-data evaluation providing external checks. This yields a low circularity score consistent with honest non-findings for papers whose core logic remains independent of its own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; the method is described as relying on minimal assumptions whose details are not supplied.

pith-pipeline@v0.9.0 · 5501 in / 1056 out tokens · 23702 ms · 2026-05-16T07:56:39.628537+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

40 extracted references · 40 canonical work pages

[1]

Massicotte P, Julian JA, Gent M, Shields K, Marzinotto V, Szecht man B et al. (2003). An open-label randomized controlled trial of low molecular weight heparin compared to heparin and coumadin for the treatment of venous thromboembolic events in children: the REVIVE trial.Thrombosis Research, 109, 85–92

work page 2003
[2]

Jansen-Van Der Weide MC, Gaasterland CM, Roes KC, Pontes C, Vives R Sancho A et al. (2018). Rare disease registries: potential applications towards impact on development of new drug treatments.Orphanet Journal of Rare Diseases, 13, 1–11

work page 2018
[3]

https://www.fda.gov/media/164960/download

Food and Drug Administration (2023a).Guidance for Industry: Considerations for the Design and Conduct of Externally Controlled Trials for Drug and Biological Products. https://www.fda.gov/media/164960/download

work page
[4]

Power prior distributions for regression models.Statistical Science, 15, 46–60

Ibrahim JG, Chen MH (2000). Power prior distributions for regression models.Statistical Science, 15, 46–60

work page 2000
[5]

Summarizing historical information on controls in clinical trials.Clinical Trials, 7, 5–18

Neuenschwander B, Capkun-Niggli G, Branson M, Spiegelhalter DJ (2010). Summarizing historical information on controls in clinical trials.Clinical Trials, 7, 5–18

work page 2010
[6]

Augmenting control arms with real-world data for cancer trials: Hybrid control arm methods and considerations

Tan WK, Segal BD, Curtis MD, Baxi SS, Capra WB, Garrett-Mayer E, Hobbs BP, Hong DS, Hubbard RA, Zhu J, Sarkar S, Samant M (2022). Augmenting control arms with real-world data for cancer trials: Hybrid control arm methods and considerations. Contemporary Clinical Trials Communications, 30, 101000

work page 2022
[7]

A frequentist approach to dynamic borrow- ing.Biometrical Journal, 65, e2100406

Li R, Lin R, Huang J, Tian L, Zhu J (2023). A frequentist approach to dynamic borrow- ing.Biometrical Journal, 65, e2100406

work page 2023
[8]

Propensity score- integrated power prior approach for incorporating real-world evidence in single-arm clinical studies.Journal of Biopharmaceutical Statistics, 29, 731–748

Wang C, Li H, Chen WC, Lu N, Tiwari R, Xu Y, Yue LQ (2019). Propensity score- integrated power prior approach for incorporating real-world evidence in single-arm clinical studies.Journal of Biopharmaceutical Statistics, 29, 731–748

work page 2019
[9]

Chen WC, Wang C, Li H, Lu N, Tiwari R, Xu Y, Yue LQ (2020). Propensity score- integrated composite likelihood approach for augmenting the control arm of a randomized controlled trial by incorporating real-world data.Journal of Biopharmaceutical Statistics, 30, 508–520

work page 2020
[10]

Lu N, Wang C, Chen WC, Li H, Song C, Tiwari R, Xu Y, Yue LQ (2022). Propensity score-integrated power prior approach for augmenting the control arm of a randomized controlled trial by incorporating multiple external data sources.Journal of Biopharma- ceutical Statistics, 32, 158–169. 18

work page 2022
[11]

Covariate handling approaches in combination with dynamic borrowing for hybrid control studies.Pharmaceutical Statistics, 22, 619–632

Fu C, Pang H, Zhou S, Zhu J (2023). Covariate handling approaches in combination with dynamic borrowing for hybrid control studies.Pharmaceutical Statistics, 22, 619–632

work page 2023
[12]

A propensity-score integrated approach to Bayesian dynamic power prior borrowing.Statistics in Biopharmaceutical Research, 16, 182–191

Wang J, Zhang H, Tiwari R (2023). A propensity-score integrated approach to Bayesian dynamic power prior borrowing.Statistics in Biopharmaceutical Research, 16, 182–191

work page 2023
[13]

Hierarchical commensurate and power prior models for adaptive incorporation of historical information in clinical trials.Biometrics, 67, 1047–1056

Hobbs BP, Carlin BP, Mandrekar SJ, Sargent DJ (2011). Hierarchical commensurate and power prior models for adaptive incorporation of historical information in clinical trials.Biometrics, 67, 1047–1056

work page 2011
[14]

Use of historical control data for assessing treatment effects in clinical trials.Pharmaceutical Statistics, 13, 41–54

Viele K, Berry S, Neuenschwander B, Amzal B, Chen F, Enas N, Hobbs B, Ibrahim JG, Kinnersley N, Lindborg S, Micallef S, Roychoudhury S, Thompson L (2014). Use of historical control data for assessing treatment effects in clinical trials.Pharmaceutical Statistics, 13, 41–54

work page 2014
[15]

Adaptive power priors with empirical Bayes for clinical trials.Pharmaceutical Statistics, 16, 349–360

Gravestock I, Held L (2017). Adaptive power priors with empirical Bayes for clinical trials.Pharmaceutical Statistics, 16, 349–360

work page 2017
[16]

A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect

Robins JM (1986). A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect. Mathematical Modelling, 7, 1393–1512

work page 1986
[17]

Reducing bias in observational studies using sub- classification on the propensity score.Journal of the American Statistical Association, 79, 516–524

Rosenbaum PR, Rubin DB (1984). Reducing bias in observational studies using sub- classification on the propensity score.Journal of the American Statistical Association, 79, 516–524

work page 1984
[18]

Constructing a control group using multivariate matched sampling methods that incorporate the propensity score.The American Statis- tician, 39, 33–38

Rosenbaum PR, Rubin DB (1985). Constructing a control group using multivariate matched sampling methods that incorporate the propensity score.The American Statis- tician, 39, 33–38

work page 1985
[19]

Marginal structural models and causal inference in epidemiology.Epidemiology, 11, 550–560

Robins JM, Hernan MA, Brumback B (2000). Marginal structural models and causal inference in epidemiology.Epidemiology, 11, 550–560

work page 2000
[20]

Spring-Verlag, New York

van der Laan MJ, Robins JM (2003).Unified Methods for Censored Longitudinal Data and Causality. Spring-Verlag, New York

work page 2003
[21]

Springer, New York

van der Laan MJ, Rose S (2011).Targeted Learning: Causal Inference for Observational and Experimental Data. Springer, New York

work page 2011
[22]

The central role of the propensity score in ob- servational studies for causal effects.Biometrika, 70, 41–55

Rosenbaum PR and Rubin DB (1983). The central role of the propensity score in ob- servational studies for causal effects.Biometrika, 70, 41–55

work page 1983
[23]

Zhang Z, Liu J, Liu W (2025a). Outcome regression methods for analyzing hybrid control studies: Balancing bias and variability.Statistics in Biopharmaceutical Research, https://doi.org/10.1080/19466315.2025.2537066

work page doi:10.1080/19466315.2025.2537066 2025
[24]

Addressing non-exchangeability in hy- brid control studies: A variable selection approach.Pharmaceutical Statistics, https://doi.org/10.1002/pst.70056

Zhang Z, Liu J, Han P (2025b). Addressing non-exchangeability in hy- brid control studies: A variable selection approach.Pharmaceutical Statistics, https://doi.org/10.1002/pst.70056. 19

work page doi:10.1002/pst.70056
[25]

Improving efficiency of inference in clinical trials with external control data.Biometrics, 79, 394–403

Li X, Miao W, Lu F, Zhou XH (2023). Improving efficiency of inference in clinical trials with external control data.Biometrics, 79, 394–403

work page 2023
[26]

A causal inference framework for leveraging external controls in hybrid trials.Biometrics, 80(4), ujae095

Valancius M, Pang H, Zhu J, Cole SR, Funk MJ, Kosorok MR (2024). A causal inference framework for leveraging external controls in hybrid trials.Biometrics, 80(4), ujae095

work page 2024
[27]

Improving randomized controlled trial analysis via data-adaptive borrowing.Biometrika, 112, asae069

Gao C, Yang S, Shan M, Ye W, Lipkovich I, Faries D (2025). Improving randomized controlled trial analysis via data-adaptive borrowing.Biometrika, 112, asae069

work page 2025
[28]

Adaptive data-borrowing for improving treatment effect estimation using external controls

Yang Q, Li J, Wu P (2025). Adaptive data-borrowing for improving treatment effect estimation using external controls. InAdvances in Neural Information Processing Systems (NeurIPS 2025)

work page 2025
[29]

The adaptive lasso and its oracle properties.Journal of the American Statistical Association, 101, 1418–1429

Zou H (2006). The adaptive lasso and its oracle properties.Journal of the American Statistical Association, 101, 1418–1429

work page 2006
[30]

On the robustness of the adaptive lasso to model misspecification.Biometrika, 99, 717–731

Lu W, Goldberg Y, Fine JP (2012). On the robustness of the adaptive lasso to model misspecification.Biometrika, 99, 717–731

work page 2012
[31]

Covariate adjustment in randomized trials with binary outcomes: targeted maximum likelihood estimation.Statistics in Medicine, 28, 39–64

Moore KL, van der Laan MJ (2009). Covariate adjustment in randomized trials with binary outcomes: targeted maximum likelihood estimation.Statistics in Medicine, 28, 39–64

work page 2009
[32]

Toward better practice of covariate adjustment in analyzing randomized clinical trials.Journal of the American Statistical Association, 118, 2370–2382

Ye T, Shao J, Yi Y, Zhao Q (2023). Toward better practice of covariate adjustment in analyzing randomized clinical trials.Journal of the American Statistical Association, 118, 2370–2382

work page 2023
[33]

Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/adjusting- covariates-randomized-clinical-trials-drugs-and-biological-products

Food and Drug Administration (2023b).Guidance for Industry: Adjusting for co- variates in randomized clinical trials for drugs and biological products. Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/adjusting- covariates-randomized-clinical-trials-drugs-and-biological-products

work page
[34]

Conditional estimation and inference to address observed covariate imbalance in randomized clinical trials.Clinical Trials, 16, 122–131

Zhang Z, Tang L, Liu C, Berger VW (2019). Conditional estimation and inference to address observed covariate imbalance in randomized clinical trials.Clinical Trials, 16, 122–131

work page 2019
[35]

Cambridge University Press, Cam- bridge, UK

van der Vaart AW (1998).Asymptotic Statistics. Cambridge University Press, Cam- bridge, UK

work page 1998
[36]

Schuler A, Walsh D, Hall D, Walsh J, Fisher C, Critical Path for Alzheimer’s Dis- ease, Alzheimer’s Disease Neuroimaging Initiative, Alzheimer’s Disease Cooperative Study (2021) Increasing the efficiency of randomized trial estimates via linear adjustment for a prognostic score.International Journal of Biostatistics, 18, 329–356

work page 2021
[37]

Covariate adjustment for two-sample treatment comparisons in randomized clinical trials: a principled yet flexible approach

Tsiatis AA, Davidian M, Zhang M, Lu X (2008). Covariate adjustment for two-sample treatment comparisons in randomized clinical trials: a principled yet flexible approach. Statistics in Medicine, 27, 4658–4677. 20

work page 2008
[38]

Merigan TC, Amato DA, Balsley J et al. (1991). Placebo-controlled trial to evaluate zidovudine in treatment of human immunodeficiency virus infection in asymptomatic pa- tients with hemophilia.Blood, 78, 900–906

work page 1991
[39]

Volberding PA, Lagakos SW, Koch MA et al. (1990). Zidovudine in asymptomatic human immunodeficiency virus infection—a controlled trial in persons with fewer than 500 CD4-positive cells per cubic millimeter.New England Journal of Medicine, 322, 941– 949

work page 1990
[40]

Springer-Verlag, New York

van der Vaart AW, Wellner JA (1996).Weak Convergence and Empirical Processes with Applications to Statistics. Springer-Verlag, New York. 21 Table 1: Simulation results in Scenario A (continuous outcome, correct working models): empirical bias, standard deviation (SD), and coverage proportion (CP) for estimating (µ 0, δ) using six different estimation meth...

work page 1996

[1] [1]

Massicotte P, Julian JA, Gent M, Shields K, Marzinotto V, Szecht man B et al. (2003). An open-label randomized controlled trial of low molecular weight heparin compared to heparin and coumadin for the treatment of venous thromboembolic events in children: the REVIVE trial.Thrombosis Research, 109, 85–92

work page 2003

[2] [2]

Jansen-Van Der Weide MC, Gaasterland CM, Roes KC, Pontes C, Vives R Sancho A et al. (2018). Rare disease registries: potential applications towards impact on development of new drug treatments.Orphanet Journal of Rare Diseases, 13, 1–11

work page 2018

[3] [3]

https://www.fda.gov/media/164960/download

Food and Drug Administration (2023a).Guidance for Industry: Considerations for the Design and Conduct of Externally Controlled Trials for Drug and Biological Products. https://www.fda.gov/media/164960/download

work page

[4] [4]

Power prior distributions for regression models.Statistical Science, 15, 46–60

Ibrahim JG, Chen MH (2000). Power prior distributions for regression models.Statistical Science, 15, 46–60

work page 2000

[5] [5]

Summarizing historical information on controls in clinical trials.Clinical Trials, 7, 5–18

Neuenschwander B, Capkun-Niggli G, Branson M, Spiegelhalter DJ (2010). Summarizing historical information on controls in clinical trials.Clinical Trials, 7, 5–18

work page 2010

[6] [6]

Augmenting control arms with real-world data for cancer trials: Hybrid control arm methods and considerations

Tan WK, Segal BD, Curtis MD, Baxi SS, Capra WB, Garrett-Mayer E, Hobbs BP, Hong DS, Hubbard RA, Zhu J, Sarkar S, Samant M (2022). Augmenting control arms with real-world data for cancer trials: Hybrid control arm methods and considerations. Contemporary Clinical Trials Communications, 30, 101000

work page 2022

[7] [7]

A frequentist approach to dynamic borrow- ing.Biometrical Journal, 65, e2100406

Li R, Lin R, Huang J, Tian L, Zhu J (2023). A frequentist approach to dynamic borrow- ing.Biometrical Journal, 65, e2100406

work page 2023

[8] [8]

Propensity score- integrated power prior approach for incorporating real-world evidence in single-arm clinical studies.Journal of Biopharmaceutical Statistics, 29, 731–748

Wang C, Li H, Chen WC, Lu N, Tiwari R, Xu Y, Yue LQ (2019). Propensity score- integrated power prior approach for incorporating real-world evidence in single-arm clinical studies.Journal of Biopharmaceutical Statistics, 29, 731–748

work page 2019

[9] [9]

Chen WC, Wang C, Li H, Lu N, Tiwari R, Xu Y, Yue LQ (2020). Propensity score- integrated composite likelihood approach for augmenting the control arm of a randomized controlled trial by incorporating real-world data.Journal of Biopharmaceutical Statistics, 30, 508–520

work page 2020

[10] [10]

Lu N, Wang C, Chen WC, Li H, Song C, Tiwari R, Xu Y, Yue LQ (2022). Propensity score-integrated power prior approach for augmenting the control arm of a randomized controlled trial by incorporating multiple external data sources.Journal of Biopharma- ceutical Statistics, 32, 158–169. 18

work page 2022

[11] [11]

Covariate handling approaches in combination with dynamic borrowing for hybrid control studies.Pharmaceutical Statistics, 22, 619–632

Fu C, Pang H, Zhou S, Zhu J (2023). Covariate handling approaches in combination with dynamic borrowing for hybrid control studies.Pharmaceutical Statistics, 22, 619–632

work page 2023

[12] [12]

A propensity-score integrated approach to Bayesian dynamic power prior borrowing.Statistics in Biopharmaceutical Research, 16, 182–191

Wang J, Zhang H, Tiwari R (2023). A propensity-score integrated approach to Bayesian dynamic power prior borrowing.Statistics in Biopharmaceutical Research, 16, 182–191

work page 2023

[13] [13]

Hierarchical commensurate and power prior models for adaptive incorporation of historical information in clinical trials.Biometrics, 67, 1047–1056

Hobbs BP, Carlin BP, Mandrekar SJ, Sargent DJ (2011). Hierarchical commensurate and power prior models for adaptive incorporation of historical information in clinical trials.Biometrics, 67, 1047–1056

work page 2011

[14] [14]

Use of historical control data for assessing treatment effects in clinical trials.Pharmaceutical Statistics, 13, 41–54

Viele K, Berry S, Neuenschwander B, Amzal B, Chen F, Enas N, Hobbs B, Ibrahim JG, Kinnersley N, Lindborg S, Micallef S, Roychoudhury S, Thompson L (2014). Use of historical control data for assessing treatment effects in clinical trials.Pharmaceutical Statistics, 13, 41–54

work page 2014

[15] [15]

Adaptive power priors with empirical Bayes for clinical trials.Pharmaceutical Statistics, 16, 349–360

Gravestock I, Held L (2017). Adaptive power priors with empirical Bayes for clinical trials.Pharmaceutical Statistics, 16, 349–360

work page 2017

[16] [16]

A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect

Robins JM (1986). A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect. Mathematical Modelling, 7, 1393–1512

work page 1986

[17] [17]

Reducing bias in observational studies using sub- classification on the propensity score.Journal of the American Statistical Association, 79, 516–524

Rosenbaum PR, Rubin DB (1984). Reducing bias in observational studies using sub- classification on the propensity score.Journal of the American Statistical Association, 79, 516–524

work page 1984

[18] [18]

Constructing a control group using multivariate matched sampling methods that incorporate the propensity score.The American Statis- tician, 39, 33–38

Rosenbaum PR, Rubin DB (1985). Constructing a control group using multivariate matched sampling methods that incorporate the propensity score.The American Statis- tician, 39, 33–38

work page 1985

[19] [19]

Marginal structural models and causal inference in epidemiology.Epidemiology, 11, 550–560

Robins JM, Hernan MA, Brumback B (2000). Marginal structural models and causal inference in epidemiology.Epidemiology, 11, 550–560

work page 2000

[20] [20]

Spring-Verlag, New York

van der Laan MJ, Robins JM (2003).Unified Methods for Censored Longitudinal Data and Causality. Spring-Verlag, New York

work page 2003

[21] [21]

Springer, New York

van der Laan MJ, Rose S (2011).Targeted Learning: Causal Inference for Observational and Experimental Data. Springer, New York

work page 2011

[22] [22]

The central role of the propensity score in ob- servational studies for causal effects.Biometrika, 70, 41–55

Rosenbaum PR and Rubin DB (1983). The central role of the propensity score in ob- servational studies for causal effects.Biometrika, 70, 41–55

work page 1983

[23] [23]

Zhang Z, Liu J, Liu W (2025a). Outcome regression methods for analyzing hybrid control studies: Balancing bias and variability.Statistics in Biopharmaceutical Research, https://doi.org/10.1080/19466315.2025.2537066

work page doi:10.1080/19466315.2025.2537066 2025

[24] [24]

Addressing non-exchangeability in hy- brid control studies: A variable selection approach.Pharmaceutical Statistics, https://doi.org/10.1002/pst.70056

Zhang Z, Liu J, Han P (2025b). Addressing non-exchangeability in hy- brid control studies: A variable selection approach.Pharmaceutical Statistics, https://doi.org/10.1002/pst.70056. 19

work page doi:10.1002/pst.70056

[25] [25]

Improving efficiency of inference in clinical trials with external control data.Biometrics, 79, 394–403

Li X, Miao W, Lu F, Zhou XH (2023). Improving efficiency of inference in clinical trials with external control data.Biometrics, 79, 394–403

work page 2023

[26] [26]

A causal inference framework for leveraging external controls in hybrid trials.Biometrics, 80(4), ujae095

Valancius M, Pang H, Zhu J, Cole SR, Funk MJ, Kosorok MR (2024). A causal inference framework for leveraging external controls in hybrid trials.Biometrics, 80(4), ujae095

work page 2024

[27] [27]

Improving randomized controlled trial analysis via data-adaptive borrowing.Biometrika, 112, asae069

Gao C, Yang S, Shan M, Ye W, Lipkovich I, Faries D (2025). Improving randomized controlled trial analysis via data-adaptive borrowing.Biometrika, 112, asae069

work page 2025

[28] [28]

Adaptive data-borrowing for improving treatment effect estimation using external controls

Yang Q, Li J, Wu P (2025). Adaptive data-borrowing for improving treatment effect estimation using external controls. InAdvances in Neural Information Processing Systems (NeurIPS 2025)

work page 2025

[29] [29]

The adaptive lasso and its oracle properties.Journal of the American Statistical Association, 101, 1418–1429

Zou H (2006). The adaptive lasso and its oracle properties.Journal of the American Statistical Association, 101, 1418–1429

work page 2006

[30] [30]

On the robustness of the adaptive lasso to model misspecification.Biometrika, 99, 717–731

Lu W, Goldberg Y, Fine JP (2012). On the robustness of the adaptive lasso to model misspecification.Biometrika, 99, 717–731

work page 2012

[31] [31]

Covariate adjustment in randomized trials with binary outcomes: targeted maximum likelihood estimation.Statistics in Medicine, 28, 39–64

Moore KL, van der Laan MJ (2009). Covariate adjustment in randomized trials with binary outcomes: targeted maximum likelihood estimation.Statistics in Medicine, 28, 39–64

work page 2009

[32] [32]

Toward better practice of covariate adjustment in analyzing randomized clinical trials.Journal of the American Statistical Association, 118, 2370–2382

Ye T, Shao J, Yi Y, Zhao Q (2023). Toward better practice of covariate adjustment in analyzing randomized clinical trials.Journal of the American Statistical Association, 118, 2370–2382

work page 2023

[33] [33]

Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/adjusting- covariates-randomized-clinical-trials-drugs-and-biological-products

Food and Drug Administration (2023b).Guidance for Industry: Adjusting for co- variates in randomized clinical trials for drugs and biological products. Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/adjusting- covariates-randomized-clinical-trials-drugs-and-biological-products

work page

[34] [34]

Conditional estimation and inference to address observed covariate imbalance in randomized clinical trials.Clinical Trials, 16, 122–131

Zhang Z, Tang L, Liu C, Berger VW (2019). Conditional estimation and inference to address observed covariate imbalance in randomized clinical trials.Clinical Trials, 16, 122–131

work page 2019

[35] [35]

Cambridge University Press, Cam- bridge, UK

van der Vaart AW (1998).Asymptotic Statistics. Cambridge University Press, Cam- bridge, UK

work page 1998

[36] [36]

Schuler A, Walsh D, Hall D, Walsh J, Fisher C, Critical Path for Alzheimer’s Dis- ease, Alzheimer’s Disease Neuroimaging Initiative, Alzheimer’s Disease Cooperative Study (2021) Increasing the efficiency of randomized trial estimates via linear adjustment for a prognostic score.International Journal of Biostatistics, 18, 329–356

work page 2021

[37] [37]

Covariate adjustment for two-sample treatment comparisons in randomized clinical trials: a principled yet flexible approach

Tsiatis AA, Davidian M, Zhang M, Lu X (2008). Covariate adjustment for two-sample treatment comparisons in randomized clinical trials: a principled yet flexible approach. Statistics in Medicine, 27, 4658–4677. 20

work page 2008

[38] [38]

Merigan TC, Amato DA, Balsley J et al. (1991). Placebo-controlled trial to evaluate zidovudine in treatment of human immunodeficiency virus infection in asymptomatic pa- tients with hemophilia.Blood, 78, 900–906

work page 1991

[39] [39]

Volberding PA, Lagakos SW, Koch MA et al. (1990). Zidovudine in asymptomatic human immunodeficiency virus infection—a controlled trial in persons with fewer than 500 CD4-positive cells per cubic millimeter.New England Journal of Medicine, 322, 941– 949

work page 1990

[40] [40]

Springer-Verlag, New York

van der Vaart AW, Wellner JA (1996).Weak Convergence and Empirical Processes with Applications to Statistics. Springer-Verlag, New York. 21 Table 1: Simulation results in Scenario A (continuous outcome, correct working models): empirical bias, standard deviation (SD), and coverage proportion (CP) for estimating (µ 0, δ) using six different estimation meth...

work page 1996