Doubly Robust Estimators with Weak Overlap
Pith reviewed 2026-05-24 09:00 UTC · model grok-4.3
The pith
Trimming propensity scores for weak overlap breaks double robustness in causal estimators, but bias correction restores it while preserving the original target parameter.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Trimming propensity scores reduces variance but eliminates double robustness. Bias-corrected doubly robust estimators retain double robustness after trimming, preserving the original causal targets across unconfoundedness, instrumental variables, and difference-in-differences designs.
What carries the argument
Bias-corrected doubly robust estimator applied after propensity-score trimming
If this is right
- The same bias-correction approach works for unconfoundedness, instrumental-variables, and difference-in-differences identification.
- The resulting estimators deliver more precise point estimates in the four reported applications.
- The method recovers the original causal parameter rather than a trimmed or reweighted version of it.
Where Pith is reading between the lines
- Practitioners facing weak overlap may now trim more aggressively without sacrificing the robustness property that originally motivated doubly robust methods.
- The construction suggests that similar corrections could be derived for other forms of sample restriction or weighting that currently break double robustness.
- Because the target parameter stays fixed, comparisons across trimmed and untrimmed specifications become directly interpretable.
Load-bearing premise
The bias-correction term fully offsets the trimming-induced bias even when the propensity-score or outcome model is misspecified.
What would settle it
In a Monte Carlo design with correct outcome model but misspecified propensity scores, the trimmed estimator without correction exhibits bias while the bias-corrected version does not.
read the original abstract
Doubly robust (DR) estimators guard against model misspecification but remain sensitive to weak covariate overlap. We show that trimming propensity scores reduces variance but eliminates double robustness. We introduce DR estimators that retain double robustness after trimming through bias correction, preserving the original causal targets across unconfoundedness, instrumental variables, and difference-in-differences designs. In four applications, the proposed estimator yields more precise estimates: ruling out large mortality effects of Medicaid expansion, detecting workforce growth from mental health reform, recovering the Black--White test score gap without strong functional form restrictions, and recovering a positive 401(k) savings effect consistent with the prior literature.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims that trimming propensity scores in standard doubly robust (DR) estimators eliminates their double robustness property, but introduces bias-corrected DR estimators that restore double robustness after trimming while preserving the original causal targets. This construction is asserted to apply across unconfoundedness (ATE), instrumental variables, and difference-in-differences designs. Four empirical applications are presented in which the proposed estimators yield more precise estimates than untrimmed alternatives.
Significance. If the bias-correction construction holds, the result is significant for applied causal inference: weak overlap is ubiquitous, trimming is a common practical response, and the proposal decouples variance reduction from loss of double robustness or change in the target parameter. Coverage of three distinct identification strategies plus concrete applications strengthens potential impact.
major comments (2)
- [Abstract / theoretical development] The abstract asserts that trimming eliminates double robustness and that the bias correction restores it without altering the target, but provides no derivation or explicit statement of the correction term. The central claim is therefore not verifiable from the given information; a dedicated theoretical section with the explicit form of the bias correction and the proof that the estimand remains unchanged is required.
- [Theoretical claims] The abstract states that the method preserves the original causal targets across designs, yet the reader's summary notes the absence of derivation details, proofs, or error bars. Without these, it is impossible to confirm that the bias correction is parameter-free with respect to the original estimand or that it does not introduce new identification assumptions.
minor comments (1)
- [Empirical section] The abstract mentions four applications but does not specify sample sizes, trimming thresholds, or how standard errors are computed; these details should be added for reproducibility.
Simulated Author's Rebuttal
We thank the referee for the thoughtful comments on our manuscript. We address the major comments point by point below, clarifying the location of the requested theoretical material in the full paper while noting that abstracts are by design concise.
read point-by-point responses
-
Referee: [Abstract / theoretical development] The abstract asserts that trimming eliminates double robustness and that the bias correction restores it without altering the target, but provides no derivation or explicit statement of the correction term. The central claim is therefore not verifiable from the given information; a dedicated theoretical section with the explicit form of the bias correction and the proof that the estimand remains unchanged is required.
Authors: The full manuscript contains a dedicated theoretical section (Section 2) deriving the explicit form of the bias-correction term for the trimmed DR estimator and proving that the estimand is identical to the original untrimmed target. The abstract summarizes the result without the full derivation, which is standard practice; the claims are verifiable from the body of the paper. We are happy to add a one-sentence pointer to Section 2 in a revised abstract if the editor prefers. revision: partial
-
Referee: [Theoretical claims] The abstract states that the method preserves the original causal targets across designs, yet the reader's summary notes the absence of derivation details, proofs, or error bars. Without these, it is impossible to confirm that the bias correction is parameter-free with respect to the original estimand or that it does not introduce new identification assumptions.
Authors: Section 2 of the manuscript supplies the derivations and proofs for the unconfoundedness, IV, and DiD cases, establishing that the bias correction is parameter-free with respect to the original estimand and relies only on the original identification assumptions. Standard error bars appear in all four empirical applications (Section 4). The full text therefore contains the requested details. revision: no
Circularity Check
No significant circularity; derivation self-contained
full rationale
The paper presents a methodological extension of standard doubly robust estimators by adding a bias-correction term after propensity trimming. The abstract describes showing that trimming eliminates double robustness and then constructing corrected estimators that restore the property while preserving the original target parameter across designs. No quoted equations or steps reduce the claimed result to its own inputs by construction, fitted parameters renamed as predictions, or load-bearing self-citations. The central construction is presented as building on the existing DR framework with an explicit correction, without evidence of self-definitional loops or ansatz smuggling. This is the expected outcome for a paper whose contribution is an estimator modification rather than a closed derivation chain.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Unconfoundedness (or equivalent identification assumptions for IV and DiD)
- domain assumption Standard doubly robust property holds when either model is correct
Forward citations
Cited by 2 Pith papers
-
A Sensitivity Approach to Causal Inference Under Limited Overlap
A sensitivity analysis using worst-case confidence bounds on trimming bias to assess robustness of causal estimates under limited overlap by measuring required irregularity in the outcome function.
-
A Practical Guide to Instrumental Variables Methods with Heterogeneous Treatment Effects
A synthesis of how covariate-inclusive IV specifications produce weighted averages of subgroup LATEs, with recommendations for flexible models, assumption tests, and software to handle heterogeneous effects.
Reference graph
Works this paper leans on
-
[1]
Empirical process methods in econometrics,
Andrews, D. W. (1994): “Empirical process methods in econometrics,” Handbook of Economet- rics, 4, 2247–2294
work page 1994
-
[2]
Identification of Causal Effects Using Instrumental Variables,
Angrist, J. D., G. W. Imbens, and D. B. Rubin (1996): “Identification of Causal Effects Using Instrumental Variables,” Journal of the American Statistical Association , 91, 444–455
work page 1996
-
[3]
Doubly robust estimation in missing data and causa l inference models,
Bang, H. and J. M. Robins (2005): “Doubly robust estimation in missing data and causa l inference models,” Biometrics, 61, 962–973
work page 2005
-
[4]
Some new asymp- totic theory for least squares series: pointwise and unifor m results,
Belloni, A., V. Chernozhukov, D. Chetverikov, and K. Kato (2015): “Some new asymp- totic theory for least squares series: pointwise and unifor m results,” Journal of Econometrics , 186, 345–366
work page 2015
-
[5]
Program Evaluation and Causal Inference With High-Dimensional Dat a,
Belloni, A., V. Chernozhukov, I. Fern ´andez-V al, and C. Hansen (2017): “Program Evaluation and Causal Inference With High-Dimensional Dat a,” Econometrica, 85, 233–298
work page 2017
-
[6]
Inference on Treatment Effects after Selection among High-Dimensional Controls,
Belloni, A., V. Chernozhukov, and C. Hansen (2014): “Inference on Treatment Effects after Selection among High-Dimensional Controls,” The Review of Economic Studies , 81, 608–650
work page 2014
-
[7]
Difference-in-Differences with Multiple Time Periods,
Callaway, B. and P. H. C. Sant’Anna (2021): “Difference-in-Differences with Multiple Time Periods,” Journal of Econometrics , 225, 200–230
work page 2021
-
[8]
Heavy tail robust estimation and inference for ave rage treatment effects,
Chaudhuri, S. and J. B. Hill (2016): “Heavy tail robust estimation and inference for ave rage treatment effects,” Working paper
work page 2016
-
[9]
Dealing with limited overlap in estimation of average treatment effects,
Crump, R. K., V. J. Hotz, G. W. Imbens, and O. A. Mitnik (2009): “Dealing with limited overlap in estimation of average treatment effects,” Biometrika, 96, 187–199
work page 2009
-
[10]
Technology and Big Data Are Changing Economics: Mining Text to Track Methods,
Currie, J., H. Kleven, and E. Zwiers (2020): “Technology and Big Data Are Changing Economics: Mining Text to Track Methods,” AEA Papers and Proceedings , 110, 42–48
work page 2020
-
[11]
Nonparametric IV Estimation of Local Average Trea tment Effects with Covariates,
Frolich, M. (2007): “Nonparametric IV Estimation of Local Average Trea tment Effects with Covariates,” Journal of Econometrics , 139, 35–75
work page 2007
-
[12]
Hahn, J. (1998): “On the role of the propensity score in efficient semip arametric estimation of average treatment effects,” Econometrica, 315–331. 20
work page 1998
-
[13]
Heiler, P. and E. Kazak (2021): “Valid inference for treatment effect parameters und er irregular identification and many extreme propensity scores,” Journal of Econometrics , 222, 1083–1108
work page 2021
-
[14]
Inference on finite-population treatment effects under limited overlap,
Hong, H., M. P. Leung, and J. Li (2020): “Inference on finite-population treatment effects under limited overlap,” The Econometrics Journal , 23, 32–47
work page 2020
-
[15]
Identification and Estimation of Local Average Treatment Effects,
Imbens, G. W. and J. D. Angrist (1994): “Identification and Estimation of Local Average Treatment Effects,” Econometrica, 62, 467–75
work page 1994
-
[16]
Kang, J. D. Y. and J. L. Schafer (2007): “Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data.” Statistical Science, 22, 569–573
work page 2007
-
[17]
On uniform inference in nonlinear models with endo gene- ity,
Khan, S. and D. Nekipelov (2015): “On uniform inference in nonlinear models with endo gene- ity,” Working paper
work page 2015
-
[18]
Irregular identification, support conditions, an d inverse weight estimation,
Khan, S. and E. Tamer (2010): “Irregular identification, support conditions, an d inverse weight estimation,” Econometrica, 78, 2021–2042
work page 2010
-
[19]
Robust inference using inverse probability weigh ting,
Ma, X. and J. W ang (2020): “Robust inference using inverse probability weigh ting,” Journal of the American Statistical Association , 115, 1851–1860
work page 2020
-
[20]
Comment: Performance of Double-Robust Estimators When “Inverse Probability
Robins, J., M. Sued, Q. Lei-Gomez, and A. Rotnitzky (2007): “Comment: Performance of Double-Robust Estimators When “Inverse Probability” Weig hts Are Highly Variable,” Statistical Science, 22, 544–559
work page 2007
-
[21]
Estimation of Regression Coeffi- cients When Some Regressors Are Not Always Observed,
Robins, J. M., A. Rotnitzky, and L. P. Zhao (1994): “Estimation of Regression Coeffi- cients When Some Regressors Are Not Always Observed,” Journal of the American Statistical Association, 89, 846–866
work page 1994
-
[22]
What’s Trending in Difference-in-Differences? A Synthesis of the Recent Economet rics Literature,
Roth, J., P. H. C. Sant’Anna, A. Bilinski, and J. Poe (2023): “What’s Trending in Difference-in-Differences? A Synthesis of the Recent Economet rics Literature,” Journal of Econo- metrics, Forthcoming
work page 2023
-
[23]
Robust confidence intervals for average treatment effects under limited over- lap,
Rothe, C. (2017): “Robust confidence intervals for average treatment effects under limited over- lap,” Econometrica, 85, 645–660. Sant’Anna, P. H. and J. Zhao (2020): “Doubly robust difference-in-differences estimators ,” Journal of Econometrics , 219, 101–122
work page 2017
-
[24]
Estimation and inference for moments of ratios wit h robustness against large trimming bias,
Sasaki, Y. and T. Ura (2022): “Estimation and inference for moments of ratios wit h robustness against large trimming bias,” Econometric Theory, 38, 66–112
work page 2022
-
[25]
Introduction to Double Robust Methods for Incomplete Data,
Seaman, S. R. and S. V ansteelandt (2018): “Introduction to Double Robust Methods for Incomplete Data,” Statistical Science, 33, 184–197. 21 S/suppress loczy´nski, T., S. D. Uysal, and J. M. Wooldridge (2022): “Doubly Robust Estimation of Local Average Treatment Effects Using Inverse Probability We ighted Regression Adjustment,” arXiv:2208.01300 [econ.EM]....
-
[26]
Regression and Weighting Methods for Causal Infer ence Using Instrumental Variables,
Tan, Z. (2006): “Regression and Weighting Methods for Causal Infer ence Using Instrumental Variables,” Journal of the American Statistical Association, , 101, 1607—-1618
work page 2006
-
[27]
Inverse probability weighted estimation for gene ral missing data problems,
Wooldridge, J. M. (2007): “Inverse probability weighted estimation for gene ral missing data problems,” Journal of Econometrics , 141, 1281–1301
work page 2007
-
[28]
Yang, S. and P. Ding (2018): “Asymptotic inference of causal effects with observa tional studies trimmed by the estimated propensity scores,” Biometrika, 105, 487–493
work page 2018
-
[29]
Asymptotic trimming and rate adaptive inference f or endogenous selection estimates,
Yang, T. T. (2014): “Asymptotic trimming and rate adaptive inference f or endogenous selection estimates,” Working paper. 22
work page 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.