Recognition: unknown
Cross-Fitting and Fast Remainder Rates for Semiparametric Estimation
read the original abstract
There are many interesting and widely used estimators of a functional with finite semiparametric variance bound that depend on nonparametric estimators of nuisance functions. We use cross-fitting (i.e. sample splitting) to construct novel estimators with fast remainder rates. We give cross-fit doubly robust estimators that use separate subsamples to estimate different nuisance functions. We obtain general, precise results for regression spline estimation of average linear functionals of conditional expectations with a finite semiparametric variance bound. We show that a cross-fit doubly robust spline regression estimator of the expected conditional covariance is semiparametric efficient under minimal conditions. Cross-fit doubly robust estimators of other average linear functionals of a conditional expectation are shown to have the fastest known remainder rates for the Haar basis or under certain smoothness conditions. Surprisingly, the cross-fit plug-in estimator also has nearly the fastest known remainder rate, but the remainder converges to zero slower than the cross-fit doubly robust estimator. As specific examples we consider the expected conditional covariance, mean with randomly missing data, and a weighted average derivative.
This paper has not been read by Pith yet.
Forward citations
Cited by 5 Pith papers
-
Sinkhorn Treatment Effects: A Causal Optimal Transport Measure
The Sinkhorn treatment effect is a new entropic optimal transport measure of divergence between counterfactual distributions that admits first- and second-order pathwise differentiability, debiased estimators, and asy...
-
In-Sample Evaluation of Subgroups Identified by Generic Machine Learning
A conditional adaptive perturbation approach enables valid in-sample inference for machine learning-identified subgroups with nonregular boundaries via triple robustness.
-
Improving Variance Estimation for Covariate Adjustment with Binary Outcomes
The IF-LOO variance estimator for covariate-adjusted treatment effects with binary outcomes provides appropriate type I error control in simulations, especially for rare events or small samples, with a closed-form imp...
-
UD-DML: Uniform Design Subsampling for Double Machine Learning over Massive Data
UD-DML creates balanced representative subsamples via uniform design in PCA space for efficient double machine learning estimation of average treatment effects on large datasets.
-
A Semi-Supervised Kernel Two-Sample Test
A semi-supervised kernel two-sample test integrates unlabeled covariate data to achieve asymptotic normality under the null, higher power than standard kernel tests, and consistency against fixed and local alternatives.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.