pith. machine review for the scientific record. sign in

arxiv: 1801.09138 · v1 · submitted 2018-01-27 · 🧮 math.ST · stat.TH

Recognition: unknown

Cross-Fitting and Fast Remainder Rates for Semiparametric Estimation

Authors on Pith no claims yet
classification 🧮 math.ST stat.TH
keywords cross-fitestimatorsremainderconditionaldoublyrobustsemiparametricaverage
0
0 comments X
read the original abstract

There are many interesting and widely used estimators of a functional with finite semiparametric variance bound that depend on nonparametric estimators of nuisance functions. We use cross-fitting (i.e. sample splitting) to construct novel estimators with fast remainder rates. We give cross-fit doubly robust estimators that use separate subsamples to estimate different nuisance functions. We obtain general, precise results for regression spline estimation of average linear functionals of conditional expectations with a finite semiparametric variance bound. We show that a cross-fit doubly robust spline regression estimator of the expected conditional covariance is semiparametric efficient under minimal conditions. Cross-fit doubly robust estimators of other average linear functionals of a conditional expectation are shown to have the fastest known remainder rates for the Haar basis or under certain smoothness conditions. Surprisingly, the cross-fit plug-in estimator also has nearly the fastest known remainder rate, but the remainder converges to zero slower than the cross-fit doubly robust estimator. As specific examples we consider the expected conditional covariance, mean with randomly missing data, and a weighted average derivative.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Sinkhorn Treatment Effects: A Causal Optimal Transport Measure

    stat.ML 2026-05 unverdicted novelty 7.0

    The Sinkhorn treatment effect is a new entropic optimal transport measure of divergence between counterfactual distributions that admits first- and second-order pathwise differentiability, debiased estimators, and asy...

  2. In-Sample Evaluation of Subgroups Identified by Generic Machine Learning

    stat.ME 2026-05 unverdicted novelty 7.0

    A conditional adaptive perturbation approach enables valid in-sample inference for machine learning-identified subgroups with nonregular boundaries via triple robustness.

  3. Improving Variance Estimation for Covariate Adjustment with Binary Outcomes

    stat.ME 2026-05 unverdicted novelty 6.0

    The IF-LOO variance estimator for covariate-adjusted treatment effects with binary outcomes provides appropriate type I error control in simulations, especially for rare events or small samples, with a closed-form imp...

  4. UD-DML: Uniform Design Subsampling for Double Machine Learning over Massive Data

    stat.ME 2026-05 unverdicted novelty 6.0

    UD-DML creates balanced representative subsamples via uniform design in PCA space for efficient double machine learning estimation of average treatment effects on large datasets.

  5. A Semi-Supervised Kernel Two-Sample Test

    stat.ML 2026-05 unverdicted novelty 6.0

    A semi-supervised kernel two-sample test integrates unlabeled covariate data to achieve asymptotic normality under the null, higher power than standard kernel tests, and consistency against fixed and local alternatives.