pith. sign in

hub

Cross-Fitting and Fast Remainder Rates for Semiparametric Estimation

14 Pith papers cite this work. Polarity classification is still indexing.

14 Pith papers citing it
abstract

There are many interesting and widely used estimators of a functional with finite semiparametric variance bound that depend on nonparametric estimators of nuisance functions. We use cross-fitting (i.e. sample splitting) to construct novel estimators with fast remainder rates. We give cross-fit doubly robust estimators that use separate subsamples to estimate different nuisance functions. We obtain general, precise results for regression spline estimation of average linear functionals of conditional expectations with a finite semiparametric variance bound. We show that a cross-fit doubly robust spline regression estimator of the expected conditional covariance is semiparametric efficient under minimal conditions. Cross-fit doubly robust estimators of other average linear functionals of a conditional expectation are shown to have the fastest known remainder rates for the Haar basis or under certain smoothness conditions. Surprisingly, the cross-fit plug-in estimator also has nearly the fastest known remainder rate, but the remainder converges to zero slower than the cross-fit doubly robust estimator. As specific examples we consider the expected conditional covariance, mean with randomly missing data, and a weighted average derivative.

hub tools

verdicts

UNVERDICTED 14

clear filters

representative citing papers

Causal K-Means Clustering

stat.ME · 2024-05-05 · unverdicted · novelty 7.0

Causal k-Means Clustering applies k-means to estimated counterfactual functions via plug-in and double machine learning bias-corrected estimators to identify subgroups with heterogeneous treatment effects and achieves root-n rates.

Sinkhorn Treatment Effects: A Causal Optimal Transport Measure

stat.ML · 2026-05-08 · unverdicted · novelty 7.0

The Sinkhorn treatment effect is a new entropic optimal transport measure of divergence between counterfactual distributions that admits first- and second-order pathwise differentiability, debiased estimators, and asymptotically valid tests for distributional treatment effects.

A Semi-Supervised Kernel Two-Sample Test

stat.ML · 2026-05-03 · unverdicted · novelty 6.0

A semi-supervised kernel two-sample test integrates unlabeled covariate data to achieve asymptotic normality under the null, higher power than standard kernel tests, and consistency against fixed and local alternatives.

crossfit: A Graph-Based Cross-Fitting Engine in R

stat.CO · 2026-05-15 · unverdicted · novelty 5.0

crossfit is an R package that supplies a general-purpose cross-fitting engine driven by user-specified DAGs of nuisance models with configurable fold allocations and reproducibility features.

Smooth Multi-Policy Causal Effect Estimation in Longitudinal Settings

cs.LG · 2026-05-14 · unverdicted · novelty 5.0

PEQ-Net uses policy-aware reparameterization of ICE Q-functions and kernel mean embeddings in a shared encoder, followed by LTMLE, to jointly estimate multiple policies while constraining second-order bias for lower variance.

citing papers explorer

Showing 5 of 5 citing papers after filters.