On the use of cross-fitting in causal machine learning with correlated units

· 2026 · stat.ME · arXiv 2601.10899

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

In causal machine learning, the fitting and evaluation of nuisance models are often performed on separate partitions, or folds, of the observed data. This technique, called cross-fitting, eliminates bias introduced by the use of black-box predictive algorithms. When study units may be correlated, such as in spatial, clustered, or time-series data, investigators often design bespoke forms of cross-fitting to minimize correlation between folds. We prove that, perhaps contrary to popular belief, this is typically unnecessary: performing cross fitting as if study units were independent still eliminates key bias terms even when units may be correlated. In simulation experiments with various correlation structures, we show that causal machine learning estimators achieve the same or improved bias and precision under cross-fitting that ignores correlation compared to techniques striving to eliminate correlation between folds.

representative citing papers

Cross-Fitted Survey-Weighted TMLE with Design-Based Variance for Causal Machine Learning

stat.ME · 2026-06-29 · unverdicted · novelty 7.0

Cluster-level cross-fitting restores valid coverage for survey-weighted TMLE with flexible learners under stratified multistage designs, while single-fit and internal cross-validation versions under-cover.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Cross-Fitted Survey-Weighted TMLE with Design-Based Variance for Causal Machine Learning stat.ME · 2026-06-29 · unverdicted · none · ref 73 · internal anchor
Cluster-level cross-fitting restores valid coverage for survey-weighted TMLE with flexible learners under stratified multistage designs, while single-fit and internal cross-validation versions under-cover.

On the use of cross-fitting in causal machine learning with correlated units

fields

years

verdicts

representative citing papers

citing papers explorer