Stochastic Gradients under Nuisances

Alex Luedtke; Facheng Yu; Ronak Mehta; Zaid Harchaoui

arXiv: 2508.20326 · v2 · pith:VDWL7LBSnew · submitted 2025-08-28 · stat.ML · cs.LG · math.OC

Facheng Yu, Ronak Mehta, Alex Luedtke, Zaid Harchaoui

classification stat.ML cs.LG math.OC

keywords learning, gradient, stochastic, algorithm, approximately, classical, convergence, Neyman

Stochastic gradient optimization is the dominant learning paradigm for a variety of scenarios, from classical supervised learning to modern self-supervised learning. We consider stochastic gradient algorithms for learning problems whose objectives rely on unknown nuisance parameters, and establish non-asymptotic convergence guarantees. Our results show that, while the presence of a nuisance can alter the optimum and upset the optimization trajectory, the classical stochastic gradient algorithm may still converge under appropriate conditions, such as Neyman orthogonality. Moreover, even when Neyman orthogonality is not satisfied, we show that an algorithm variant with approximately orthogonalized updates (with an approximately orthogonalized gradient oracle) may achieve similar convergence rates. Examples from orthogonal statistical learning/double machine learning and causal inference are discussed.
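
To make the role of Neyman orthogonality concrete, the following is a minimal sketch and not the paper's algorithm or experiments. It assumes a hypothetical partially linear model, D = m(X) + eta and Y = theta0 * D + g(X) + eps, in which every nuisance estimate is off by the same constant `delta`. It runs plain stochastic gradient descent on two losses: a plug-in squared loss that uses the estimate of g, and the residual-on-residual loss from double machine learning, which is Neyman orthogonal. The model, the names (`sgd`, `grad_plugin`, `grad_orth`), the error `delta`, and the step-size schedule are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
theta0, delta, n = 1.5, 0.2, 50_000  # true target, nuisance error, sample size


def m(x):
    # Assumed treatment regression E[D | X].
    return 1.0 + np.sin(x)


def g(x):
    # Assumed nuisance component of the outcome model.
    return np.cos(x)


# Simulated data from the assumed partially linear model.
x = rng.normal(size=n)
d = m(x) + rng.normal(size=n)
y = theta0 * d + g(x) + rng.normal(size=n)

# Nuisance estimates, each deliberately off by the constant delta.
g_hat = g(x) + delta
m_hat = m(x) + delta
l_hat = theta0 * m(x) + g(x) + delta  # estimate of E[Y | X]


def sgd(grad, n_steps, theta=0.0, c=1.0, t0=10):
    # One pass of SGD with step size c / (t + t0), one sample per step.
    for i in range(n_steps):
        theta -= c / (i + 1 + t0) * grad(theta, i)
    return theta


def grad_plugin(theta, i):
    # Gradient of 0.5 * (Y - theta * D - g_hat(X))^2 (not orthogonal).
    return -(y[i] - theta * d[i] - g_hat[i]) * d[i]


def grad_orth(theta, i):
    # Gradient of 0.5 * (Y - l_hat(X) - theta * (D - m_hat(X)))^2.
    r = d[i] - m_hat[i]
    return -(y[i] - l_hat[i] - theta * r) * r


theta_plug = sgd(grad_plugin, n)
theta_orth = sgd(grad_orth, n)
print("plug-in error:   ", abs(theta_plug - theta0))
print("orthogonal error:", abs(theta_orth - theta0))
```

In this toy setup the plug-in loss has a population minimizer shifted from theta0 by an amount proportional to `delta`, while the orthogonal loss shifts it only by an amount of order `delta` squared, so the second run should land closer to theta0. Both runs use the same unmodified SGD update; only the loss differs.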

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Fitted $Q$ Evaluation Without Bellman Completeness via Stationary Weighting
stat.ML · 2025-12 · conditional novelty 7.0

Stationary-weighted FQE achieves finite-sample linear convergence to the projected Bellman fixed point without Bellman completeness by reweighting regressions to the target stationary norm.