Orthogonal Representation Learning for Estimating Causal Quantities

Dennis Frauen; Jonas Schweisthal; Stefan Feuerriegel; Valentyn Melnychuk

arxiv: 2502.04274 · v4 · submitted 2025-02-06 · 💻 cs.LG

Orthogonal Representation Learning for Estimating Causal Quantities

Valentyn Melnychuk , Dennis Frauen , Jonas Schweisthal , Stefan Feuerriegel This is my paper

classification 💻 cs.LG

keywords learningrepresentationlearnersneyman-orthogonalend-to-endtheoreticalbalancingcausal

0 comments

read the original abstract

End-to-end representation learning has become a powerful tool for estimating causal quantities from high-dimensional observational data, but its efficiency remained unclear. Here, we face a central tension: End-to-end representation learning methods often work well in practice but lack asymptotic optimality in the form of the quasi-oracle efficiency. In contrast, two-stage Neyman-orthogonal learners provide such a theoretical optimality property but do not explicitly benefit from the strengths of representation learning. In this work, we step back and ask two research questions: (1) When do representations strengthen existing Neyman-orthogonal learners? and (2) Can a balancing constraint - a commonly proposed technique in the representation learning literature - provide improvements to Neyman-orthogonality? We address these two questions through our theoretical and empirical analysis, where we introduce a unifying framework that connects representation learning with Neyman-orthogonal learners (namely, OR-learners). In particular, we show that, under the low-dimensional manifold hypothesis, the OR-learners can strictly improve the estimation error of the standard Neyman-orthogonal learners. At the same time, we find that the balancing constraint requires an additional inductive bias and cannot generally compensate for the lack of Neyman-orthogonality of the end-to-end approaches. Building on these insights, we offer guidelines for how users can effectively combine representation learning with the classical Neyman-orthogonal learners to achieve both practical performance and theoretical guarantees.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Annotation-Assisted Learning of Treatment Policies From Multimodal Electronic Health Records
cs.LG 2025-07 unverdicted novelty 6.0

AACE is an annotation-assisted method for causal policy learning from multimodal EHRs that outperforms risk-based and representation-based baselines on synthetic, semi-synthetic, and real datasets.