arxiv: 2407.18428 · v1 · pith:3ORP3U67new · submitted 2024-07-25 · 💻 cs.LG · cs.AI · cs.CV

Weighted Risk Invariance: Domain Generalization under Invariant Feature Shift

classification 💻 cs.LG · cs.AI · cs.CV
keywords invariant · models · shift · text · learning · under · covariate · environments

Learning models whose predictions are invariant under multiple environments is a promising approach for out-of-distribution generalization. Such models are trained to extract features $X_{\text{inv}}$ where the conditional distribution $Y \mid X_{\text{inv}}$ of the label given the extracted features does not change across environments. Invariant models are also supposed to generalize to shifts in the marginal distribution $p(X_{\text{inv}})$ of the extracted features $X_{\text{inv}}$, a type of shift we call an $\textit{invariant covariate shift}$. However, we show that proposed methods for learning invariant models underperform under invariant covariate shift, either failing to learn invariant models (even for data generated from simple and well-studied linear-Gaussian models) or having poor finite-sample performance. To alleviate these problems, we propose $\textit{weighted risk invariance}$ (WRI). Our framework is based on imposing invariance of the loss across environments subject to appropriate reweightings of the training examples. We show that WRI provably learns invariant models, i.e. discards spurious correlations, in linear-Gaussian settings. We propose a practical algorithm to implement WRI by learning the density $p(X_{\text{inv}})$ and the model parameters simultaneously, and we demonstrate empirically that WRI outperforms previous invariant learning methods under invariant covariate shift.
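
The reweighting idea in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm, and several things here are assumptions made only for illustration: the weight applied to each example in one environment is taken to be the density of the invariant feature under the other environment, those densities are treated as known Gaussians (the abstract says the practical method learns $p(X_{\text{inv}})$ jointly with the model), and the data generator, the names `make_env` and `wri_penalty`, and all constants are hypothetical. Under those assumptions, a predictor that uses only the invariant feature should give nearly equal weighted risks in both environments, while a predictor that relies on a spurious feature should not.

```python
import numpy as np


def gaussian_pdf(x, mean, std):
    """Density of N(mean, std^2) evaluated at x."""
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2.0 * np.pi))


def make_env(rng, n, mean, std, spurious_sign):
    """Sample one toy environment (hypothetical data, not from the paper).

    y depends on x_inv in the same way in every environment;
    x_spu is correlated with y, but the sign of that correlation varies.
    """
    x_inv = rng.normal(mean, std, n)
    y = 2.0 * x_inv + rng.normal(0.0, 0.5, n)
    x_spu = spurious_sign * y + rng.normal(0.0, 0.5, n)
    return x_inv, x_spu, y


def wri_penalty(loss_a, loss_b, dens_b_at_a, dens_a_at_b):
    """Squared gap between the two cross-weighted risks (assumed weighting)."""
    risk_a = np.mean(dens_b_at_a * loss_a)  # env A losses, weighted by env B density
    risk_b = np.mean(dens_a_at_b * loss_b)  # env B losses, weighted by env A density
    return float((risk_a - risk_b) ** 2)


rng = np.random.default_rng(0)
MEAN_A, STD_A = 0.0, 1.0  # marginal of x_inv in environment A
MEAN_B, STD_B = 1.0, 1.5  # shifted marginal in environment B

xa_inv, xa_spu, ya = make_env(rng, 20000, MEAN_A, STD_A, spurious_sign=+1.0)
xb_inv, xb_spu, yb = make_env(rng, 20000, MEAN_B, STD_B, spurious_sign=-1.0)

# Known densities stand in for the learned density estimate.
dens_b_at_a = gaussian_pdf(xa_inv, MEAN_B, STD_B)
dens_a_at_b = gaussian_pdf(xb_inv, MEAN_A, STD_A)

# Squared-error losses of two fixed predictors.
penalty_invariant = wri_penalty(
    (ya - 2.0 * xa_inv) ** 2, (yb - 2.0 * xb_inv) ** 2, dens_b_at_a, dens_a_at_b
)
penalty_spurious = wri_penalty(
    (ya - xa_spu) ** 2, (yb - xb_spu) ** 2, dens_b_at_a, dens_a_at_b
)

print("penalty, invariant predictor:", penalty_invariant)
print("penalty, spurious predictor: ", penalty_spurious)
```

In this sketch the two weighted risks both estimate an integral of the form $\int p_A(x)\, p_B(x)\, \mathbb{E}[\ell \mid x]\, dx$, which is symmetric in the two environments whenever the conditional expected loss is the same in both. That symmetry is why the penalty is close to zero for the invariant predictor even though the marginals of the invariant feature differ.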

This paper has not been read by Pith yet.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. S2Aligner: Pair-Efficient and Transferable Pre-Training for Sparse Text-Attributed Graphs

    cs.LG 2026-05 unverdicted novelty 6.0

    S2Aligner decouples semantic and structural components in LLM-as-Aligner pre-training for sparse TAGs and uses structure-oriented reconstruction plus domain risk balancing to improve transferability and reduce general...

  2. S2Aligner: Pair-Efficient and Transferable Pre-Training for Sparse Text-Attributed Graphs

    cs.LG 2026-05 unverdicted novelty 5.0

    S2Aligner decouples semantic and structural components in LLM-based graph-text alignment, adds structure-oriented reconstruction with consistency control, and uses sparsity-aware risk balancing to reduce cross-domain ...