pith. sign in

super hub Canonical reference

Invariant Risk Minimization

Canonical reference. 71% of citing Pith papers cite this work as background.

119 Pith papers citing it
Background 71% of classified citations
abstract

We introduce Invariant Risk Minimization (IRM), a learning paradigm to estimate invariant correlations across multiple training distributions. To achieve this goal, IRM learns a data representation such that the optimal classifier, on top of that data representation, matches for all training distributions. Through theory and experiments, we show how the invariances learned by IRM relate to the causal structures governing the data and enable out-of-distribution generalization.

hub tools

citation-role summary

background 12 method 1 other 1

citation-polarity summary

claims ledger

  • abstract We introduce Invariant Risk Minimization (IRM), a learning paradigm to estimate invariant correlations across multiple training distributions. To achieve this goal, IRM learns a data representation such that the optimal classifier, on top of that data representation, matches for all training distributions. Through theory and experiments, we show how the invariances learned by IRM relate to the causal structures governing the data and enable out-of-distribution generalization.
  • background Θ ⊆ Rd are convex and compact, and letθ∗ ∈ Θ be a minimizer of the worst-group objectiveR(θ). Then there exists a distributionQ∗ ∈ Q such thatθ∗ ∈ arg minθ Ez∼Q∗[ℓ(θ;z)]. However, this equivalence breaks down when the lossℓ is non-convex: Counterexample 1. Consider a uniform data distributionP supported on two points Z = {z1,z 2}, and letℓ(θ;z) be as in Figure 4, withΘ = [0, 1]. The DRO solutionθ∗ achieves a worst-case loss of R(θ∗) = 0.6. Now consider any weights (w1,w 2) ∈ ∆2 and w.l.o.g. letw

authors

co-cited works

clear filters

representative citing papers

Is Spurious Correlation Removal Always Learnable?

cs.LG · 2026-06-11 · unverdicted · novelty 7.0

Conditional computational barrier exists for learning k=1 invariant subspaces in samplable multi-environment instances under sparse recovery hardness; minimax risk is Theta(k(d-k)/(n|E|)) with phase transition at n* ~ k(d-k)/(|E| gamma^2).

Implicit Neural Representations of Individual Behavior

cs.LG · 2026-06-10 · unverdicted · novelty 7.0

Behavioral INR adapts INRs to behavior by mapping states to actions with FiLM-modulated episode latents for self-supervised policy inference in unlabeled data, with new policy OOD definitions.

Invariant Gradient Alignment for Robust Reasoning Distillation

cs.LG · 2026-06-03 · unverdicted · novelty 7.0

Invariant Gradient Alignment uses Logical Isomer Sets and a Continuous Gradient Conflict Mask to tighten OOD generalization bounds and boost empirical performance over ERM in reasoning distillation.

Prediction-Intervention Games and Invariant Sets

stat.ML · 2026-05-16 · unverdicted · novelty 7.0

In prediction-intervention games, stable-blanket predictors are at least as good as causal-parent predictors for two classes of follower objectives and can be worst-case optimal under additional conditions.

Continual Learning of Domain-Invariant Representations

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Introduces replay-based continual learning with sequential invariance alignment to learn domain-invariant representations, outperforming baselines on generalization to unseen domains across six datasets in vision, medicine, manufacturing, and ecology.

Spectral Gradient Surgery for Domain-Generalizable Dataset Distillation

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

Spectral Gradient Surgery disentangles class-discriminative and domain-specific signals in distribution-matching distilled datasets by analyzing gradient agreement in the spectral domain, yielding better out-of-distribution performance.

citing papers explorer

Showing 6 of 6 citing papers after filters.

  • Prediction-Intervention Games and Invariant Sets stat.ML · 2026-05-16 · unverdicted · none · ref 1 · internal anchor

    In prediction-intervention games, stable-blanket predictors are at least as good as causal-parent predictors for two classes of follower objectives and can be worst-case optimal under additional conditions.

  • Anchor PCA stat.ML · 2026-06-04 · unverdicted · none · ref 4 · internal anchor

    Anchor PCA recovers a maximal invariant subspace for multi-domain data via PCA on a modified target matrix that trades off explained variance with domain agreement.

  • Unsupervised Identification and Removal of Spurious Correlations During Fine-Tuning stat.ML · 2026-05-26 · unverdicted · none · ref 1 · internal anchor

    Spurious latent factors in fine-tuning can be identified unsupervised from naive LoRA weights and removed via gradient projection of associated patterns to reduce bias and misalignment while preserving task performance.

  • Robust Representation Learning through Explicit Environment Modeling stat.ML · 2026-04-28 · unverdicted · none · ref 2 · internal anchor

    Explicitly modeling and marginalizing environment variation via generalized random-intercept models produces representations that support robust average prediction across unseen environments and outperform invariant-learning methods in challenging distribution-shift settings.

  • Environment-Robust Representation Learning with Empirical Bayes stat.ML · 2026-06-03 · unverdicted · none · ref 3 · internal anchor

    An empirical Bayes variational inference method learns environment-robust latent variables from multi-environment data for improved prediction in unseen environments.

  • Causality as the Statistical Conscience of Artificial Intelligence: From Pearl's Ladder to Trustworthy Machines stat.ML · 2026-05-22 · unverdicted · none · ref 3 · internal anchor

    Causality is required for out-of-distribution generalization in AI, with a necessity theorem and unified causal estimators proposed to fix failure modes like hallucination and reward hacking.