Learning Representations that Support Robust Transfer of Predictors

Tommi Jaakkola; Yilun Xu

arxiv: 2110.09940 · v1 · pith:NA6FZXPBnew · submitted 2021-10-19 · 💻 cs.LG

Learning Representations that Support Robust Transfer of Predictors

Yilun Xu , Tommi Jaakkola This is my paper

classification 💻 cs.LG

keywords transferriskcriterionenvironmentsgeneralizationavailableenvironmentoptimizing

0 comments

read the original abstract

Ensuring generalization to unseen environments remains a challenge. Domain shift can lead to substantially degraded performance unless shifts are well-exercised within the available training environments. We introduce a simple robust estimation criterion -- transfer risk -- that is specifically geared towards optimizing transfer to new environments. Effectively, the criterion amounts to finding a representation that minimizes the risk of applying any optimal predictor trained on one environment to another. The transfer risk essentially decomposes into two terms, a direct transfer term and a weighted gradient-matching term arising from the optimality of per-environment predictors. Although inspired by IRM, we show that transfer risk serves as a better out-of-distribution generalization criterion, both theoretically and empirically. We further demonstrate the impact of optimizing such transfer risk on two controlled settings, each representing a different pattern of environment shift, as well as on two real-world datasets. Experimentally, the approach outperforms baselines across various out-of-distribution generalization tasks. Code is available at \url{https://github.com/Newbeeer/TRM}.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Assessing Distribution Shift in Human Activity Recognition for Domain Generalization
cs.AI 2026-06 unverdicted novelty 6.0

Evaluates four distribution shifts in sensor-based HAR, finds diversity shifts dominate, and shows 28 DG methods only marginally beat ERM while releasing open benchmarks.