Measuring and Harnessing Transference in Multi-Task Learning

Chelsea Finn; Christopher Fifty; Ehsan Amid; Rohan Anil; Tianhe Yu; Zhe Zhao

arxiv: 2010.15413 · v3 · pith:ONDDNKSOnew · submitted 2020-10-29 · 💻 cs.LG · cs.AI· cs.CV· cs.RO

Measuring and Harnessing Transference in Multi-Task Learning

Christopher Fifty , Ehsan Amid , Zhe Zhao , Tianhe Yu , Rohan Anil , Chelsea Finn This is my paper

classification 💻 cs.LG cs.AIcs.CVcs.RO

keywords learningmulti-tasktaskstransferencetrainingbenefitdynamicsinformation

0 comments

read the original abstract

Multi-task learning can leverage information learned by one task to benefit the training of other tasks. Despite this capacity, naive formulations often degrade performance and in particular, identifying the tasks that would benefit from co-training remains a challenging design question. In this paper, we analyze the dynamics of information transfer, or transference, across tasks throughout training. Specifically, we develop a similarity measure that can quantify transference among tasks and use this quantity to both better understand the optimization dynamics of multi-task learning as well as improve overall learning performance. In the latter case, we propose two methods to leverage our transference metric. The first operates at a macro-level by selecting which tasks should train together while the second functions at a micro-level by determining how to combine task gradients at each training step. We find these methods can lead to significant improvement over prior work on three supervised multi-task learning benchmarks and one multi-task reinforcement learning paradigm.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

$\mathcal{B}^{3}$-Net: Controlled Posterior Bridge Learning for Multi-Task Dense Prediction
cs.CV 2026-05 unverdicted novelty 6.0

B³-Net improves multi-task dense prediction by estimating patch-wise evidence precision, fusing it into a reliability-weighted posterior bridge, and redistributing via bounded updates to limit contamination from unrel...