Unsupervised Domain Adaptation by Backpropagation

Victor Lempitsky; Yaroslav Ganin

arxiv: 1409.7495 · v2 · pith:YRCMDCXInew · submitted 2014-09-26 · 📊 stat.ML · cs.LG· cs.NE

Unsupervised Domain Adaptation by Backpropagation

Yaroslav Ganin , Victor Lempitsky This is my paper

classification 📊 stat.ML cs.LGcs.NE

keywords domaindataadaptationlabeledapproachdeeptrainedamount

0 comments

read the original abstract

Top-performing deep architectures are trained on massive amounts of labeled data. In the absence of labeled data for a certain task, domain adaptation often provides an attractive option given that labeled data of similar nature but from a different domain (e.g. synthetic images) are available. Here, we propose a new approach to domain adaptation in deep architectures that can be trained on large amount of labeled data from the source domain and large amount of unlabeled data from the target domain (no labeled target-domain data is necessary). As the training progresses, the approach promotes the emergence of "deep" features that are (i) discriminative for the main learning task on the source domain and (ii) invariant with respect to the shift between the domains. We show that this adaptation behaviour can be achieved in almost any feed-forward model by augmenting it with few standard layers and a simple new gradient reversal layer. The resulting augmented architecture can be trained using standard backpropagation. Overall, the approach can be implemented with little effort using any of the deep-learning packages. The method performs very well in a series of image classification experiments, achieving adaptation effect in the presence of big domain shifts and outperforming previous state-of-the-art on Office datasets.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery
cs.CV 2026-05 unverdicted novelty 7.0

SkyPart uses learnable prototypes for patch grouping, altitude modulation only in training, graph-attention readout, and Kendall-weighted loss to set new state-of-the-art single-pass performance on SUES-200, Universit...
Blending-target Domain Adaptation by Adversarial Meta-Adaptation Networks
cs.LG 2019-07 unverdicted novelty 7.0

AMEAN applies adversarial meta-learning to discover implicit meta-sub-target clusters in blended target data, reducing intra-target category misalignment and outperforming standard DA methods on three BTDA benchmarks.
Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery
cs.CV 2026-05 unverdicted novelty 6.0

SkyPart achieves state-of-the-art single-pass cross-view geo-localization on SUES-200, University-1652, and DenseUAV by using prototype-based part discovery, altitude-conditioned modulation, and Kendall-weighted loss,...
Discriminative Active Learning
cs.LG 2019-07 unverdicted novelty 6.0

DAL poses batch active learning as a binary classification task between labeled and unlabeled data to select informative examples for labeling.
NIESR: Nuisance Invariant End-to-end Speech Recognition
cs.CL 2019-07 unverdicted novelty 6.0

NIESR applies unsupervised adversarial invariance induction to end-to-end ASR, reporting 5.48-14.44% relative error reductions on WSJ0, CHiME3, and TIMIT without nuisance factor labels.
ClinQueryAgent: A Conversational Agent for Population Health Management
cs.IR 2026-04 unverdicted novelty 4.0

The paper introduces ClinQueryAgent, a conversational agent that converts natural language queries into database queries for population health management while keeping patient data secure, and reports its use by 128 s...