Blending-target Domain Adaptation by Adversarial Meta-Adaptation Networks

Jingyu Zhuang; Liang Lin; Xiaodan Liang; Ziliang Chen

arxiv: 1907.03389 · v1 · pith:472J53VCnew · submitted 2019-07-08 · 💻 cs.LG · stat.ML

Blending-target Domain Adaptation by Adversarial Meta-Adaptation Networks

Ziliang Chen , Jingyu Zhuang , Xiaodan Liang , Liang Lin This is my paper

Pith reviewed 2026-05-25 01:35 UTC · model grok-4.3

classification 💻 cs.LG stat.ML

keywords domain adaptationadversarial learningmeta-learningblending-target domain adaptationnegative transferunsupervised clusteringsub-target discovery

0 comments

The pith

AMEAN deploys an unsupervised meta-learner on target data to discover meta-sub-target domains and remove implicit category misalignment in blending-target domain adaptation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a new transfer setting called blending-target domain adaptation, in which the unlabeled target consists of several hidden sub-targets mixed together so that standard domain-adaptation methods encounter both domain gaps and category mismatches among the sub-targets. It shows that a conventional adversarial alignment between source and the mixed target is insufficient, and therefore introduces a second adversarial process in which an unsupervised meta-learner receives target samples together with ongoing feature feedbacks and clusters them into meta-sub-target domains. These discovered clusters automatically define an additional meta-sub-target adaptation loss that empirically corrects the hidden misalignments. A reader would care because many practical targets are naturally composed of such blended sub-populations, rendering most existing domain-adaptation algorithms unreliable without explicit sub-target labels.

Core claim

In the blending-target domain adaptation scenario the target domain comprises multiple sub-targets that are implicitly blended, so learners cannot assign each unlabeled sample to its sub-target; the Adversarial Meta-Adaptation Network therefore runs two adversarial processes—the first aligns source and mixed target in the usual way, while the second deploys an unsupervised meta-learner on target data and feature feedbacks to discover meta-sub-target domains whose induced adaptation loss removes the implicit category mismatching.

What carries the argument

The unsupervised meta-learner that receives only target data and ongoing feature-learning feedbacks and outputs discovered clusters treated as meta-sub-target domains to auto-design the meta-sub-target DA loss.

If this is right

BTDA constitutes a challenging transfer setup in which most existing domain-adaptation algorithms suffer negative transfer.
AMEAN significantly outperforms state-of-the-art baselines on three benchmarks configured under the BTDA protocol.
The meta-sub-target adaptation loss empirically eliminates implicit category mismatching within the mixed target.
The dual adversarial structure restrains negative transfer effects that arise from hidden sub-target misalignment.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same meta-learner mechanism could be applied to other unsupervised problems that contain latent mixture structure without explicit labels.
Performance would degrade if the discovered clusters do not align with the true (unknown) sub-target category boundaries.
Extending the meta-learner to produce soft rather than hard cluster assignments might further reduce residual misalignment.

Load-bearing premise

The unsupervised meta-learner, given only target data and ongoing feature feedbacks, can discover clusters that function as meaningful meta-sub-target domains.

What would settle it

An experiment in which the meta-learner is replaced by random partitioning of the target or by a supervised oracle that knows the true sub-target labels and shows that AMEAN then loses its reported gains over standard adversarial baselines.

Figures

Figures reproduced from arXiv: 1907.03389 by Jingyu Zhuang, Liang Lin, Xiaodan Liang, Ziliang Chen.

**Figure 1.** Figure 1: The comparison of MTDA and BTDA (color orange and blue denote source and target) setups. In MTDA (a), target domains are [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗

**Figure 2.** Figure 2: The learning pipeline of our Adversarial MEta-Adaptation Network (AMEAN). AMEAN receives source samples with ground [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: t-SNE visualizations of the features learned by Source-only, RevGred, VADA and AMEAN on Digit-five in BTDA setup. Shapes [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Ablation studies of our meta-learner across three transfer [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

read the original abstract

(Unsupervised) Domain Adaptation (DA) seeks for classifying target instances when solely provided with source labeled and target unlabeled examples for training. Learning domain-invariant features helps to achieve this goal, whereas it underpins unlabeled samples drawn from a single or multiple explicit target domains (Multi-target DA). In this paper, we consider a more realistic transfer scenario: our target domain is comprised of multiple sub-targets implicitly blended with each other, so that learners could not identify which sub-target each unlabeled sample belongs to. This Blending-target Domain Adaptation (BTDA) scenario commonly appears in practice and threatens the validities of most existing DA algorithms, due to the presence of domain gaps and categorical misalignments among these hidden sub-targets. To reap the transfer performance gains in this new scenario, we propose Adversarial Meta-Adaptation Network (AMEAN). AMEAN entails two adversarial transfer learning processes. The first is a conventional adversarial transfer to bridge our source and mixed target domains. To circumvent the intra-target category misalignment, the second process presents as ``learning to adapt'': It deploys an unsupervised meta-learner receiving target data and their ongoing feature-learning feedbacks, to discover target clusters as our ``meta-sub-target'' domains. These meta-sub-targets auto-design our meta-sub-target DA loss, which empirically eliminates the implicit category mismatching in our mixed target. We evaluate AMEAN and a variety of DA algorithms in three benchmarks under the BTDA setup. Empirical results show that BTDA is a quite challenging transfer setup for most existing DA algorithms, yet AMEAN significantly outperforms these state-of-the-art baselines and effectively restrains the negative transfer effects in BTDA.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

BTDA names a practical failure mode in DA but the meta-learner evidence is only aggregate accuracy with no cluster diagnostics.

read the letter

The paper points out that real target data often mixes unlabeled sub-domains whose category distributions differ, and that standard adversarial DA can then produce negative transfer. That setup, called BTDA, is described clearly and matches things that happen in practice. They respond with AMEAN: a first adversarial step aligns source to the mixed target, then a second unsupervised meta-learner takes target samples plus current feature signals, forms clusters treated as meta-sub-targets, and uses those to build an extra adaptation loss. On three BTDA benchmarks the method beats the listed baselines and reduces the reported negative-transfer effect.

Referee Report

2 major / 0 minor

Summary. The paper introduces Blending-target Domain Adaptation (BTDA), a scenario in which the target domain consists of multiple implicitly blended sub-targets that cannot be identified, causing domain gaps and categorical misalignments that invalidate standard DA methods. It proposes Adversarial Meta-Adaptation Network (AMEAN) with two adversarial processes: (1) conventional adversarial transfer between source and mixed target, and (2) an unsupervised meta-learner that ingests target samples plus ongoing feature feedbacks to discover clusters treated as meta-sub-target domains; these clusters auto-design a meta-sub-target DA loss claimed to eliminate implicit category mismatching. Experiments on three BTDA benchmarks report that AMEAN significantly outperforms state-of-the-art baselines and restrains negative transfer.

Significance. If the meta-learner clusters reliably align with hidden category structure and the reported gains survive controls, the work would address a practically relevant gap between standard multi-target DA assumptions and real-world blended targets. The two-process adversarial design and the explicit handling of intra-target misalignment are conceptually coherent extensions of existing adversarial DA frameworks.

major comments (2)

[Section 3.2, Algorithm 1] Section 3.2 and Algorithm 1: the unsupervised meta-learner is described as receiving only target data and feature-learning feedbacks with no external supervision, yet the central claim that the discovered clusters function as meaningful meta-sub-target domains (and thereby correct category mismatching) rests on an unverified assumption. The manuscript provides no cluster-purity diagnostics against known sub-target partitions, no visualization of cluster-category alignment, and no ablation that isolates the contribution of the meta-sub-target DA loss from the standard adversarial term.
[Empirical evaluation] Empirical section: aggregate accuracy gains are reported on three BTDA benchmarks, but without the above diagnostics it is impossible to determine whether the gains arise from the intended meta-adaptation mechanism or from incidental regularization effects of the extra clustering term. This directly affects the claim that AMEAN “effectively restrains the negative transfer effects in BTDA.”

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive feedback on the validation of the meta-learner and the empirical claims. We respond to each major comment below.

read point-by-point responses

Referee: [Section 3.2, Algorithm 1] Section 3.2 and Algorithm 1: the unsupervised meta-learner is described as receiving only target data and feature-learning feedbacks with no external supervision, yet the central claim that the discovered clusters function as meaningful meta-sub-target domains (and thereby correct category mismatching) rests on an unverified assumption. The manuscript provides no cluster-purity diagnostics against known sub-target partitions, no visualization of cluster-category alignment, and no ablation that isolates the contribution of the meta-sub-target DA loss from the standard adversarial term.

Authors: The BTDA setting is defined by the fact that sub-target partitions are unknown and unidentifiable, so ground-truth purity metrics against known partitions are unavailable by construction. We will add t-SNE visualizations of the discovered clusters (and their relation to category structure where feasible) together with an ablation that removes the meta-sub-target DA loss while retaining the standard adversarial term. These additions will appear in the revised manuscript. revision: partial
Referee: [Empirical evaluation] Empirical section: aggregate accuracy gains are reported on three BTDA benchmarks, but without the above diagnostics it is impossible to determine whether the gains arise from the intended meta-adaptation mechanism or from incidental regularization effects of the extra clustering term. This directly affects the claim that AMEAN “effectively restrains the negative transfer effects in BTDA.”

Authors: The ablation study described above will isolate the contribution of the meta-sub-target term. We will incorporate the results into the experimental section and adjust the discussion of negative-transfer reduction to reflect only what the controlled experiments support. revision: yes

standing simulated objections not resolved

Direct cluster-purity diagnostics against known sub-target partitions cannot be supplied, because the BTDA problem definition states that such partitions are unknown and unidentifiable.

Circularity Check

0 steps flagged

No significant circularity; empirical validation independent of internal definitions

full rationale

The paper introduces AMEAN with a conventional adversarial DA step plus an unsupervised meta-learner that clusters target samples using feature feedbacks to form meta-sub-target domains. No equations, fitted parameters, or self-citations are quoted that reduce the claimed accuracy gains or the elimination of category misalignment to a tautology or construction internal to the inputs. Performance is assessed via aggregate accuracy on three external BTDA benchmarks, satisfying the criterion for self-contained evaluation against external data.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 1 invented entities

Abstract supplies no explicit free parameters, background axioms, or independent evidence for the meta-sub-target construct.

invented entities (1)

meta-sub-target domains no independent evidence
purpose: Clusters discovered by the meta-learner that serve as surrogate domains for designing the intra-target adaptation loss
Introduced to address category misalignment inside the blended target; no independent evidence supplied in the abstract.

pith-pipeline@v0.9.0 · 5840 in / 1195 out tokens · 21996 ms · 2026-05-25T01:35:34.148301+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

51 extracted references · 51 canonical work pages · 14 internal anchors

[1]

Andrychowicz, M

M. Andrychowicz, M. Denil, S. Gomez, M. W. Hoffman, D. Pfau, T. Schaul, B. Shillingford, and N. De Freitas. Learn- ing to learn by gradient descent by gradient descent. In Advances in Neural Information Processing Systems , pages 3981–3989, 2016

work page 2016
[2]

Unsupervised multi-target domain adaptation: An information theoretic approach

Anonymous. Unsupervised multi-target domain adaptation: An information theoretic approach. In Submitted to Interna- tional Conference on Learning Representations, 2019. under review

work page 2019
[3]

J. C. Balloch, V . Agrawal, I. Essa, and S. Chernova. Unbi- asing semantic segmentation for robot perception using syn- thetic data feature transfer.arXiv preprint arXiv:1809.03676, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[4]

Ben-David, J

S. Ben-David, J. Blitzer, K. Crammer, A. Kulesza, F. Pereira, and J. W. Vaughan. A theory of learning from different do- mains. Machine learning, 79(1):151–175, 2010

work page 2010
[5]

Bousmalis, A

K. Bousmalis, A. Irpan, P. Wohlhart, Y . Bai, M. Kelcey, M. Kalakrishnan, L. Downs, J. Ibarz, P. Pastor, K. Kono- lige, et al. Using simulation and domain adaptation to im- prove efﬁciency of deep robotic grasping. In 2018 IEEE In- ternational Conference on Robotics and Automation (ICRA), pages 4243–4250. IEEE, 2018

work page 2018
[6]

Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks

K. Bousmalis, N. Silberman, D. Dohan, D. Erhan, and D. Krishnan. Unsupervised pixel-level domain adapta- tion with generative adversarial networks. arXiv preprint arXiv:1612.05424, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[7]

Y . Chen, W. Li, C. Sakaridis, D. Dai, and L. Van Gool. Do- main adaptive faster r-cnn for object detection in the wild. 2018

work page 2018
[8]

C. Finn, P. Abbeel, and S. Levine. Model-agnostic meta- learning for fast adaptation of deep networks. 2017

work page 2017
[9]

Unsupervised Domain Adaptation by Backpropagation

Y . Ganin and V . Lempitsky. Unsupervised domain adaptation by backpropagation. arXiv preprint arXiv:1409.7495, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[10]

Ganin, E

Y . Ganin, E. Ustinova, H. Ajakan, P. Germain, H. Larochelle, F. Laviolette, M. Marchand, and V . Lempitsky. Domain- Adversarial Training of Neural Networks. 2017

work page 2017
[11]

Fine-grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach

T. Gebru, J. Hoffman, and L. Fei-Fei. Fine-grained recogni- tion in the wild: A multi-task domain adaptation approach. arXiv preprint arXiv:1709.02476, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[12]

Ghifary, W

M. Ghifary, W. B. Kleijn, M. Zhang, D. Balduzzi, and W. Li. Deep reconstruction-classiﬁcation networks for un- supervised domain adaptation. In European Conference on Computer Vision, pages 597–613. Springer, 2016

work page 2016
[13]

Gilad-Bachrach, N

R. Gilad-Bachrach, N. Dowlin, K. Laine, K. Lauter, M. Naehrig, and J. Wernsing. Cryptonets: Applying neu- ral networks to encrypted data with high throughput and ac- curacy. In International Conference on Machine Learning , pages 201–210, 2016

work page 2016
[14]

I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y . Bengio. Generative adversarial nets. In International Conference on Neural Information Processing Systems , pages 2672–2680, 2014

work page 2014
[15]

Gopalan, R

R. Gopalan, R. Li, and R. Chellappa. Domain adaptation for object recognition: An unsupervised approach. In Com- puter Vision (ICCV), 2011 IEEE International Conference on, pages 999–1006. IEEE, 2011

work page 2011
[16]

Grandvalet and Y

Y . Grandvalet and Y . Bengio. Semi-supervised learning by entropy minimization. In Advances in neural information processing systems, pages 529–536, 2005. 13

work page 2005
[17]

Gretton, A

A. Gretton, A. J. Smola, J. Huang, M. Schmittfull, K. M. Borgwardt, and B. Sch¨olkopf. Covariate shift by kernel mean matching. 2009

work page 2009
[18]

X. Guo, L. Gao, X. Liu, and J. Yin. Improved deep embed- ded clustering with local structure preservation. In Interna- tional Joint Conference on Artiﬁcial Intelligence (IJCAI-17), pages 1753–1759, 2017

work page 2017
[19]

K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. 2015

work page 2015
[20]

Hoffman, E

J. Hoffman, E. Tzeng, T. Darrell, and K. Saenko. Simulta- neous deep transfer across domains and tasks. In Domain Adaptation in Computer Vision Applications , pages 173–

work page
[21]

FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation

J. Hoffman, D. Wang, F. Yu, and T. Darrell. Fcns in the wild: Pixel-level adversarial and constraint-based adapta- tion. arXiv preprint arXiv:1612.02649, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[22]

Krizhevsky, I

A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classiﬁcation with deep convolutional neural networks. In Advances in neural information processing systems , pages 1097–1105, 2012

work page 2012
[23]

LeCun, L

Y . LeCun, L. Bottou, Y . Bengio, and P. Haffner. Gradient- based learning applied to document recognition. Proceed- ings of the IEEE, 86(11):2278–2324, 1998

work page 1998
[24]

Liu and O

M.-Y . Liu and O. Tuzel. Coupled generative adversarial net- works. In Advances in neural information processing sys- tems, pages 469–477, 2016

work page 2016
[25]

M. Long, Y . Cao, J. Wang, and M. Jordan. Learning transfer- able features with deep adaptation networks. InInternational Conference on Machine Learning, pages 97–105, 2015

work page 2015
[26]

M. Long, H. Zhu, J. Wang, and M. I. Jordan. Deep trans- fer learning with joint adaptation networks. arXiv preprint arXiv:1605.06636, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[27]

M. Long, H. Zhu, J. Wang, and M. I. Jordan. Unsuper- vised domain adaptation with residual transfer networks. In Advances in Neural Information Processing Systems , pages 136–144, 2016

work page 2016
[28]

M. Long, H. Zhu, J. Wang, and M. I. Jordan. Deep transfer learning with joint adaptation networks. 2017

work page 2017
[29]

Boosting Domain Adaptation by Discovering Latent Domains

M. Mancini, L. Porzi, S. R. Bul `o, B. Caputo, and E. Ricci. Boosting domain adaptation by discovering latent domains. arXiv preprint arXiv:1805.01386, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[30]

Mansour, M

Y . Mansour, M. Mohri, and A. Rostamizadeh. Domain adap- tation with multiple sources. In Advances in neural informa- tion processing systems, pages 1041–1048, 2009

work page 2009
[31]

Miyato, S.-i

T. Miyato, S.-i. Maeda, S. Ishii, and M. Koyama. Virtual adversarial training: a regularization method for supervised and semi-supervised learning. IEEE transactions on pattern analysis and machine intelligence, 2018

work page 2018
[32]

Netzer, T

Y . Netzer, T. Wang, A. Coates, A. Bissacco, B. Wu, and A. Y . Ng. Reading digits in natural images with unsupervised fea- ture learning. Nips Workshop on Deep Learning and Unsu- pervised Feature Learning, 2011

work page 2011
[33]

S. J. Pan and Q. Yang. A survey on transfer learning. IEEE Transactions on knowledge and data engineering , 22(10):1345–1359, 2010

work page 2010
[34]

A. A. Rusu, M. Vecerik, T. Roth ¨orl, N. Heess, R. Pascanu, and R. Hadsell. Sim-to-real robot learning from pixels with progressive nets. arXiv preprint arXiv:1610.04286, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[35]

Saenko, B

K. Saenko, B. Kulis, M. Fritz, and T. Darrell. Adapting vi- sual category models to new domains. Computer Vision– ECCV 2010, pages 213–226, 2010

work page 2010
[36]

Asymmetric Tri-training for Unsupervised Domain Adaptation

K. Saito, Y . Ushiku, and T. Harada. Asymmetric tri- training for unsupervised domain adaptation. arXiv preprint arXiv:1702.08400, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[37]

Generate To Adapt: Aligning Domains using Generative Adversarial Networks

S. Sankaranarayanan, Y . Balaji, C. D. Castillo, and R. Chel- lappa. Generate to adapt: Aligning domains using generative adversarial networks. ArXiv e-prints, abs/1704.01705, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[38]

R. Shu, H. H. Bui, H. Narui, and S. Ermon. A dirt-t ap- proach to unsupervised domain adaptation. arXiv preprint arXiv:1802.08735, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[39]

Adversarial Discriminative Domain Adaptation

E. Tzeng, J. Hoffman, K. Saenko, and T. Darrell. Ad- versarial discriminative domain adaptation. arXiv preprint arXiv:1702.05464, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[40]

Venkateswara, J

H. Venkateswara, J. Eusebio, S. Chakraborty, and S. Pan- chanathan. Deep hashing network for unsupervised domain adaptation. In Proc. CVPR, pages 5018–5027, 2017

work page 2017
[41]

Y . X. Wang, R. Girshick, M. Hebert, and B. Hariharan. Low- shot learning from imaginary data. 2018

work page 2018
[42]

J. Xie, R. Girshick, and A. Farhadi. Unsupervised deep em- bedding for clustering analysis. In International conference on machine learning, pages 478–487, 2016

work page 2016
[43]

H. Xu, H. Zhang, Z. Hu, X. Liang, R. Salakhutdinov, and E. Xing. Autoloss: Learning discrete schedules for alternate optimization. 2018

work page 2018
[44]

R. Xu, Z. Chen, W. Zuo, J. Yan, and L. Lin. Deep cock- tail network: Multi-source unsupervised domain adaptation with category shift. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 3964– 3973, 2018

work page 2018
[45]

J. Yang, R. Yan, and A. G. Hauptmann. Cross-domain video concept detection using adaptive svms. InProceedings of the 15th ACM international conference on Multimedia , pages 188–197. ACM, 2007

work page 2007
[46]

L. Yang, X. Liang, T. Wang, and E. Xing. Real-to-virtual do- main uniﬁcation for end-to-end autonomous driving. 2018

work page 2018
[47]

Y . You, X. Pan, Z. Wang, and C. Lu. Virtual to real rein- forcement learning for autonomous driving. 2017

work page 2017
[48]

H. Yu, M. Hu, and S. Chen. Multi-target unsupervised do- main adaptation without exactly shared categories. 2018

work page 2018
[49]

Central Moment Discrepancy (CMD) for Domain-Invariant Representation Learning

W. Zellinger, T. Grubinger, E. Lughofer, T. Natschl ¨ager, and S. Saminger-Platz. Central moment discrepancy (cmd) for domain-invariant representation learning. arXiv preprint arXiv:1702.08811, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[50]

H. Zhao, S. Zhang, G. Wu, J. ao P. Costeira, J. M. F. Moura, and G. J. Gordon. Multiple source domain adaptation with adversarial learning, 2018

work page 2018
[51]

Neural Architecture Search with Reinforcement Learning

B. Zoph and Q. V . Le. Neural architecture search with rein- forcement learning. arXiv preprint arXiv:1611.01578, 2016. 14

work page internal anchor Pith review Pith/arXiv arXiv 2016

[1] [1]

Andrychowicz, M

M. Andrychowicz, M. Denil, S. Gomez, M. W. Hoffman, D. Pfau, T. Schaul, B. Shillingford, and N. De Freitas. Learn- ing to learn by gradient descent by gradient descent. In Advances in Neural Information Processing Systems , pages 3981–3989, 2016

work page 2016

[2] [2]

Unsupervised multi-target domain adaptation: An information theoretic approach

Anonymous. Unsupervised multi-target domain adaptation: An information theoretic approach. In Submitted to Interna- tional Conference on Learning Representations, 2019. under review

work page 2019

[3] [3]

J. C. Balloch, V . Agrawal, I. Essa, and S. Chernova. Unbi- asing semantic segmentation for robot perception using syn- thetic data feature transfer.arXiv preprint arXiv:1809.03676, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[4] [4]

Ben-David, J

S. Ben-David, J. Blitzer, K. Crammer, A. Kulesza, F. Pereira, and J. W. Vaughan. A theory of learning from different do- mains. Machine learning, 79(1):151–175, 2010

work page 2010

[5] [5]

Bousmalis, A

K. Bousmalis, A. Irpan, P. Wohlhart, Y . Bai, M. Kelcey, M. Kalakrishnan, L. Downs, J. Ibarz, P. Pastor, K. Kono- lige, et al. Using simulation and domain adaptation to im- prove efﬁciency of deep robotic grasping. In 2018 IEEE In- ternational Conference on Robotics and Automation (ICRA), pages 4243–4250. IEEE, 2018

work page 2018

[6] [6]

Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks

K. Bousmalis, N. Silberman, D. Dohan, D. Erhan, and D. Krishnan. Unsupervised pixel-level domain adapta- tion with generative adversarial networks. arXiv preprint arXiv:1612.05424, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[7] [7]

Y . Chen, W. Li, C. Sakaridis, D. Dai, and L. Van Gool. Do- main adaptive faster r-cnn for object detection in the wild. 2018

work page 2018

[8] [8]

C. Finn, P. Abbeel, and S. Levine. Model-agnostic meta- learning for fast adaptation of deep networks. 2017

work page 2017

[9] [9]

Unsupervised Domain Adaptation by Backpropagation

Y . Ganin and V . Lempitsky. Unsupervised domain adaptation by backpropagation. arXiv preprint arXiv:1409.7495, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[10] [10]

Ganin, E

Y . Ganin, E. Ustinova, H. Ajakan, P. Germain, H. Larochelle, F. Laviolette, M. Marchand, and V . Lempitsky. Domain- Adversarial Training of Neural Networks. 2017

work page 2017

[11] [11]

Fine-grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach

T. Gebru, J. Hoffman, and L. Fei-Fei. Fine-grained recogni- tion in the wild: A multi-task domain adaptation approach. arXiv preprint arXiv:1709.02476, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[12] [12]

Ghifary, W

M. Ghifary, W. B. Kleijn, M. Zhang, D. Balduzzi, and W. Li. Deep reconstruction-classiﬁcation networks for un- supervised domain adaptation. In European Conference on Computer Vision, pages 597–613. Springer, 2016

work page 2016

[13] [13]

Gilad-Bachrach, N

R. Gilad-Bachrach, N. Dowlin, K. Laine, K. Lauter, M. Naehrig, and J. Wernsing. Cryptonets: Applying neu- ral networks to encrypted data with high throughput and ac- curacy. In International Conference on Machine Learning , pages 201–210, 2016

work page 2016

[14] [14]

I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y . Bengio. Generative adversarial nets. In International Conference on Neural Information Processing Systems , pages 2672–2680, 2014

work page 2014

[15] [15]

Gopalan, R

R. Gopalan, R. Li, and R. Chellappa. Domain adaptation for object recognition: An unsupervised approach. In Com- puter Vision (ICCV), 2011 IEEE International Conference on, pages 999–1006. IEEE, 2011

work page 2011

[16] [16]

Grandvalet and Y

Y . Grandvalet and Y . Bengio. Semi-supervised learning by entropy minimization. In Advances in neural information processing systems, pages 529–536, 2005. 13

work page 2005

[17] [17]

Gretton, A

A. Gretton, A. J. Smola, J. Huang, M. Schmittfull, K. M. Borgwardt, and B. Sch¨olkopf. Covariate shift by kernel mean matching. 2009

work page 2009

[18] [18]

X. Guo, L. Gao, X. Liu, and J. Yin. Improved deep embed- ded clustering with local structure preservation. In Interna- tional Joint Conference on Artiﬁcial Intelligence (IJCAI-17), pages 1753–1759, 2017

work page 2017

[19] [19]

K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. 2015

work page 2015

[20] [20]

Hoffman, E

J. Hoffman, E. Tzeng, T. Darrell, and K. Saenko. Simulta- neous deep transfer across domains and tasks. In Domain Adaptation in Computer Vision Applications , pages 173–

work page

[21] [21]

FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation

J. Hoffman, D. Wang, F. Yu, and T. Darrell. Fcns in the wild: Pixel-level adversarial and constraint-based adapta- tion. arXiv preprint arXiv:1612.02649, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[22] [22]

Krizhevsky, I

A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classiﬁcation with deep convolutional neural networks. In Advances in neural information processing systems , pages 1097–1105, 2012

work page 2012

[23] [23]

LeCun, L

Y . LeCun, L. Bottou, Y . Bengio, and P. Haffner. Gradient- based learning applied to document recognition. Proceed- ings of the IEEE, 86(11):2278–2324, 1998

work page 1998

[24] [24]

Liu and O

M.-Y . Liu and O. Tuzel. Coupled generative adversarial net- works. In Advances in neural information processing sys- tems, pages 469–477, 2016

work page 2016

[25] [25]

M. Long, Y . Cao, J. Wang, and M. Jordan. Learning transfer- able features with deep adaptation networks. InInternational Conference on Machine Learning, pages 97–105, 2015

work page 2015

[26] [26]

M. Long, H. Zhu, J. Wang, and M. I. Jordan. Deep trans- fer learning with joint adaptation networks. arXiv preprint arXiv:1605.06636, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[27] [27]

M. Long, H. Zhu, J. Wang, and M. I. Jordan. Unsuper- vised domain adaptation with residual transfer networks. In Advances in Neural Information Processing Systems , pages 136–144, 2016

work page 2016

[28] [28]

M. Long, H. Zhu, J. Wang, and M. I. Jordan. Deep transfer learning with joint adaptation networks. 2017

work page 2017

[29] [29]

Boosting Domain Adaptation by Discovering Latent Domains

M. Mancini, L. Porzi, S. R. Bul `o, B. Caputo, and E. Ricci. Boosting domain adaptation by discovering latent domains. arXiv preprint arXiv:1805.01386, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[30] [30]

Mansour, M

Y . Mansour, M. Mohri, and A. Rostamizadeh. Domain adap- tation with multiple sources. In Advances in neural informa- tion processing systems, pages 1041–1048, 2009

work page 2009

[31] [31]

Miyato, S.-i

T. Miyato, S.-i. Maeda, S. Ishii, and M. Koyama. Virtual adversarial training: a regularization method for supervised and semi-supervised learning. IEEE transactions on pattern analysis and machine intelligence, 2018

work page 2018

[32] [32]

Netzer, T

Y . Netzer, T. Wang, A. Coates, A. Bissacco, B. Wu, and A. Y . Ng. Reading digits in natural images with unsupervised fea- ture learning. Nips Workshop on Deep Learning and Unsu- pervised Feature Learning, 2011

work page 2011

[33] [33]

S. J. Pan and Q. Yang. A survey on transfer learning. IEEE Transactions on knowledge and data engineering , 22(10):1345–1359, 2010

work page 2010

[34] [34]

A. A. Rusu, M. Vecerik, T. Roth ¨orl, N. Heess, R. Pascanu, and R. Hadsell. Sim-to-real robot learning from pixels with progressive nets. arXiv preprint arXiv:1610.04286, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[35] [35]

Saenko, B

K. Saenko, B. Kulis, M. Fritz, and T. Darrell. Adapting vi- sual category models to new domains. Computer Vision– ECCV 2010, pages 213–226, 2010

work page 2010

[36] [36]

Asymmetric Tri-training for Unsupervised Domain Adaptation

K. Saito, Y . Ushiku, and T. Harada. Asymmetric tri- training for unsupervised domain adaptation. arXiv preprint arXiv:1702.08400, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[37] [37]

Generate To Adapt: Aligning Domains using Generative Adversarial Networks

S. Sankaranarayanan, Y . Balaji, C. D. Castillo, and R. Chel- lappa. Generate to adapt: Aligning domains using generative adversarial networks. ArXiv e-prints, abs/1704.01705, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[38] [38]

R. Shu, H. H. Bui, H. Narui, and S. Ermon. A dirt-t ap- proach to unsupervised domain adaptation. arXiv preprint arXiv:1802.08735, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[39] [39]

Adversarial Discriminative Domain Adaptation

E. Tzeng, J. Hoffman, K. Saenko, and T. Darrell. Ad- versarial discriminative domain adaptation. arXiv preprint arXiv:1702.05464, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[40] [40]

Venkateswara, J

H. Venkateswara, J. Eusebio, S. Chakraborty, and S. Pan- chanathan. Deep hashing network for unsupervised domain adaptation. In Proc. CVPR, pages 5018–5027, 2017

work page 2017

[41] [41]

Y . X. Wang, R. Girshick, M. Hebert, and B. Hariharan. Low- shot learning from imaginary data. 2018

work page 2018

[42] [42]

J. Xie, R. Girshick, and A. Farhadi. Unsupervised deep em- bedding for clustering analysis. In International conference on machine learning, pages 478–487, 2016

work page 2016

[43] [43]

H. Xu, H. Zhang, Z. Hu, X. Liang, R. Salakhutdinov, and E. Xing. Autoloss: Learning discrete schedules for alternate optimization. 2018

work page 2018

[44] [44]

R. Xu, Z. Chen, W. Zuo, J. Yan, and L. Lin. Deep cock- tail network: Multi-source unsupervised domain adaptation with category shift. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 3964– 3973, 2018

work page 2018

[45] [45]

J. Yang, R. Yan, and A. G. Hauptmann. Cross-domain video concept detection using adaptive svms. InProceedings of the 15th ACM international conference on Multimedia , pages 188–197. ACM, 2007

work page 2007

[46] [46]

L. Yang, X. Liang, T. Wang, and E. Xing. Real-to-virtual do- main uniﬁcation for end-to-end autonomous driving. 2018

work page 2018

[47] [47]

Y . You, X. Pan, Z. Wang, and C. Lu. Virtual to real rein- forcement learning for autonomous driving. 2017

work page 2017

[48] [48]

H. Yu, M. Hu, and S. Chen. Multi-target unsupervised do- main adaptation without exactly shared categories. 2018

work page 2018

[49] [49]

Central Moment Discrepancy (CMD) for Domain-Invariant Representation Learning

W. Zellinger, T. Grubinger, E. Lughofer, T. Natschl ¨ager, and S. Saminger-Platz. Central moment discrepancy (cmd) for domain-invariant representation learning. arXiv preprint arXiv:1702.08811, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[50] [50]

H. Zhao, S. Zhang, G. Wu, J. ao P. Costeira, J. M. F. Moura, and G. J. Gordon. Multiple source domain adaptation with adversarial learning, 2018

work page 2018

[51] [51]

Neural Architecture Search with Reinforcement Learning

B. Zoph and Q. V . Le. Neural architecture search with rein- forcement learning. arXiv preprint arXiv:1611.01578, 2016. 14

work page internal anchor Pith review Pith/arXiv arXiv 2016