Bayesian Uncertainty Matching for Unsupervised Domain Adaptation

Changyou Chen; Junsong Yuan; Jun Wen; Nenggan Zheng; Zhefeng Gong

arxiv: 1906.09693 · v1 · pith:NYMAYKUTnew · submitted 2019-06-24 · 💻 cs.LG · cs.CV· stat.ML

Bayesian Uncertainty Matching for Unsupervised Domain Adaptation

Jun Wen , Nenggan Zheng , Junsong Yuan , Zhefeng Gong , Changyou Chen This is my paper

Pith reviewed 2026-05-25 17:45 UTC · model grok-4.3

classification 💻 cs.LG cs.CVstat.ML

keywords unsupervised domain adaptationBayesian neural networkuncertainty estimationdistribution matchingnegative transferlabel distribution shiftjoint distribution alignment

0 comments

The pith

Bayesian uncertainty estimates enable joint feature and label distribution matching to reduce domain shift in unsupervised adaptation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Domain shift persists in adaptation even after feature alignment because label distributions can still differ between source and target. The paper establishes that a Bayesian neural network can quantify prediction uncertainty to serve as a proxy for matching label distributions jointly with features. This produces more consistent classifier outputs across domains and includes adaptive loss reweighting for stable training. The result is reduced negative transfer compared with prior feature-only methods. Experiments on standard benchmarks support the gains in accuracy and robustness.

Core claim

The paper claims that imposing distribution matching on both features and labels via uncertainty from a Bayesian neural network achieves approximate joint distribution matching. This alleviates label distribution mismatch that remains after marginal feature alignment, encouraging the classifier to produce consistent predictions across source and target domains. Adaptive reweighting of the adaptation loss further supports nontrivial matching and stable optimization.

What carries the argument

Bayesian neural network that quantifies prediction uncertainty as a proxy for label distribution, enabling joint matching of feature and label distributions.

If this is right

The classifier produces predictions that remain consistent when inputs cross from source to target domain.
Negative transfer is reduced because label mismatch is explicitly addressed rather than ignored.
Adaptive reweighting of the adaptation loss yields stable training and nontrivial distribution alignment.
The method outperforms prior unsupervised domain adaptation approaches on three popular benchmark datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same uncertainty proxy could be tested with non-Bayesian uncertainty estimators to check whether the Bayesian formulation is essential.
The joint matching idea might extend to settings with partial target labels by using the same uncertainty signal for semi-supervised refinement.
If label shift is the dominant remaining error source, the approach predicts larger gains on tasks where class priors differ markedly between domains.

Load-bearing premise

Uncertainty estimates from the Bayesian neural network are a sufficient proxy for the unobserved target label distribution.

What would settle it

Performance fails to improve over feature-only matching on datasets where source and target label distributions differ substantially.

Figures

Figures reproduced from arXiv: 1906.09693 by Changyou Chen, Junsong Yuan, Jun Wen, Nenggan Zheng, Zhefeng Gong.

**Figure 2.** Figure 2: Pipeline of the proposed method. We adaptively match the [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: The t-SNE visualizations of features on the USPS [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Comparisons of target test accuracy and uncertainty on the [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

read the original abstract

Domain adaptation is an important technique to alleviate performance degradation caused by domain shift, e.g., when training and test data come from different domains. Most existing deep adaptation methods focus on reducing domain shift by matching marginal feature distributions through deep transformations on the input features, due to the unavailability of target domain labels. We show that domain shift may still exist via label distribution shift at the classifier, thus deteriorating model performances. To alleviate this issue, we propose an approximate joint distribution matching scheme by exploiting prediction uncertainty. Specifically, we use a Bayesian neural network to quantify prediction uncertainty of a classifier. By imposing distribution matching on both features and labels (via uncertainty), label distribution mismatching in source and target data is effectively alleviated, encouraging the classifier to produce consistent predictions across domains. We also propose a few techniques to improve our method by adaptively reweighting domain adaptation loss to achieve nontrivial distribution matching and stable training. Comparisons with state of the art unsupervised domain adaptation methods on three popular benchmark datasets demonstrate the superiority of our approach, especially on the effectiveness of alleviating negative transfer.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

They add Bayesian uncertainty matching on top of feature alignment to handle label distribution shift in UDA, but the proxy from uncertainty to labels is too lossy to carry the main claim without further justification.

read the letter

The main thing to know is that this paper identifies label distribution shift as a remaining problem after standard feature matching in unsupervised domain adaptation and proposes to address it by also matching prediction uncertainty from a Bayesian neural network. The approach is presented as approximate joint distribution matching, with some extra adaptive reweighting tricks for the losses to stabilize training. That specific mechanism using uncertainty for the label part does not appear in the prior work they cite, so the idea itself is new within the subfield. The abstract claims better results than existing methods on three benchmarks and reduced negative transfer, which is the kind of outcome that would matter if the experiments hold up. The paper does a clear job laying out why feature-only matching can leave classifier-level mismatch. The soft spot is the central link: uncertainty (entropy or variance) is a many-to-one summary, so two different label distributions can produce identical uncertainty statistics. The paper does not supply a derivation showing that aligning the uncertainty marginal bounds the label divergence or that class-conditional uncertainty is being matched. Without that, the claim that label mismatching is effectively alleviated rests on an assumption that may not hold. The abstract gives no numbers, ablations, or error analysis, so the strength of the empirical support cannot be judged from what is here. This is the sort of paper that domain adaptation researchers would want to see in review to check whether the experiments actually demonstrate the uncertainty step doing the claimed work. It is worth sending to referees even if the final verdict ends up mixed.

Referee Report

3 major / 2 minor

Summary. The paper proposes Bayesian Uncertainty Matching (BUM) for unsupervised domain adaptation. It augments standard feature-distribution matching with an additional term that matches predictive uncertainty (quantified via a Bayesian neural network) between source and target domains, arguing that this approximates joint distribution matching and thereby alleviates label-distribution shift. Adaptive reweighting of the adaptation losses is introduced for stable training. Experiments on three standard UDA benchmarks are reported to show superiority over prior methods, with particular emphasis on reduced negative transfer.

Significance. If the uncertainty-matching construction can be shown to provide a reliable proxy for label-distribution alignment, the approach would supply a practical, label-free mechanism for handling a common failure mode in feature-only adaptation. The empirical comparisons on established benchmarks would then constitute useful evidence of practical utility. However, the absence of a derivation linking uncertainty divergence to label-distribution divergence limits the strength of the central claim.

major comments (3)

[Abstract, §3] Abstract and §3: the central claim that 'imposing distribution matching on both features and labels (via uncertainty)' alleviates label-distribution mismatch rests on the unproven assumption that aligning a scalar or low-dimensional summary of predictive uncertainty (entropy or variance) is sufficient to align the unobserved label marginals. No bound or derivation is supplied showing that the chosen uncertainty divergence implies control over label-distribution divergence; different class-probability vectors can produce identical uncertainty statistics.
[§4] §4 (experimental section): while superiority on three benchmarks is asserted, the reported tables do not include an ablation that isolates the contribution of the uncertainty-matching term versus feature matching alone, nor do they quantify the reduction in negative transfer with a direct metric (e.g., target-label distribution divergence before/after). Without these controls it is unclear whether the observed gains are attributable to the proposed mechanism.
[§3.2] §3.2, Eq. (uncertainty loss): the adaptive reweighting scheme is presented as ensuring 'nontrivial distribution matching,' yet the reweighting depends on the same uncertainty estimates whose sufficiency for label alignment is already in question; this creates a potential circularity that is not analyzed.

minor comments (2)

[§3] Notation for the Bayesian posterior and the uncertainty divergence should be introduced with explicit definitions before first use in §3.
[Figures] Figure captions should state the exact datasets, number of runs, and error bars used so that the reported superiority can be directly compared with prior work.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive feedback on our manuscript. We address each of the major comments below and indicate the revisions we plan to make.

read point-by-point responses

Referee: [Abstract, §3] Abstract and §3: the central claim that 'imposing distribution matching on both features and labels (via uncertainty)' alleviates label-distribution mismatch rests on the unproven assumption that aligning a scalar or low-dimensional summary of predictive uncertainty (entropy or variance) is sufficient to align the unobserved label marginals. No bound or derivation is supplied showing that the chosen uncertainty divergence implies control over label-distribution divergence; different class-probability vectors can produce identical uncertainty statistics.

Authors: We acknowledge that our paper presents the uncertainty matching as an approximate method for joint distribution alignment without providing a formal bound or derivation. The motivation is that predictive uncertainty from a Bayesian model reflects the classifier's confidence, which is influenced by the label distribution. While different probability vectors can yield the same uncertainty, in practice the matching encourages consistency in predictions across domains. We will revise §3 to more clearly state this as an approximation and discuss the limitations. revision: partial
Referee: [§4] §4 (experimental section): while superiority on three benchmarks is asserted, the reported tables do not include an ablation that isolates the contribution of the uncertainty-matching term versus feature matching alone, nor do they quantify the reduction in negative transfer with a direct metric (e.g., target-label distribution divergence before/after). Without these controls it is unclear whether the observed gains are attributable to the proposed mechanism.

Authors: We agree that including an ablation study to isolate the effect of the uncertainty-matching term and a direct metric for negative transfer would provide stronger evidence. We will add these analyses to the experimental section in the revised manuscript. revision: yes
Referee: [§3.2] §3.2, Eq. (uncertainty loss): the adaptive reweighting scheme is presented as ensuring 'nontrivial distribution matching,' yet the reweighting depends on the same uncertainty estimates whose sufficiency for label alignment is already in question; this creates a potential circularity that is not analyzed.

Authors: The adaptive reweighting is designed to balance the losses dynamically based on current uncertainty estimates to prevent trivial solutions where one loss dominates. While it does rely on the uncertainty, the estimates are updated during training, and the reweighting is a practical heuristic for stability. We will add an analysis of this scheme and discuss any potential circularity in the revised version. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The provided abstract and description present a methodological proposal that builds on standard Bayesian neural network uncertainty quantification to perform feature and uncertainty-based matching. No equations, derivations, or self-citations are exhibited that reduce the claimed alleviation of label distribution shift to a fitted parameter, self-definition, or load-bearing self-citation chain by construction. The approach is self-contained against external benchmarks and does not rename known results or smuggle ansatzes via citation in the visible text. The skeptic concern addresses assumption strength rather than circular reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that uncertainty from a Bayesian classifier can proxy label distribution shift; no free parameters or invented entities are described in the abstract.

axioms (1)

domain assumption Bayesian neural networks produce uncertainty estimates that reliably reflect label distribution differences between domains
Invoked to justify the joint-distribution matching step described in the abstract.

pith-pipeline@v0.9.0 · 5724 in / 1150 out tokens · 22655 ms · 2026-05-25T17:45:15.377884+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

25 extracted references · 25 canonical work pages

[1]

A theory of learning from different domains

Shai Ben-David, John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, and Jennifer Wortman Vaughan. A theory of learning from different domains. Machine learning , 79(1-2):151--175, 2010

work page 2010
[2]

Weight uncertainty in neural network

Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, and Daan Wierstra. Weight uncertainty in neural network. In International Conference on Machine Learning , pages 1613--1622, 2015

work page 2015
[3]

Re-weighted adversarial adaptation network for unsupervised domain adaptation

Qingchao Chen, Yang Liu, Zhaowen Wang, Ian Wassell, and Kevin Chetty. Re-weighted adversarial adaptation network for unsupervised domain adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 7976--7985, 2018

work page 2018
[4]

Dropout as a bayesian approximation: Representing model uncertainty in deep learning

Yarin Gal and Zoubin Ghahramani. Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning , pages 1050--1059, 2016

work page 2016
[5]

Domain-adversarial training of neural networks

Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, Fran c ois Laviolette, Mario Marchand, and Victor Lempitsky. Domain-adversarial training of neural networks. The Journal of Machine Learning Research , 17(1):2096--2030, 2016

work page 2096
[6]

Geodesic flow kernel for unsupervised domain adaptation

Boqing Gong, Yuan Shi, Fei Sha, and Kristen Grauman. Geodesic flow kernel for unsupervised domain adaptation. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , pages 2066--2073. IEEE, 2012

work page 2012
[7]

Generative adversarial nets

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. In Advances in neural information processing systems , pages 2672--2680, 2014

work page 2014
[8]

Practical variational inference for neural networks

Alex Graves. Practical variational inference for neural networks. In Advances in neural information processing systems , pages 2348--2356, 2011

work page 2011
[9]

C y CADA : Cycle-consistent adversarial domain adaptation

Judy Hoffman, Eric Tzeng, Taesung Park, Jun-Yan Zhu, Phillip Isola, Kate Saenko, Alexei Efros, and Trevor Darrell. C y CADA : Cycle-consistent adversarial domain adaptation. In Proceedings of the 35th International Conference on Machine Learning , volume 80, pages 1989--1998. PMLR, 2018

work page 1989
[10]

Correcting sample selection bias by unlabeled data

Jiayuan Huang, Arthur Gretton, Karsten M Borgwardt, Bernhard Sch \"o lkopf, and Alex J Smola. Correcting sample selection bias by unlabeled data. In Advances in neural information processing systems , pages 601--608, 2007

work page 2007
[11]

What uncertainties do we need in bayesian deep learning for computer vision? In Advances in neural information processing systems , pages 5574--5584, 2017

Alex Kendall and Yarin Gal. What uncertainties do we need in bayesian deep learning for computer vision? In Advances in neural information processing systems , pages 5574--5584, 2017

work page 2017
[12]

End-to-end adversarial memory network for cross-domain sentiment classification

Zheng Li, Yun Zhang, Ying Wei, Yuxiang Wu, and Qiang Yang. End-to-end adversarial memory network for cross-domain sentiment classification. In IJCAI , pages 2237--2243, 2017

work page 2017
[13]

Deep transfer learning with joint adaptation networks

Mingsheng Long, Han Zhu, Jianmin Wang, and Michael I Jordan. Deep transfer learning with joint adaptation networks. In Proceedings of the 34th International Conference on Machine Learning-Volume 70 , pages 2208--2217. JMLR. org, 2017

work page 2017
[14]

Conditional adversarial domain adaptation

Mingsheng Long, ZHANGJIE CAO, Jianmin Wang, and Michael I Jordan. Conditional adversarial domain adaptation. In Advances in Neural Information Processing Systems 31 , pages 1647--1657. Curran Associates, Inc., 2018

work page 2018
[15]

Visualizing data using t-sne

Laurens van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research , 9(Nov):2579--2605, 2008

work page 2008
[16]

Multi-adversarial domain adaptation

Zhongyi Pei, Zhangjie Cao, Mingsheng Long, and Jianmin Wang. Multi-adversarial domain adaptation. In AAAI Conference on Artificial Intelligence , pages 3934--3941, 2018

work page 2018
[17]

Adapting visual category models to new domains

Kate Saenko, Brian Kulis, Mario Fritz, and Trevor Darrell. Adapting visual category models to new domains. In European conference on computer vision , pages 213--226. Springer, 2010

work page 2010
[18]

A dirt-t approach to unsupervised domain adaptation

Rui Shu, Hung H Bui, Hirokazu Narui, and Stefano Ermon. A dirt-t approach to unsupervised domain adaptation. In Proc. 6th International Conference on Learning Representations , 2018

work page 2018
[19]

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research , 15(1):1929--1958, 2014

work page 1929
[20]

Deep coral: Correlation alignment for deep domain adaptation

Baochen Sun and Kate Saenko. Deep coral: Correlation alignment for deep domain adaptation. In European Conference on Computer Vision , pages 443--450. Springer, 2016

work page 2016
[21]

Adversarial discriminative domain adaptation

Eric Tzeng, Judy Hoffman, Kate Saenko, and Trevor Darrell. Adversarial discriminative domain adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 7167--7176, 2017

work page 2017
[22]

Deep hashing network for unsupervised domain adaptation

Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, and Sethuraman Panchanathan. Deep hashing network for unsupervised domain adaptation. In Proc. CVPR , pages 5018--5027, 2017

work page 2017
[23]

Exploiting local feature patterns for unsupervised domain adaptation

Jun Wen, Risheng Liu, Nenggan Zheng, Qian Zheng, Zhefeng Gong, and Junsong Yuan. Exploiting local feature patterns for unsupervised domain adaptation. In Thirty-Third AAAI Conference on Artificial Intelligence , 2019

work page 2019
[24]

Tsang, Sinno Jialin Pan, and Mingkui Tan

Joey Tianyi Zhou, Ivor W. Tsang, Sinno Jialin Pan, and Mingkui Tan. Multi-class heterogeneous domain adaptation. Journal of Machine Learning Research , 20(57):1--31, 2019

work page 2019
[25]

write newline

" write newline "" before.all 'output.state := FUNCTION fin.entry add.period write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION new.sentence output.state after.block = 'skip output.state before.all = 'skip after.sentence 'output.state := if if FUNCTION not #0 #1 if FUNCTION and 'skip pop #0 if FUNCTIO...

work page

[1] [1]

A theory of learning from different domains

Shai Ben-David, John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, and Jennifer Wortman Vaughan. A theory of learning from different domains. Machine learning , 79(1-2):151--175, 2010

work page 2010

[2] [2]

Weight uncertainty in neural network

Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, and Daan Wierstra. Weight uncertainty in neural network. In International Conference on Machine Learning , pages 1613--1622, 2015

work page 2015

[3] [3]

Re-weighted adversarial adaptation network for unsupervised domain adaptation

Qingchao Chen, Yang Liu, Zhaowen Wang, Ian Wassell, and Kevin Chetty. Re-weighted adversarial adaptation network for unsupervised domain adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 7976--7985, 2018

work page 2018

[4] [4]

Dropout as a bayesian approximation: Representing model uncertainty in deep learning

Yarin Gal and Zoubin Ghahramani. Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning , pages 1050--1059, 2016

work page 2016

[5] [5]

Domain-adversarial training of neural networks

Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, Fran c ois Laviolette, Mario Marchand, and Victor Lempitsky. Domain-adversarial training of neural networks. The Journal of Machine Learning Research , 17(1):2096--2030, 2016

work page 2096

[6] [6]

Geodesic flow kernel for unsupervised domain adaptation

Boqing Gong, Yuan Shi, Fei Sha, and Kristen Grauman. Geodesic flow kernel for unsupervised domain adaptation. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , pages 2066--2073. IEEE, 2012

work page 2012

[7] [7]

Generative adversarial nets

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. In Advances in neural information processing systems , pages 2672--2680, 2014

work page 2014

[8] [8]

Practical variational inference for neural networks

Alex Graves. Practical variational inference for neural networks. In Advances in neural information processing systems , pages 2348--2356, 2011

work page 2011

[9] [9]

C y CADA : Cycle-consistent adversarial domain adaptation

Judy Hoffman, Eric Tzeng, Taesung Park, Jun-Yan Zhu, Phillip Isola, Kate Saenko, Alexei Efros, and Trevor Darrell. C y CADA : Cycle-consistent adversarial domain adaptation. In Proceedings of the 35th International Conference on Machine Learning , volume 80, pages 1989--1998. PMLR, 2018

work page 1989

[10] [10]

Correcting sample selection bias by unlabeled data

Jiayuan Huang, Arthur Gretton, Karsten M Borgwardt, Bernhard Sch \"o lkopf, and Alex J Smola. Correcting sample selection bias by unlabeled data. In Advances in neural information processing systems , pages 601--608, 2007

work page 2007

[11] [11]

What uncertainties do we need in bayesian deep learning for computer vision? In Advances in neural information processing systems , pages 5574--5584, 2017

Alex Kendall and Yarin Gal. What uncertainties do we need in bayesian deep learning for computer vision? In Advances in neural information processing systems , pages 5574--5584, 2017

work page 2017

[12] [12]

End-to-end adversarial memory network for cross-domain sentiment classification

Zheng Li, Yun Zhang, Ying Wei, Yuxiang Wu, and Qiang Yang. End-to-end adversarial memory network for cross-domain sentiment classification. In IJCAI , pages 2237--2243, 2017

work page 2017

[13] [13]

Deep transfer learning with joint adaptation networks

Mingsheng Long, Han Zhu, Jianmin Wang, and Michael I Jordan. Deep transfer learning with joint adaptation networks. In Proceedings of the 34th International Conference on Machine Learning-Volume 70 , pages 2208--2217. JMLR. org, 2017

work page 2017

[14] [14]

Conditional adversarial domain adaptation

Mingsheng Long, ZHANGJIE CAO, Jianmin Wang, and Michael I Jordan. Conditional adversarial domain adaptation. In Advances in Neural Information Processing Systems 31 , pages 1647--1657. Curran Associates, Inc., 2018

work page 2018

[15] [15]

Visualizing data using t-sne

Laurens van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research , 9(Nov):2579--2605, 2008

work page 2008

[16] [16]

Multi-adversarial domain adaptation

Zhongyi Pei, Zhangjie Cao, Mingsheng Long, and Jianmin Wang. Multi-adversarial domain adaptation. In AAAI Conference on Artificial Intelligence , pages 3934--3941, 2018

work page 2018

[17] [17]

Adapting visual category models to new domains

Kate Saenko, Brian Kulis, Mario Fritz, and Trevor Darrell. Adapting visual category models to new domains. In European conference on computer vision , pages 213--226. Springer, 2010

work page 2010

[18] [18]

A dirt-t approach to unsupervised domain adaptation

Rui Shu, Hung H Bui, Hirokazu Narui, and Stefano Ermon. A dirt-t approach to unsupervised domain adaptation. In Proc. 6th International Conference on Learning Representations , 2018

work page 2018

[19] [19]

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research , 15(1):1929--1958, 2014

work page 1929

[20] [20]

Deep coral: Correlation alignment for deep domain adaptation

Baochen Sun and Kate Saenko. Deep coral: Correlation alignment for deep domain adaptation. In European Conference on Computer Vision , pages 443--450. Springer, 2016

work page 2016

[21] [21]

Adversarial discriminative domain adaptation

Eric Tzeng, Judy Hoffman, Kate Saenko, and Trevor Darrell. Adversarial discriminative domain adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 7167--7176, 2017

work page 2017

[22] [22]

Deep hashing network for unsupervised domain adaptation

Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, and Sethuraman Panchanathan. Deep hashing network for unsupervised domain adaptation. In Proc. CVPR , pages 5018--5027, 2017

work page 2017

[23] [23]

Exploiting local feature patterns for unsupervised domain adaptation

Jun Wen, Risheng Liu, Nenggan Zheng, Qian Zheng, Zhefeng Gong, and Junsong Yuan. Exploiting local feature patterns for unsupervised domain adaptation. In Thirty-Third AAAI Conference on Artificial Intelligence , 2019

work page 2019

[24] [24]

Tsang, Sinno Jialin Pan, and Mingkui Tan

Joey Tianyi Zhou, Ivor W. Tsang, Sinno Jialin Pan, and Mingkui Tan. Multi-class heterogeneous domain adaptation. Journal of Machine Learning Research , 20(57):1--31, 2019

work page 2019

[25] [25]

write newline

" write newline "" before.all 'output.state := FUNCTION fin.entry add.period write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION new.sentence output.state after.block = 'skip output.state before.all = 'skip after.sentence 'output.state := if if FUNCTION not #0 #1 if FUNCTION and 'skip pop #0 if FUNCTIO...

work page