Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning

Haiyong Xie; Jian Ni; Shanghang Zhang

arxiv: 1907.05570 · v1 · pith:OGGMOQV4new · submitted 2019-07-12 · 💻 cs.CV

Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning

Jian Ni , Shanghang Zhang , Haiyong Xie This is my paper

Pith reviewed 2026-05-24 22:55 UTC · model grok-4.3

classification 💻 cs.CV

keywords generalized zero-shot learninggenerative adversarial networkfeature synthesissemantic consistencyadversarial learningcomputer vision

0 comments

The pith

A dual-GAN network synthesizes discriminative and semantics-preserving features for generalized zero-shot learning.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces the Dual Adversarial Semantics-Consistent Network (DASCN) to address challenges in generalized zero-shot learning where both seen and unseen classes must be recognized at test time. Existing methods either lose semantic information or fail to ensure visual-semantic interactions. DASCN learns a primal GAN to generate visual features from semantic representations and a dual GAN to enforce that these features align with prior semantic knowledge through adversarial learning. This unified framework is claimed to produce features that are both inter-class discriminative and semantics-preserving. Experiments indicate it achieves better results than previous state-of-the-art methods on GZSL benchmarks.

Core claim

DASCN learns primal and dual Generative Adversarial Networks in a unified framework for GZSL. The primal GAN synthesizes inter-class discriminative and semantics-preserving visual features from semantic representations of seen and unseen classes as well as those reconstructed by the dual GAN. The dual GAN enforces the synthetic visual features to represent prior semantic knowledge well via semantics-consistent adversarial learning.

What carries the argument

The dual adversarial semantics-consistent network with a primal GAN for feature synthesis and a dual GAN for semantics-consistent adversarial learning.

If this is right

The approach guarantees better visual-semantic interactions than embedding-based methods.
Synthetic features maintain discriminative power between classes.
Performance improves on both seen and unseen classes during testing.
The dual mechanism prevents semantic loss during feature generation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the dual GAN prevents semantic drift, similar dual structures could be tested in other generative vision tasks.
The dual-GAN idea might extend to cross-modal synthesis problems beyond vision.
Ablation studies isolating the dual component would clarify its contribution to consistency.

Load-bearing premise

The semantics-consistent adversarial learning in the dual GAN can enforce prior semantic knowledge on the synthetic features without introducing mode collapse or semantic drift.

What would settle it

An ablation test removing the dual GAN and checking whether the primal GAN's synthetic features lose alignment with semantic priors or show lower accuracy on unseen classes.

Figures

Figures reproduced from arXiv: 1907.05570 by Haiyong Xie, Jian Ni, Shanghang Zhang.

**Figure 2.** Figure 2: Network architecture of DASCN. The semantic feature of class [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: (a): t-SNE visualization of real visual feature distribution and synthesized feature distribution from randomly selected three unseen classes; (b, c): Increasing the number of samples generated by DASCN and its variants wrt harmonic mean H. DASCN w/o SC denotes DASCN without semantic consistency constraint and DASCN w/o VC stands for that without visual consistency constraint. 4.5 Quality of Synthesized S… view at source ↗

read the original abstract

Generalized zero-shot learning (GZSL) is a challenging class of vision and knowledge transfer problems in which both seen and unseen classes appear during testing. Existing GZSL approaches either suffer from semantic loss and discard discriminative information at the embedding stage, or cannot guarantee the visual-semantic interactions. To address these limitations, we propose the Dual Adversarial Semantics-Consistent Network (DASCN), which learns primal and dual Generative Adversarial Networks (GANs) in a unified framework for GZSL. In particular, the primal GAN learns to synthesize inter-class discriminative and semantics-preserving visual features from both the semantic representations of seen/unseen classes and the ones reconstructed by the dual GAN. The dual GAN enforces the synthetic visual features to represent prior semantic knowledge well via semantics-consistent adversarial learning. To the best of our knowledge, this is the first work that employs a novel dual-GAN mechanism for GZSL. Extensive experiments show that our approach achieves significant improvements over the state-of-the-art approaches.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The dual-GAN setup for GZSL adds an incremental consistency mechanism but risks semantic drift without stronger evidence.

read the letter

The punchline is this dual GAN thing for zero-shot learning. One part makes the features, the other tries to keep them semantically consistent. That's the main thing your colleague should know. It's new because no one has done the dual setup before in this area, at least according to the abstract. The paper does a good job pointing out the issues with old methods like losing info in embeddings or not having good interactions, and proposing this unified way to handle both discrimination and consistency using the primal and dual networks. The results look promising on paper, with big gains claimed over state of the art. Where it might be soft is in proving that the dual part actually stops the features from drifting semantically or collapsing. The description is high level, no math shown here, and we know GANs can be finicky with training. If the loop doesn't enforce the semantics well, the advantage goes away. The stress test about drift seems worth checking in the full text. This is for folks deep in zero-shot vision work who are already playing with generative models. If you're into GANs for this, it could give you some ideas to try, even if it's incremental. I'd say send it to review so someone can dig into the details and see if the consistency holds up in the experiments and ablations.

Referee Report

2 major / 1 minor

Summary. The paper proposes the Dual Adversarial Semantics-Consistent Network (DASCN) for generalized zero-shot learning (GZSL). It learns primal and dual GANs in a unified framework: the primal GAN synthesizes inter-class discriminative and semantics-preserving visual features from semantic representations of seen/unseen classes and reconstructions from the dual GAN; the dual GAN enforces semantics consistency on the synthetic features via adversarial learning. The authors claim this is the first dual-GAN approach for GZSL and report significant improvements over prior SOTA methods on standard benchmarks.

Significance. If the dual-GAN loop reliably prevents semantic drift and mode collapse while preserving inter-class discriminability, the approach could meaningfully advance generative GZSL by jointly addressing semantic loss and visual-semantic interaction failures in a single framework. The explicit dual mechanism is a clear architectural distinction from prior single-GAN or embedding methods.

major comments (2)

[Abstract] Abstract: the central claim that the dual GAN 'enforces the synthetic visual features to represent prior semantic knowledge well via semantics-consistent adversarial learning' and that the primal GAN uses 'the ones reconstructed by the dual GAN' supplies no loss equations, consistency metric, or stability argument. Without these, it is impossible to verify whether the dual loop actually prevents the semantic drift or mode collapse that would invalidate feature synthesis for unseen classes.
[Abstract] Abstract: the assertion of 'significant improvements over the state-of-the-art approaches' is presented without reference to specific datasets, metrics, or controls for hyperparameter tuning and training stability that are known to be critical in GAN-based GZSL work; this makes the empirical claim difficult to assess as load-bearing evidence for the dual mechanism.

minor comments (1)

[Abstract] The abstract states 'to the best of our knowledge, this is the first work that employs a novel dual-GAN mechanism for GZSL' but does not cite or contrast with any prior dual-GAN or multi-adversarial GZSL papers; adding such references would clarify novelty.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the two major comments on the abstract below, clarifying that the full technical details appear in the body of the paper while agreeing to strengthen the abstract for clarity.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that the dual GAN 'enforces the synthetic visual features to represent prior semantic knowledge well via semantics-consistent adversarial learning' and that the primal GAN uses 'the ones reconstructed by the dual GAN' supplies no loss equations, consistency metric, or stability argument. Without these, it is impossible to verify whether the dual loop actually prevents the semantic drift or mode collapse that would invalidate feature synthesis for unseen classes.

Authors: The abstract is intentionally concise. The primal-dual losses, including the semantics-consistent adversarial objective and reconstruction terms, are fully specified in Section 3 (Equations 3–7), where the dual GAN is trained to reconstruct visual features from semantic embeddings and the primal GAN conditions on both original and reconstructed semantics. Consistency is measured via the adversarial discriminator that distinguishes real vs. synthetic features under semantic constraints. Stability against drift and collapse is demonstrated empirically in Section 4 via multiple random seeds and ablation studies showing that removing the dual loop degrades harmonic mean accuracy. We will revise the abstract to briefly reference the dual reconstruction mechanism and point to the loss formulation. revision: partial
Referee: [Abstract] Abstract: the assertion of 'significant improvements over the state-of-the-art approaches' is presented without reference to specific datasets, metrics, or controls for hyperparameter tuning and training stability that are known to be critical in GAN-based GZSL work; this makes the empirical claim difficult to assess as load-bearing evidence for the dual mechanism.

Authors: The abstract summarizes the outcome; the concrete evidence appears in Section 4, which reports results on CUB, SUN, AWA1, and AWA2 using the standard GZSL protocol (top-1 accuracy on seen/unseen classes and harmonic mean), with comparisons against 10+ baselines and standard deviations over 5 runs to address stability. Hyperparameter sensitivity is analyzed in the supplementary material. To improve readability we will revise the abstract to name the primary datasets and the harmonic-mean gains. revision: yes

Circularity Check

0 steps flagged

No circularity; novel dual-GAN architecture defined independently of inputs

full rationale

The paper proposes DASCN as an explicit new architecture with primal GAN synthesizing discriminative features from semantic reps plus dual-GAN reconstructions, and dual GAN enforcing semantics-consistent adversarial learning. No equations reduce a claimed prediction to a fitted parameter by construction, no self-citation chain justifies the central premise, and no ansatz or uniqueness result is imported from prior author work. The method is presented as the first dual-GAN mechanism for GZSL, with performance shown via experiments rather than derived tautologically from inputs. This is the common self-contained case.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 1 invented entities

The central claim rests on the untested premise that dual adversarial training can simultaneously preserve semantics and discrimination without collapse; no explicit free parameters or invented entities are named in the abstract, but the dual-GAN architecture itself functions as an invented training structure whose effectiveness is asserted rather than derived from first principles.

invented entities (1)

Dual-GAN mechanism no independent evidence
purpose: Enforce semantics-consistent adversarial learning between primal feature synthesis and dual semantic reconstruction
The paper introduces this paired GAN structure as the core novelty; no independent evidence outside the proposed training loop is provided in the abstract.

pith-pipeline@v0.9.0 · 5702 in / 1263 out tokens · 17304 ms · 2026-05-24T22:55:51.352414+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

34 extracted references · 34 canonical work pages · 1 internal anchor

[1]

Evaluation of output embeddings for ﬁne-grained image classiﬁcation

Zeynep Akata, Scott Reed, Daniel Walter, Honglak Lee, and Bernt Schiele. Evaluation of output embeddings for ﬁne-grained image classiﬁcation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2927–2936, 2015

work page 2015
[2]

Preserving semantic relations for zero-shot learning

Yashas Annadani and Soma Biswas. Preserving semantic relations for zero-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 7603–7612, 2018

work page 2018
[3]

An empirical study and analysis of generalized zero-shot learning for object recognition in the wild

Wei-Lun Chao, Soravit Changpinyo, Boqing Gong, and Fei Sha. An empirical study and analysis of generalized zero-shot learning for object recognition in the wild. In European Conference on Computer Vision, pages 52–68. Springer, 2016

work page 2016
[4]

Zero-shot visual recognition using semantics-preserving adversarial embedding networks

Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, and Shih-Fu Chang. Zero-shot visual recognition using semantics-preserving adversarial embedding networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 1043–1052, 2018

work page 2018
[5]

Infogan: Interpretable representation learning by information maximizing generative adversarial nets

Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in neural information processing systems , pages 2172–2180, 2016

work page 2016
[6]

Describing objects by their attributes

Ali Farhadi, Ian Endres, Derek Hoiem, and David Forsyth. Describing objects by their attributes. In 2009 IEEE Conference on Computer Vision and Pattern Recognition , pages 1778–1785. IEEE, 2009

work page 2009
[7]

Multi-modal cycle-consistent generalized zero-shot learning

Rafael Felix, Vijay BG Kumar, Ian Reid, and Gustavo Carneiro. Multi-modal cycle-consistent generalized zero-shot learning. In Proceedings of the European Conference on Computer Vision (ECCV), pages 21–37, 2018

work page 2018
[8]

De- vise: A deep visual-semantic embedding model

Andrea Frome, Greg S Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Tomas Mikolov, et al. De- vise: A deep visual-semantic embedding model. In Advances in neural information processing systems, pages 2121–2129, 2013

work page 2013
[9]

Improved training of wasserstein gans

Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, and Aaron C Courville. Improved training of wasserstein gans. In Advances in neural information processing systems , pages 5767–5777, 2017

work page 2017
[10]

Synthesizing samples fro zero-shot learning

Yuchen Guo, Guiguang Ding, Jungong Han, and Yue Gao. Synthesizing samples fro zero-shot learning. IJCAI, 2017

work page 2017
[11]

Learning to discover cross-domain relations with generative adversarial networks

Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, and Jiwon Kim. Learning to discover cross-domain relations with generative adversarial networks. In Proceedings of the 34th International Conference on Machine Learning-V olume 70, pages 1857–1865. JMLR. org, 2017

work page 2017
[12]

Semantic autoencoder for zero-shot learning

Elyor Kodirov, Tao Xiang, and Shaogang Gong. Semantic autoencoder for zero-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 3174–3183, 2017

work page 2017
[13]

Generalized zero-shot learning via synthesized examples

Vinay Kumar Verma, Gundeep Arora, Ashish Mishra, and Piyush Rai. Generalized zero-shot learning via synthesized examples. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4281–4289, 2018

work page 2018
[14]

Learning to detect unseen object classes by between-class attribute transfer

Christoph H Lampert, Hannes Nickisch, and Stefan Harmeling. Learning to detect unseen object classes by between-class attribute transfer. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 951–958. IEEE, 2009

work page 2009
[15]

Attribute-based classiﬁcation for zero-shot visual object categorization

Christoph H Lampert, Hannes Nickisch, and Stefan Harmeling. Attribute-based classiﬁcation for zero-shot visual object categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(3):453–465, 2013

work page 2013
[16]

Generalized zero-shot learning with deep calibration network

Shichen Liu, Mingsheng Long, Jianmin Wang, and Michael I Jordan. Generalized zero-shot learning with deep calibration network. In Advances in Neural Information Processing Systems , pages 2005–2015, 2018

work page 2005
[17]

Zero shot learning via low-rank embedded semantic autoencoder

Yang Liu, Quanxue Gao, Jin Li, Jungong Han, and Ling Shao. Zero shot learning via low-rank embedded semantic autoencoder. In IJCAI, pages 2490–2496, 2018. 9

work page 2018
[18]

From zero-shot learning to conventional supervised classiﬁcation: Unseen visual data synthesis

Yang Long, Li Liu, Ling Shao, Fumin Shen, Guiguang Ding, and Jungong Han. From zero-shot learning to conventional supervised classiﬁcation: Unseen visual data synthesis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 1627–1636, 2017

work page 2017
[19]

Visualizing data using t-sne

Laurens van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research, 9(Nov):2579–2605, 2008

work page 2008
[20]

Sun attribute database: Discovering, annotating, and recognizing scene attributes

Genevieve Patterson and James Hays. Sun attribute database: Discovering, annotating, and recognizing scene attributes. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pages 2751–2758. IEEE, 2012

work page 2012
[21]

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

Alec Radford, Luke Metz, and Soumith Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[22]

An embarrassingly simple approach to zero-shot learning

Bernardino Romera-Paredes and Philip Torr. An embarrassingly simple approach to zero-shot learning. In International Conference on Machine Learning , pages 2152–2161, 2015

work page 2015
[23]

Zero-shot learning through cross-modal transfer

Richard Socher, Milind Ganjoo, Christopher D Manning, and Andrew Ng. Zero-shot learning through cross-modal transfer. In Advances in neural information processing systems , pages 935–943, 2013

work page 2013
[24]

Learning to compare: Relation network for few-shot learning

Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip HS Torr, and Timothy M Hospedales. Learning to compare: Relation network for few-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1199–1208, 2018

work page 2018
[25]

Zero-shot recognition via semantic embeddings and knowledge graphs

Xiaolong Wang, Yufei Ye, and Abhinav Gupta. Zero-shot recognition via semantic embeddings and knowledge graphs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 6857–6866, 2018

work page 2018
[26]

Caltech-ucsd birds 200

Peter Welinder, Steve Branson, Takeshi Mita, Catherine Wah, Florian Schroff, Serge Belongie, and Pietro Perona. Caltech-ucsd birds 200. 2010

work page 2010
[27]

Zero-shot learning-a comprehensive evaluation of the good, the bad and the ugly

Yongqin Xian, Christoph H Lampert, Bernt Schiele, and Zeynep Akata. Zero-shot learning-a comprehensive evaluation of the good, the bad and the ugly. IEEE transactions on pattern analysis and machine intelligence, 2018

work page 2018
[28]

Feature generating networks for zero-shot learning

Yongqin Xian, Tobias Lorenz, Bernt Schiele, and Zeynep Akata. Feature generating networks for zero-shot learning. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 5542–5551, 2018

work page 2018
[29]

Dualgan: Unsupervised dual learning for image-to-image translation

Zili Yi, Hao Zhang, Ping Tan, and Minglun Gong. Dualgan: Unsupervised dual learning for image-to-image translation. In Proceedings of the IEEE international conference on computer vision, pages 2849–2857, 2017

work page 2017
[30]

Triple veriﬁcation network for general- ized zero-shot learning

Haofeng Zhang, Yang Long, Yu Guan, and Ling Shao. Triple veriﬁcation network for general- ized zero-shot learning. IEEE Transactions on Image Processing, 28(1):506–517, 2018

work page 2018
[31]

Zero-shot kernel learning

Hongguang Zhang and Piotr Koniusz. Zero-shot kernel learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 7670–7679, 2018

work page 2018
[32]

Learning a deep embedding model for zero-shot learning

Li Zhang, Tao Xiang, and Shaogang Gong. Learning a deep embedding model for zero-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 2021–2030, 2017

work page 2021
[33]

Unpaired image-to-image translation using cycle-consistent adversarial networks

Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. InProceedings of the IEEE international conference on computer vision, pages 2223–2232, 2017

work page 2017
[34]

A generative adversarial approach for zero-shot learning from noisy texts

Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, and Ahmed Elgammal. A generative adversarial approach for zero-shot learning from noisy texts. In Proceedings of the IEEE conference on computer vision and pattern recognition , pages 1004–1013, 2018. 10

work page 2018

[1] [1]

Evaluation of output embeddings for ﬁne-grained image classiﬁcation

Zeynep Akata, Scott Reed, Daniel Walter, Honglak Lee, and Bernt Schiele. Evaluation of output embeddings for ﬁne-grained image classiﬁcation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2927–2936, 2015

work page 2015

[2] [2]

Preserving semantic relations for zero-shot learning

Yashas Annadani and Soma Biswas. Preserving semantic relations for zero-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 7603–7612, 2018

work page 2018

[3] [3]

An empirical study and analysis of generalized zero-shot learning for object recognition in the wild

Wei-Lun Chao, Soravit Changpinyo, Boqing Gong, and Fei Sha. An empirical study and analysis of generalized zero-shot learning for object recognition in the wild. In European Conference on Computer Vision, pages 52–68. Springer, 2016

work page 2016

[4] [4]

Zero-shot visual recognition using semantics-preserving adversarial embedding networks

Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, and Shih-Fu Chang. Zero-shot visual recognition using semantics-preserving adversarial embedding networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 1043–1052, 2018

work page 2018

[5] [5]

Infogan: Interpretable representation learning by information maximizing generative adversarial nets

Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in neural information processing systems , pages 2172–2180, 2016

work page 2016

[6] [6]

Describing objects by their attributes

Ali Farhadi, Ian Endres, Derek Hoiem, and David Forsyth. Describing objects by their attributes. In 2009 IEEE Conference on Computer Vision and Pattern Recognition , pages 1778–1785. IEEE, 2009

work page 2009

[7] [7]

Multi-modal cycle-consistent generalized zero-shot learning

Rafael Felix, Vijay BG Kumar, Ian Reid, and Gustavo Carneiro. Multi-modal cycle-consistent generalized zero-shot learning. In Proceedings of the European Conference on Computer Vision (ECCV), pages 21–37, 2018

work page 2018

[8] [8]

De- vise: A deep visual-semantic embedding model

Andrea Frome, Greg S Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Tomas Mikolov, et al. De- vise: A deep visual-semantic embedding model. In Advances in neural information processing systems, pages 2121–2129, 2013

work page 2013

[9] [9]

Improved training of wasserstein gans

Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, and Aaron C Courville. Improved training of wasserstein gans. In Advances in neural information processing systems , pages 5767–5777, 2017

work page 2017

[10] [10]

Synthesizing samples fro zero-shot learning

Yuchen Guo, Guiguang Ding, Jungong Han, and Yue Gao. Synthesizing samples fro zero-shot learning. IJCAI, 2017

work page 2017

[11] [11]

Learning to discover cross-domain relations with generative adversarial networks

Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, and Jiwon Kim. Learning to discover cross-domain relations with generative adversarial networks. In Proceedings of the 34th International Conference on Machine Learning-V olume 70, pages 1857–1865. JMLR. org, 2017

work page 2017

[12] [12]

Semantic autoencoder for zero-shot learning

Elyor Kodirov, Tao Xiang, and Shaogang Gong. Semantic autoencoder for zero-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 3174–3183, 2017

work page 2017

[13] [13]

Generalized zero-shot learning via synthesized examples

Vinay Kumar Verma, Gundeep Arora, Ashish Mishra, and Piyush Rai. Generalized zero-shot learning via synthesized examples. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4281–4289, 2018

work page 2018

[14] [14]

Learning to detect unseen object classes by between-class attribute transfer

Christoph H Lampert, Hannes Nickisch, and Stefan Harmeling. Learning to detect unseen object classes by between-class attribute transfer. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 951–958. IEEE, 2009

work page 2009

[15] [15]

Attribute-based classiﬁcation for zero-shot visual object categorization

Christoph H Lampert, Hannes Nickisch, and Stefan Harmeling. Attribute-based classiﬁcation for zero-shot visual object categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(3):453–465, 2013

work page 2013

[16] [16]

Generalized zero-shot learning with deep calibration network

Shichen Liu, Mingsheng Long, Jianmin Wang, and Michael I Jordan. Generalized zero-shot learning with deep calibration network. In Advances in Neural Information Processing Systems , pages 2005–2015, 2018

work page 2005

[17] [17]

Zero shot learning via low-rank embedded semantic autoencoder

Yang Liu, Quanxue Gao, Jin Li, Jungong Han, and Ling Shao. Zero shot learning via low-rank embedded semantic autoencoder. In IJCAI, pages 2490–2496, 2018. 9

work page 2018

[18] [18]

From zero-shot learning to conventional supervised classiﬁcation: Unseen visual data synthesis

Yang Long, Li Liu, Ling Shao, Fumin Shen, Guiguang Ding, and Jungong Han. From zero-shot learning to conventional supervised classiﬁcation: Unseen visual data synthesis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 1627–1636, 2017

work page 2017

[19] [19]

Visualizing data using t-sne

Laurens van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research, 9(Nov):2579–2605, 2008

work page 2008

[20] [20]

Sun attribute database: Discovering, annotating, and recognizing scene attributes

Genevieve Patterson and James Hays. Sun attribute database: Discovering, annotating, and recognizing scene attributes. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pages 2751–2758. IEEE, 2012

work page 2012

[21] [21]

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

Alec Radford, Luke Metz, and Soumith Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[22] [22]

An embarrassingly simple approach to zero-shot learning

Bernardino Romera-Paredes and Philip Torr. An embarrassingly simple approach to zero-shot learning. In International Conference on Machine Learning , pages 2152–2161, 2015

work page 2015

[23] [23]

Zero-shot learning through cross-modal transfer

Richard Socher, Milind Ganjoo, Christopher D Manning, and Andrew Ng. Zero-shot learning through cross-modal transfer. In Advances in neural information processing systems , pages 935–943, 2013

work page 2013

[24] [24]

Learning to compare: Relation network for few-shot learning

Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip HS Torr, and Timothy M Hospedales. Learning to compare: Relation network for few-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1199–1208, 2018

work page 2018

[25] [25]

Zero-shot recognition via semantic embeddings and knowledge graphs

Xiaolong Wang, Yufei Ye, and Abhinav Gupta. Zero-shot recognition via semantic embeddings and knowledge graphs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 6857–6866, 2018

work page 2018

[26] [26]

Caltech-ucsd birds 200

Peter Welinder, Steve Branson, Takeshi Mita, Catherine Wah, Florian Schroff, Serge Belongie, and Pietro Perona. Caltech-ucsd birds 200. 2010

work page 2010

[27] [27]

Zero-shot learning-a comprehensive evaluation of the good, the bad and the ugly

Yongqin Xian, Christoph H Lampert, Bernt Schiele, and Zeynep Akata. Zero-shot learning-a comprehensive evaluation of the good, the bad and the ugly. IEEE transactions on pattern analysis and machine intelligence, 2018

work page 2018

[28] [28]

Feature generating networks for zero-shot learning

Yongqin Xian, Tobias Lorenz, Bernt Schiele, and Zeynep Akata. Feature generating networks for zero-shot learning. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 5542–5551, 2018

work page 2018

[29] [29]

Dualgan: Unsupervised dual learning for image-to-image translation

Zili Yi, Hao Zhang, Ping Tan, and Minglun Gong. Dualgan: Unsupervised dual learning for image-to-image translation. In Proceedings of the IEEE international conference on computer vision, pages 2849–2857, 2017

work page 2017

[30] [30]

Triple veriﬁcation network for general- ized zero-shot learning

Haofeng Zhang, Yang Long, Yu Guan, and Ling Shao. Triple veriﬁcation network for general- ized zero-shot learning. IEEE Transactions on Image Processing, 28(1):506–517, 2018

work page 2018

[31] [31]

Zero-shot kernel learning

Hongguang Zhang and Piotr Koniusz. Zero-shot kernel learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 7670–7679, 2018

work page 2018

[32] [32]

Learning a deep embedding model for zero-shot learning

Li Zhang, Tao Xiang, and Shaogang Gong. Learning a deep embedding model for zero-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 2021–2030, 2017

work page 2021

[33] [33]

Unpaired image-to-image translation using cycle-consistent adversarial networks

Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. InProceedings of the IEEE international conference on computer vision, pages 2223–2232, 2017

work page 2017

[34] [34]

A generative adversarial approach for zero-shot learning from noisy texts

Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, and Ahmed Elgammal. A generative adversarial approach for zero-shot learning from noisy texts. In Proceedings of the IEEE conference on computer vision and pattern recognition , pages 1004–1013, 2018. 10

work page 2018