Co-Evolutionary Compression for Unpaired Image Translation

Chang Xu; Chunjing Xu; Han Shu; Hanting Chen; Kai Han; Qi Tian; Xu Jia; Yunhe Wang

arxiv: 1907.10804 · v1 · pith:ELUAYUSCnew · submitted 2019-07-25 · 💻 cs.CV · cs.LG · eess.IV


Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu

Pith reviewed 2026-05-24 16:44 UTC · model grok-4.3

classification 💻 cs.CV · cs.LG · eess.IV

keywords co-evolutionary compression · unpaired image translation · GAN compression · generator pruning · cycle consistency · image-to-image translation · model compression


The pith

A co-evolutionary method prunes both generators of an unpaired image translation GAN at the same time while preserving translation quality.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a method to compress generators in GANs used for translating images between two domains without paired examples. It treats the two generators as evolving populations that iteratively select important convolution filters. Fitness of each candidate is scored by parameter count, a term that accounts for the discriminator, and the cycle consistency loss that enforces translation back and forth. If successful, this produces smaller models that use less memory and fewer operations yet still generate good translations on standard benchmarks.

Core claim

Generators for the two domains are encoded as separate populations and co-evolved by iteratively removing less important filters. The fitness of each individual combines the number of parameters, a discriminator-aware regularization, and cycle consistency, allowing joint optimization that reduces both memory and FLOPs without paired training data.
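A minimal sketch of how such a composite fitness could be scored for one candidate, assuming each individual is a binary mask over filters (1 = keep, 0 = prune). The additive form, the weights `gamma` and `lam`, and the idea of passing in already measured loss values are illustrative assumptions; the abstract names the three ingredients but not how they are combined.

```python
import numpy as np

def fitness(mask, params_per_filter, cycle_loss, disc_reg, gamma=1.0, lam=1.0):
    """Toy composite cost for one individual (lower is better).

    mask              : binary vector, 1 keeps a filter, 0 prunes it
    params_per_filter : parameter count contributed by each filter
    cycle_loss        : cycle-consistency loss measured with this mask applied
    disc_reg          : discriminator-aware term measured with this mask applied
    gamma, lam        : hypothetical trade-off weights (not from the paper)
    """
    mask = np.asarray(mask, dtype=float)
    params_per_filter = np.asarray(params_per_filter, dtype=float)
    kept_fraction = float(mask @ params_per_filter) / float(params_per_filter.sum())
    return kept_fraction + gamma * disc_reg + lam * cycle_loss

# Placeholder numbers for illustration only, not results from the paper.
p = [100, 100, 200, 200]
dense = fitness([1, 1, 1, 1], p, cycle_loss=0.10, disc_reg=0.05)
pruned = fitness([1, 0, 1, 0], p, cycle_loss=0.12, disc_reg=0.07)
```

In this toy case the pruned mask keeps half of the parameters and pays only a small increase in the two loss terms, so it scores lower (better) than the dense mask.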

What carries the argument

Co-evolutionary optimization of two generator populations, where fitness is computed from parameter count, discriminator-aware regularization, and cycle consistency.
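The loop below is a hedged sketch of the co-evolution idea, not the authors' algorithm: two populations of binary filter masks are evolved in alternation, and each individual is scored against the current best individual of the other population. The cost function, the coupling term, the selection scheme, and every constant are stand-ins chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
N_FILTERS, POP, GENERATIONS = 16, 8, 20

# Hypothetical per-filter importance, standing in for real GAN losses.
imp_a = rng.random(N_FILTERS)
imp_b = rng.random(N_FILTERS)

def cost(mask_a, mask_b):
    """Toy joint cost (lower is better) for a pair of generator masks."""
    size = (mask_a.sum() + mask_b.sum()) / (2 * N_FILTERS)
    lost = (imp_a @ (1 - mask_a) + imp_b @ (1 - mask_b)) / N_FILTERS
    coupling = abs(mask_a.sum() - mask_b.sum()) / N_FILTERS  # stand-in for a cycle term
    return 0.5 * size + lost + 0.5 * coupling

def next_generation(pop, scores):
    """Keep the best half, refill with one-point crossover plus bit-flip mutation."""
    elite = pop[np.argsort(scores)[: POP // 2]]
    children = []
    while len(children) < POP - len(elite):
        p1, p2 = elite[rng.integers(len(elite), size=2)]
        cut = rng.integers(1, N_FILTERS)
        child = np.concatenate([p1[:cut], p2[cut:]])
        flip = rng.random(N_FILTERS) < 0.05
        children.append(np.where(flip, 1 - child, child))
    return np.vstack([elite, np.array(children)])

pop_a = rng.integers(0, 2, size=(POP, N_FILTERS))
pop_b = rng.integers(0, 2, size=(POP, N_FILTERS))
best_a, best_b = pop_a[0].copy(), pop_b[0].copy()
history = []
for _ in range(GENERATIONS):
    # Score population A against the current best of B, then the reverse.
    scores_a = np.array([cost(m, best_b) for m in pop_a])
    best_a = pop_a[scores_a.argmin()].copy()
    scores_b = np.array([cost(best_a, m) for m in pop_b])
    best_b = pop_b[scores_b.argmin()].copy()
    history.append(cost(best_a, best_b))
    pop_a = next_generation(pop_a, scores_a)
    pop_b = next_generation(pop_b, scores_b)

best_cost = min(history)
```

Scoring each population against the other's current best is one simple way to make the two searches depend on each other; a real implementation would replace `cost` with losses computed by actually running the masked generators.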

If this is right

Compact generators achieve similar translation performance on benchmark datasets.
Memory usage and computational complexity are reduced simultaneously.
The method works for unpaired image-to-image translation tasks.
Extensive experiments validate effectiveness on standard datasets.
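To see why removing filters lowers memory and compute together, the sketch below counts weights and multiply-accumulate operations for two stacked convolution layers. The layer sizes are hypothetical and biases are ignored; the point is only that pruning a layer's output filters also shrinks the input channels of the layer that follows.

```python
def conv_cost(c_in, c_out, k, h_out, w_out):
    """Return (weight count, multiply-accumulates per image) for a k x k convolution."""
    params = c_in * c_out * k * k      # bias terms ignored
    macs = params * h_out * w_out      # every weight is used once per output position
    return params, macs

# Two stacked 3x3 layers on a 64x64 feature map (hypothetical sizes).
full = [conv_cost(64, 128, 3, 64, 64), conv_cost(128, 128, 3, 64, 64)]
# Prune half of the first layer's filters: its c_out and the next layer's c_in both halve.
pruned = [conv_cost(64, 64, 3, 64, 64), conv_cost(64, 128, 3, 64, 64)]

full_params = sum(p for p, _ in full)
pruned_params = sum(p for p, _ in pruned)
full_macs = sum(m for _, m in full)
pruned_macs = sum(m for _, m in pruned)
```

Here both totals drop by half, which is the sense in which filter-level pruning targets memory and FLOPs simultaneously.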

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same co-evolution idea could be tested on other GAN architectures beyond translation.
It may lower the cost of running translation models on mobile or edge hardware.
Combining this approach with quantization could yield further size reductions.
The fitness function might be adapted to other unpaired learning settings like style transfer.

Load-bearing premise

The combination of parameter count, discriminator regularization, and cycle consistency in the fitness function is enough to find compact generators that keep translation quality without needing extra checks on separate data.

What would settle it

Run the compressed generator on a held-out image set and check whether translation quality, as measured by standard metrics, drops substantially below that of the original full model.
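One way to make "drops substantially" concrete is a relative-degradation threshold, sketched below. The function names, the 15% tolerance, and the example scores are assumptions made for illustration; they are not values reported in the paper.

```python
def relative_degradation(full_score, compressed_score, lower_is_better=True):
    """Fractional quality loss of the compressed model relative to the full model."""
    if full_score == 0:
        raise ValueError("full_score must be non-zero")
    delta = compressed_score - full_score
    if not lower_is_better:
        delta = -delta
    return delta / abs(full_score)

def passes_held_out_check(full_score, compressed_score, tolerance=0.15,
                          lower_is_better=True):
    """True if the compressed model loses no more than `tolerance` of the quality."""
    return relative_degradation(full_score, compressed_score, lower_is_better) <= tolerance

# Placeholder numbers for an FID-like metric (lower is better), not paper results.
drop = relative_degradation(50.0, 55.0)
ok = passes_held_out_check(50.0, 55.0)
bad = passes_held_out_check(50.0, 80.0)
```

Both scores should be computed on images that were used neither for training nor for the evolutionary search, otherwise the check does not test generalization.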

Figures

Figures reproduced from arXiv: 1907.10804 by Chang Xu, Chunjing Xu, Han Shu, Hanting Chen, Kai Han, Qi Tian, Xu Jia, Yunhe Wang.

**Figure 1.** The diagram of the proposed co-evolutionary method for learning efficient generators. Wherein, filters in generators are represented …

**Figure 2.** Images generated using the generator compressed by exploiting the proposed method with different hyper-parameters. The top line …

**Figure 3.** The generated images on horse2zebra and summer2winter datasets using different methods and strategies. The first two columns …

**Figure 4.** Filter visualization results. From top to bottom: the original filters with red rectangles selecting the remained filters by the proposed …

read the original abstract

Generative adversarial networks (GANs) have been successfully used for considerable computer vision tasks, especially the image-to-image translation. However, generators in these networks are of complicated architectures with large number of parameters and huge computational complexities. Existing methods are mainly designed for compressing and speeding-up deep neural networks in the classification task, and cannot be directly applied on GANs for image translation, due to their different objectives and training procedures. To this end, we develop a novel co-evolutionary approach for reducing their memory usage and FLOPs simultaneously. In practice, generators for two image domains are encoded as two populations and synergistically optimized for investigating the most important convolution filters iteratively. Fitness of each individual is calculated using the number of parameters, a discriminator-aware regularization, and the cycle consistency. Extensive experiments conducted on benchmark datasets demonstrate the effectiveness of the proposed method for obtaining compact and effective generators.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note · private letter to a colleague

The co-evolutionary compression for unpaired translation GANs is a reasonable adaptation but the fitness function's link to actual output quality remains the weakest part.


The main takeaway is a co-evolutionary search that treats the two generators in an unpaired translation setup as interacting populations, with fitness driven by parameter count plus discriminator regularization and cycle consistency. This is presented as distinct from classifier compression methods, which the authors note do not transfer directly because of the adversarial training and dual-domain objectives. That framing is fair and the approach is a logical extension rather than a wholesale invention. The paper does a clean job stating the mismatch with prior work and defining an iterative population-based search that accounts for both generators at once. The experiments are described as extensive on standard benchmarks, which at least shows the authors ran the method end-to-end. The soft spot is whether the chosen fitness actually preserves translation quality. Cycle consistency and discriminator terms can be satisfied by low-effort mappings that do not carry semantic content, and parameter count alone does not guarantee retained feature statistics. The abstract supplies no numbers, ablations, or held-out metrics, so the correlation between fitness scores and real performance is not yet demonstrated. If the full paper contains quantitative tables, comparisons to other pruning baselines, and checks on unseen data, the concern shrinks; otherwise it stays central. This is mainly for readers already working on efficient GAN deployment rather than new theory. It deserves a serious referee to check the experimental controls and effect sizes.

Referee Report

2 major / 1 minor

Summary. The paper proposes a co-evolutionary compression method for unpaired image-to-image translation GANs. Two generator populations (one per domain) are encoded and iteratively optimized by selecting important convolution filters; fitness of each individual is computed from parameter count, a discriminator-aware regularization term, and cycle consistency loss. The abstract states that extensive experiments on benchmark datasets demonstrate the method's effectiveness at producing compact yet effective generators.

Significance. If the central empirical claim holds, the approach would offer a domain-specific compression technique for GAN generators that simultaneously targets memory and FLOPs while respecting the adversarial and cycle-consistency objectives, which could be useful for deploying image-translation models on edge devices. The co-evolutionary framing and the composite fitness function are the main technical contributions.

major comments (2)

[Abstract] The claim that 'extensive experiments conducted on benchmark datasets demonstrate the effectiveness' is unsupported because the abstract (and the supplied excerpt) contains no quantitative results, no tables of FID/SSIM/perceptual scores, no ablation studies, and no baseline comparisons. Without these data it is impossible to verify whether the fitness-driven search actually preserves translation quality.
[Method] (fitness definition) The composite fitness (parameter count + discriminator-aware regularization + cycle consistency) can be satisfied by degenerate mappings that preserve cycle consistency yet produce semantically incorrect translations; the manuscript provides no held-out quantitative validation or post-search fine-tuning protocol to establish that fitness correlates with actual output quality on unseen data.
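The degenerate-mapping concern in the second comment can be illustrated with a toy pair of "generators" that do no translation at all. In the sketch below both mappings are a horizontal flip, an assumption chosen only because it is trivially invertible: the round trip reproduces the input exactly, so the cycle-consistency loss is zero even though nothing has been translated.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.random((4, 8, 8, 3))  # toy batch of images: (batch, height, width, channels)

def g(images):
    """Stand-in for the forward generator: a horizontal flip, not a translation."""
    return np.flip(images, axis=2)

def f(images):
    """Stand-in for the backward generator: the same flip, which undoes g."""
    return np.flip(images, axis=2)

cycle_loss = float(np.mean(np.abs(f(g(x)) - x)))  # L1 cycle-consistency loss
changed = not np.array_equal(g(x), x)             # g does alter the image
```

Cycle consistency alone therefore cannot distinguish a meaningful translation from any invertible shuffle of pixels, which is why the comment asks for the discriminator term and held-out metrics to carry the remaining weight.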

minor comments (1)

[Abstract] 'considerable computer vision tasks' should be replaced by a more precise phrase such as 'various' or 'several'.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below and indicate where revisions will be made.


Referee: [Abstract] The claim that 'extensive experiments conducted on benchmark datasets demonstrate the effectiveness' is unsupported because the abstract (and the supplied excerpt) contains no quantitative results, no tables of FID/SSIM/perceptual scores, no ablation studies, and no baseline comparisons. Without these data it is impossible to verify whether the fitness-driven search actually preserves translation quality.

Authors: We agree that the abstract would be strengthened by including key quantitative results. The full manuscript contains tables and figures reporting FID scores, parameter/FLOP reductions, baseline comparisons, and ablation studies on standard benchmarks. We will revise the abstract to cite representative metrics (e.g., comparable FID with >50% parameter reduction).
Revision: yes

Referee: [Method] (fitness definition) The composite fitness (parameter count + discriminator-aware regularization + cycle consistency) can be satisfied by degenerate mappings that preserve cycle consistency yet produce semantically incorrect translations; the manuscript provides no held-out quantitative validation or post-search fine-tuning protocol to establish that fitness correlates with actual output quality on unseen data.

Authors: The discriminator-aware regularization term explicitly penalizes outputs that the discriminator classifies as fake, thereby discouraging semantically degenerate solutions even when cycle consistency holds. The manuscript reports held-out test-set results (FID, SSIM, and visual comparisons) showing that the fitness-selected generators preserve translation quality without requiring post-search fine-tuning; the evolutionary search directly optimizes the composite objective that includes the discriminator signal.
Revision: no

Circularity Check

0 steps flagged

No significant circularity; evolutionary search uses external fitness without self-referential derivation


The paper describes a co-evolutionary compression procedure in which two generator populations are iteratively optimized according to an explicitly stated composite fitness function (parameter count + discriminator-aware regularization + cycle consistency). This constitutes an applied search algorithm whose outputs are validated on benchmark datasets rather than a closed mathematical derivation in which any claimed prediction or uniqueness result reduces by construction to its own fitted inputs or self-citations. No equations or uniqueness theorems are presented that would trigger self-definitional, fitted-input-called-prediction, or self-citation-load-bearing patterns. The method therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; the fitness function itself is described at a high level without numerical constants or additional postulated objects.



Reference graph

Works this paper leans on

36 extracted references · 36 canonical work pages · 11 internal anchors

[1] W. Chen, J. T. Wilson, S. Tyree, K. Q. Weinberger, and Y. Chen. Compressing convolutional neural networks. arXiv preprint arXiv:1506.04449, 2015.

[2] Y. Choi, M. Choi, M. Kim, J.-W. Ha, S. Kim, and J. Choo. Stargan: Unified generative adversarial networks for multi-domain image-to-image translation. arXiv preprint, 1711,

[3] M. Courbariaux and Y. Bengio. Binarynet: Training deep neural networks with weights and activations constrained to +1 or -1. arXiv preprint arXiv:1602.02830, 2016.

[4] K. Deb, A. Pratap, S. Agarwal, and T. Meyarivan. A fast and elitist multiobjective genetic algorithm: Nsga-ii. IEEE transactions on evolutionary computation, 6(2):182–197, 2002.

[5] E. L. Denton, W. Zaremba, J. Bruna, Y. LeCun, and R. Fergus. Exploiting linear structure within convolutional networks for efficient evaluation. In NIPS, 2014.

[6] R. Eberhart and J. Kennedy. A new optimizer using particle swarm theory. In Micro Machine and Human Science, 1995. MHS’95., Proceedings of the Sixth International Symposium on, pages 39–43, 1995.

[7] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio. Generative adversarial nets. In Advances in neural information processing systems, pages 2672–2680, 2014.

[8] S. Han, H. Mao, and W. J. Dally. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. In ICLR, 2016.

[9] S. Han, J. Pool, J. Tran, and W. Dally. Learning both weights and connections for efficient neural network. In NIPS, 2015.

[10] K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In CVPR, 2016.

[11] G. Hinton, O. Vinyals, and J. Dean. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015.

[12] A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861, 2017.

[13] H. Hu, R. Peng, Y.-W. Tai, and C.-K. Tang. Network trimming: A data-driven neuron pruning approach towards efficient deep architectures. arXiv preprint arXiv:1607.03250,

[14] P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros. Image-to-image translation with conditional adversarial networks. arXiv preprint, 2017.

[15] T. Kim, M. Cha, H. Kim, J. K. Lee, and J. Kim. Learning to discover cross-domain relations with generative adversarial networks. arXiv preprint arXiv:1703.05192, 2017.

[16] S. Kirkpatrick, C. D. Gelatt, and M. P. Vecchi. Optimization by simulated annealing. science, 220(4598):671–680, 1983.

[17] A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.

[18] C. Ledig, L. Theis, F. Huszár, J. Caballero, A. Cunningham, A. Acosta, A. P. Aitken, A. Tejani, J. Totz, Z. Wang, et al. Photo-realistic single image super-resolution using a generative adversarial network. In CVPR, volume 2, page 4, 2017.

[19] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg. Ssd: Single shot multibox detector. In ECCV, 2016.

[20] Z. Liu, J. Li, Z. Shen, G. Huang, S. Yan, and C. Zhang. Learning efficient convolutional networks through network slimming. In ICCV, pages 2755–2763, 2017.

[21] J. Long, E. Shelhamer, and T. Darrell. Fully convolutional networks for semantic segmentation. In CVPR, 2015.

[22] J.-H. Luo, J. Wu, and W. Lin. Thinet: A filter level pruning method for deep neural network compression. In ICCV, pages 5058–5066, 2017.

[23] M. Rastegari, V. Ordonez, J. Redmon, and A. Farhadi. Xnor-net: Imagenet classification using binary convolutional neural networks. arXiv preprint arXiv:1603.05279, 2016.

[24] E. Real, S. Moore, A. Selle, S. Saxena, Y. L. Suematsu, Q. Le, and A. Kurakin. Large-scale evolution of image classifiers. arXiv preprint arXiv:1703.01041, 2017.

[25] S. Ren, K. He, R. Girshick, and J. Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. In NIPS, 2015.

[26] A. Romero, N. Ballas, S. E. Kahou, A. Chassang, C. Gatta, and Y. Bengio. Fitnets: Hints for thin deep nets. arXiv preprint arXiv:1412.6550, 2014.

[27] C. Shen, X. Wang, J. Song, L. Sun, and M. Song. Amalgamating knowledge towards comprehensive classification. arXiv preprint arXiv:1811.02796, 2018.

[28] K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. ICLR, 2015.

[29] V. Vanhoucke, A. Senior, and M. Z. Mao. Improving the speed of neural networks on cpus. In Deep Learning and Unsupervised Feature Learning Workshop, NIPS, 2011.

[30] T.-C. Wang, M.-Y. Liu, J.-Y. Zhu, A. Tao, J. Kautz, and B. Catanzaro. High-resolution image synthesis and semantic manipulation with conditional gans. In CVPR, pages 8798–8807, 2018.

[31] Y. Wang, C. Xu, J. Qiu, C. Xu, and D. Tao. Towards evolutional compression. In SIGKDD, 2018.

[32] Y. Wang, C. Xu, S. You, D. Tao, and C. Xu. Cnnpack: Packing convolutional neural networks in the frequency domain. In NIPS, 2016.

[33] Z. Yi, H. Zhang, P. Tan, and M. Gong. Dualgan: Unsupervised dual learning for image-to-image translation. In ICCV, pages 2868–2876. IEEE, 2017.

[34] X. Zhang, X. Zhou, M. Lin, and J. Sun. Shufflenet: An extremely efficient convolutional neural network for mobile devices. arXiv preprint arXiv:1707.01083, 2017.

[35] J.-Y. Zhu, P. Krähenbühl, E. Shechtman, and A. A. Efros. Generative visual manipulation on the natural image manifold. In ECCV, pages 597–613, 2016.

[36] J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. 2017.
