pith. sign in

arxiv: 1907.10804 · v1 · pith:ELUAYUSCnew · submitted 2019-07-25 · 💻 cs.CV · cs.LG· eess.IV

Co-Evolutionary Compression for Unpaired Image Translation

Pith reviewed 2026-05-24 16:44 UTC · model grok-4.3

classification 💻 cs.CV cs.LGeess.IV
keywords co-evolutionary compressionunpaired image translationGAN compressiongenerator pruningcycle consistencyimage-to-image translationmodel compression
0
0 comments X

The pith

A co-evolutionary method simultaneously prunes generators in unpaired image translation GANs while preserving translation quality.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a method to compress generators in GANs used for translating images between two domains without paired examples. It treats the two generators as evolving populations that iteratively select important convolution filters. Fitness of each candidate is scored by parameter count, a term that accounts for the discriminator, and the cycle consistency loss that enforces translation back and forth. If successful, this produces smaller models that use less memory and fewer operations yet still generate good translations on standard benchmarks.

Core claim

Generators for the two domains are encoded as separate populations and co-evolved by iteratively removing less important filters. The fitness of each individual combines the number of parameters, a discriminator-aware regularization, and cycle consistency, allowing joint optimization that reduces both memory and FLOPs without paired training data.

What carries the argument

Co-evolutionary optimization of two generator populations, where fitness is computed from parameter count, discriminator-aware regularization, and cycle consistency.

If this is right

  • Compact generators achieve similar translation performance on benchmark datasets.
  • Memory usage and computational complexity are reduced simultaneously.
  • The method works for unpaired image-to-image translation tasks.
  • Extensive experiments validate effectiveness on standard datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same co-evolution idea could be tested on other GAN architectures beyond translation.
  • It may lower the cost of running translation models on mobile or edge hardware.
  • Combining this approach with quantization could yield further size reductions.
  • The fitness function might be adapted to other unpaired learning settings like style transfer.

Load-bearing premise

The combination of parameter count, discriminator regularization, and cycle consistency in the fitness function is enough to find compact generators that keep translation quality without needing extra checks on separate data.

What would settle it

Run the compressed generator on a held-out image set and measure whether translation quality measured by standard metrics drops substantially below the original full model.

Figures

Figures reproduced from arXiv: 1907.10804 by Chang Xu, Chunjing Xu, Han Shu, Hanting Chen, Kai Han, Qi Tian, Xu Jia, Yunhe Wang.

Figure 1
Figure 1. Figure 1: The diagram of the proposed co-evolutionary method for learning efficient generators. Wherein, filters in generators are represented [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Images generated using the generator compressed by exploiting the proposed method with different hyper-parameters. The top line [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: The generated images on horse2zebra and summer2winter datasets using different methods and strategies. The first two columns [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Filter visualization results. From top to bottom: the original filters with red rectangles selecting the remained filters by the proposed [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗
read the original abstract

Generative adversarial networks (GANs) have been successfully used for considerable computer vision tasks, especially the image-to-image translation. However, generators in these networks are of complicated architectures with large number of parameters and huge computational complexities. Existing methods are mainly designed for compressing and speeding-up deep neural networks in the classification task, and cannot be directly applied on GANs for image translation, due to their different objectives and training procedures. To this end, we develop a novel co-evolutionary approach for reducing their memory usage and FLOPs simultaneously. In practice, generators for two image domains are encoded as two populations and synergistically optimized for investigating the most important convolution filters iteratively. Fitness of each individual is calculated using the number of parameters, a discriminator-aware regularization, and the cycle consistency. Extensive experiments conducted on benchmark datasets demonstrate the effectiveness of the proposed method for obtaining compact and effective generators.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper proposes a co-evolutionary compression method for unpaired image-to-image translation GANs. Two generator populations (one per domain) are encoded and iteratively optimized by selecting important convolution filters; fitness of each individual is computed from parameter count, a discriminator-aware regularization term, and cycle consistency loss. The abstract states that extensive experiments on benchmark datasets demonstrate the method's effectiveness at producing compact yet effective generators.

Significance. If the central empirical claim holds, the approach would offer a domain-specific compression technique for GAN generators that simultaneously targets memory and FLOPs while respecting the adversarial and cycle-consistency objectives, which could be useful for deploying image-translation models on edge devices. The co-evolutionary framing and the composite fitness function are the main technical contributions.

major comments (2)
  1. [Abstract] Abstract: the claim that 'extensive experiments conducted on benchmark datasets demonstrate the effectiveness' is unsupported because the abstract (and the supplied excerpt) contains no quantitative results, no tables of FID/SSIM/perceptual scores, no ablation studies, and no baseline comparisons. Without these data it is impossible to verify whether the fitness-driven search actually preserves translation quality.
  2. [Method] Method (fitness definition): the composite fitness (parameter count + discriminator-aware regularization + cycle consistency) can be satisfied by degenerate mappings that preserve cycle consistency yet produce semantically incorrect translations; the manuscript provides no held-out quantitative validation or post-search fine-tuning protocol to establish that fitness correlates with actual output quality on unseen data.
minor comments (1)
  1. [Abstract] Abstract: 'considerable computer vision tasks' should be replaced by a more precise phrase such as 'various' or 'several'.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below and indicate where revisions will be made.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the claim that 'extensive experiments conducted on benchmark datasets demonstrate the effectiveness' is unsupported because the abstract (and the supplied excerpt) contains no quantitative results, no tables of FID/SSIM/perceptual scores, no ablation studies, and no baseline comparisons. Without these data it is impossible to verify whether the fitness-driven search actually preserves translation quality.

    Authors: We agree that the abstract would be strengthened by including key quantitative results. The full manuscript contains tables and figures reporting FID scores, parameter/FLOP reductions, baseline comparisons, and ablation studies on standard benchmarks. We will revise the abstract to cite representative metrics (e.g., comparable FID with >50% parameter reduction). revision: yes

  2. Referee: [Method] Method (fitness definition): the composite fitness (parameter count + discriminator-aware regularization + cycle consistency) can be satisfied by degenerate mappings that preserve cycle consistency yet produce semantically incorrect translations; the manuscript provides no held-out quantitative validation or post-search fine-tuning protocol to establish that fitness correlates with actual output quality on unseen data.

    Authors: The discriminator-aware regularization term explicitly penalizes outputs that the discriminator classifies as fake, thereby discouraging semantically degenerate solutions even when cycle consistency holds. The manuscript reports held-out test-set results (FID, SSIM, and visual comparisons) showing that the fitness-selected generators preserve translation quality without requiring post-search fine-tuning; the evolutionary search directly optimizes the composite objective that includes the discriminator signal. revision: no

Circularity Check

0 steps flagged

No significant circularity; evolutionary search uses external fitness without self-referential derivation

full rationale

The paper describes a co-evolutionary compression procedure in which two generator populations are iteratively optimized according to an explicitly stated composite fitness function (parameter count + discriminator-aware regularization + cycle consistency). This constitutes an applied search algorithm whose outputs are validated on benchmark datasets rather than a closed mathematical derivation in which any claimed prediction or uniqueness result reduces by construction to its own fitted inputs or self-citations. No equations or uniqueness theorems are presented that would trigger self-definitional, fitted-input-called-prediction, or self-citation-load-bearing patterns. The method therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; the fitness function itself is described at a high level without numerical constants or additional postulated objects.

pith-pipeline@v0.9.0 · 5694 in / 1052 out tokens · 17054 ms · 2026-05-24T16:44:39.717331+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

36 extracted references · 36 canonical work pages · 11 internal anchors

  1. [1]

    W. Chen, J. T. Wilson, S. Tyree, K. Q. Weinberger, and Y . Chen. Compressing convolutional neural networks.arXiv preprint arXiv:1506.04449, 2015. 3

  2. [2]

    Y . Choi, M. Choi, M. Kim, J.-W. Ha, S. Kim, and J. Choo. Stargan: Unified generative adversarial networks for multi- domain image-to-image translation. arXiv preprint, 1711,

  3. [3]

    Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1

    M. Courbariaux and Y . Bengio. Binarynet: Training deep neural networks with weights and activations constrained to+ 1 or-1. arXiv preprint arXiv:1602.02830, 2016. 1, 3

  4. [4]

    K. Deb, A. Pratap, S. Agarwal, and T. Meyarivan. A fast and elitist multiobjective genetic algorithm: Nsga-ii. IEEE trans- actions on evolutionary computation, 6(2):182–197, 2002. 4, 5

  5. [5]

    E. L. Denton, W. Zaremba, J. Bruna, Y . LeCun, and R. Fergus. Exploiting linear structure within convolutional networks for efficient evaluation. InNIPS, 2014. 1, 3

  6. [6]

    Eberhart and J

    R. Eberhart and J. Kennedy. A new optimizer using particle swarm theory. In Micro Machine and Human Science, 1995. MHS’95., Proceedings of the Sixth International Symposium on, pages 39–43, 1995. 4

  7. [7]

    Goodfellow, J

    I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde- Farley, S. Ozair, A. Courville, and Y . Bengio. Generative adversarial nets. In Advances in neural information process- ing systems, pages 2672–2680, 2014. 1

  8. [8]

    S. Han, H. Mao, and W. J. Dally. Deep compression: Com- pressing deep neural networks with pruning, trained quantiza- tion and huffman coding. In ICLR, 2016. 1, 3, 8

  9. [9]

    S. Han, J. Pool, J. Tran, and W. Dally. Learning both weights and connections for efficient neural network. InNIPS, 2015. 3

  10. [10]

    K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In CVPR, 2016. 1, 3, 4, 5

  11. [11]

    Distilling the Knowledge in a Neural Network

    G. Hinton, O. Vinyals, and J. Dean. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015. 1

  12. [12]

    A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam. Mobilenets: Effi- cient convolutional neural networks for mobile vision appli- cations. arXiv preprint arXiv:1704.04861, 2017. 1

  13. [13]

    H. Hu, R. Peng, Y .-W. Tai, and C.-K. Tang. Network trim- ming: A data-driven neuron pruning approach towards effi- cient deep architectures. arXiv preprint arXiv:1607.03250,

  14. [14]

    Isola, J.-Y

    P. Isola, J.-Y . Zhu, T. Zhou, and A. A. Efros. Image-to-image translation with conditional adversarial networks. arXiv preprint, 2017. 1, 2, 8

  15. [15]

    T. Kim, M. Cha, H. Kim, J. K. Lee, and J. Kim. Learning to discover cross-domain relations with generative adversarial networks. arXiv preprint arXiv:1703.05192, 2017. 1, 2

  16. [16]

    Kirkpatrick, C

    S. Kirkpatrick, C. D. Gelatt, and M. P. Vecchi. Optimization by simulated annealing. science, 220(4598):671–680, 1983. 4

  17. [17]

    Krizhevsky, I

    A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012. 3

  18. [18]

    Ledig, L

    C. Ledig, L. Theis, F. Husz´ar, J. Caballero, A. Cunningham, A. Acosta, A. P. Aitken, A. Tejani, J. Totz, Z. Wang, et al. Photo-realistic single image super-resolution using a genera- tive adversarial network. In CVPR, volume 2, page 4, 2017. 1

  19. [19]

    W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y . Fu, and A. C. Berg. Ssd: Single shot multibox detector. In ECCV, 2016. 3

  20. [20]

    Z. Liu, J. Li, Z. Shen, G. Huang, S. Yan, and C. Zhang. Learning efficient convolutional networks through network slimming. In ICCV, pages 2755–2763, 2017. 7

  21. [21]

    J. Long, E. Shelhamer, and T. Darrell. Fully convolutional networks for semantic segmentation. In CVPR, 2015. 5, 8

  22. [22]

    J.-H. Luo, J. Wu, and W. Lin. Thinet: A filter level pruning method for deep neural network compression. InICCV, pages 5058–5066, 2017. 1, 3, 6, 7, 8

  23. [23]

    XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks

    M. Rastegari, V . Ordonez, J. Redmon, and A. Farhadi. Xnor- net: Imagenet classification using binary convolutional neural networks. arXiv preprint arXiv:1603.05279, 2016. 1, 3

  24. [24]

    E. Real, S. Moore, A. Selle, S. Saxena, Y . L. Suematsu, Q. Le, and A. Kurakin. Large-scale evolution of image classifiers. arXiv preprint arXiv:1703.01041, 2017. 4

  25. [25]

    S. Ren, K. He, R. Girshick, and J. Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. In NIPS, 2015. 3

  26. [26]

    FitNets: Hints for Thin Deep Nets

    A. Romero, N. Ballas, S. E. Kahou, A. Chassang, C. Gatta, and Y . Bengio. Fitnets: Hints for thin deep nets. arXiv preprint arXiv:1412.6550, 2014. 1

  27. [27]

    C. Shen, X. Wang, J. Song, L. Sun, and M. Song. Amalgamat- ing knowledge towards comprehensive classification.arXiv preprint arXiv:1811.02796, 2018. 1

  28. [28]

    Simonyan and A

    K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. ICLR, 2015. 3

  29. [29]

    Vanhoucke, A

    V . Vanhoucke, A. Senior, and M. Z. Mao. Improving the speed of neural networks on cpus. In Deep Learning and Unsupervised Feature Learning Workshop, NIPS, 2011. 1, 3

  30. [30]

    Wang, M.-Y

    T.-C. Wang, M.-Y . Liu, J.-Y . Zhu, A. Tao, J. Kautz, and B. Catanzaro. High-resolution image synthesis and semantic manipulation with conditional gans. In CVPR, pages 8798– 8807, 2018. 1, 2

  31. [31]

    Y . Wang, C. Xu, J. Qiu, C. Xu, and D. Tao. Towards evolu- tional compression. In SIGKDD, 2018. 4, 5

  32. [32]

    Y . Wang, C. Xu, S. You, D. Tao, and C. Xu. Cnnpack: Packing convolutional neural networks in the frequency domain. In NIPS, 2016. 1, 3

  33. [33]

    Z. Yi, H. Zhang, P. Tan, and M. Gong. Dualgan: Unsupervised dual learning for image-to-image translation. In ICCV, pages 2868–2876. IEEE, 2017. 1, 2

  34. [34]

    ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices

    X. Zhang, X. Zhou, M. Lin, and J. Sun. Shufflenet: An extremely efficient convolutional neural network for mobile devices. arXiv preprint arXiv:1707.01083, 2017. 1

  35. [35]

    J.-Y . Zhu, P. Kr¨ahenb¨uhl, E. Shechtman, and A. A. Efros. Generative visual manipulation on the natural image manifold. In ECCV, pages 597–613, 2016. 1

  36. [36]

    J.-Y . Zhu, T. Park, P. Isola, and A. A. Efros. Unpaired image- to-image translation using cycle-consistent adversarial net- works. 2017. 1, 2, 3, 4, 5, 8