Are GANs Created Equal? A Large-Scale Study

Karol Kurach; Marcin Michalski; Mario Lucic; Olivier Bousquet; Sylvain Gelly

arXiv: 1711.10337 · v4 · pith:TCSG3NF4new · submitted 2017-11-28 · stat.ML · cs.LG

keywords: models, algorithms, evaluation, find, generative, large-scale, research, very

Generative adversarial networks (GANs) are a powerful subclass of generative models. Despite very active research that has produced numerous interesting GAN algorithms, it is still very hard to assess which algorithms perform better than others. We conduct a neutral, multi-faceted, large-scale empirical study of state-of-the-art models and evaluation measures. We find that most models can reach similar scores with enough hyperparameter optimization and random restarts. This suggests that improvements can arise from a higher computational budget and more tuning rather than from fundamental algorithmic changes. To overcome some limitations of the current metrics, we also propose several data sets on which precision and recall can be computed. Our experimental results suggest that future GAN research should be based on more systematic and objective evaluation procedures. Finally, we did not find evidence that any of the tested algorithms consistently outperforms the non-saturating GAN introduced by Goodfellow et al. (2014).
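For readers unfamiliar with the term, the "non-saturating" GAN in the abstract's last sentence refers to the generator objective that minimizes -log D(G(z)) instead of the original minimax term log(1 - D(G(z))). The sketch below is illustrative only and is not code from the paper: the function names are hypothetical, and it assumes a scalar discriminator output obtained by applying a sigmoid to a logit. It compares the gradient each loss sends back when the discriminator confidently rejects a generated sample.

```python
import math


def generator_loss_minimax(d_fake):
    """Minimax (saturating) generator loss: log(1 - D(G(z)))."""
    return math.log(1.0 - d_fake)


def generator_loss_non_saturating(d_fake):
    """Non-saturating generator loss: -log D(G(z))."""
    return -math.log(d_fake)


def grad_wrt_logit(loss_fn, logit, eps=1e-6):
    """Central finite-difference gradient of loss_fn(sigmoid(logit)) w.r.t. the logit.

    Assumption for this sketch: D(G(z)) = sigmoid(logit).
    """
    def sigmoid(a):
        return 1.0 / (1.0 + math.exp(-a))

    upper = loss_fn(sigmoid(logit + eps))
    lower = loss_fn(sigmoid(logit - eps))
    return (upper - lower) / (2.0 * eps)


if __name__ == "__main__":
    # A logit of -5 means the discriminator is confident the sample is fake.
    logit = -5.0
    print("minimax gradient:       ", grad_wrt_logit(generator_loss_minimax, logit))
    print("non-saturating gradient:", grad_wrt_logit(generator_loss_non_saturating, logit))
```

With a confidently rejected sample, the minimax gradient is close to zero while the non-saturating gradient stays close to -1, which is the usual motivation given for the non-saturating variant early in training.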

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Reproducibility in Machine Learning for Health
cs.LG 2019-07 unverdicted novelty 5.0

Systematic evaluation of over 100 ML4H papers finds poorer reproducibility than other ML fields, driven by limited data and code access, and offers recommendations to data providers, publishers, and researchers.