pith. machine review for the scientific record. sign in

arxiv: 1803.09655 · v2 · submitted 2018-03-26 · 💻 cs.CV · cs.LG· stat.ML

Recognition: unknown

BAGAN: Data Augmentation with Balancing GAN

Authors on Pith no claims yet
classification 💻 cs.CV cs.LGstat.ML
keywords imagesbaganclassesimbalancedaugmentationbalancingclassdatasets
0
0 comments X
read the original abstract

Image classification datasets are often imbalanced, characteristic that negatively affects the accuracy of deep-learning classifiers. In this work we propose balancing GAN (BAGAN) as an augmentation tool to restore balance in imbalanced datasets. This is challenging because the few minority-class images may not be enough to train a GAN. We overcome this issue by including during the adversarial training all available images of majority and minority classes. The generative model learns useful features from majority classes and uses these to generate images for minority classes. We apply class conditioning in the latent space to drive the generation process towards a target class. The generator in the GAN is initialized with the encoder module of an autoencoder that enables us to learn an accurate class-conditioning in the latent space. We compare the proposed methodology with state-of-the-art GANs and demonstrate that BAGAN generates images of superior quality when trained with an imbalanced dataset.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. VAE-Inf: A statistically interpretable generative paradigm for imbalanced classification

    cs.LG 2026-04 unverdicted novelty 5.0

    VAE-Inf trains a VAE on majority data to build a reference distribution, then uses limited minority samples and a projection score to produce classifiers with guaranteed control of false-positive rates in imbalanced settings.

  2. Synthesizing real-world distributions from high-dimensional Gaussian Noise with Fully Connected Neural Network

    cs.LG 2026-04 unverdicted novelty 5.0

    Fully connected neural network with randomized loss synthesizes real-world tabular data distributions from Gaussian noise faster than state-of-the-art deep generative models.