pith. machine review for the scientific record. sign in

arxiv: 1811.11264 · v1 · submitted 2018-11-27 · 💻 cs.LG · stat.ML

Recognition: unknown

Synthesizing Tabular Data using Generative Adversarial Networks

Authors on Pith no claims yet
classification 💻 cs.LG stat.ML
keywords generativeadversarialnetworkstabulartgandatadatasetsdistribution
0
0 comments X
read the original abstract

Generative adversarial networks (GANs) implicitly learn the probability distribution of a dataset and can draw samples from the distribution. This paper presents, Tabular GAN (TGAN), a generative adversarial network which can generate tabular data like medical or educational records. Using the power of deep neural networks, TGAN generates high-quality and fully synthetic tables while simultaneously generating discrete and continuous variables. When we evaluate our model on three datasets, we find that TGAN outperforms conventional statistical generative models in both capturing the correlation between columns and scaling up for large datasets.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Breaking the Quality-Privacy Tradeoff in Tabular Data Generation via In-Context Learning

    cs.LG 2026-05 unverdicted novelty 6.0

    DiffICL breaks the quality-privacy tradeoff in small-data tabular synthesis by using in-context learning on pretrained structural priors to generate data that is both higher quality and less memorizing of training samples.

  2. Posterior-Calibrated Causal Circuits in Variational Autoencoders: Why Image-Domain Interpretability Fails on Tabular Data

    cs.LG 2026-03 unverdicted novelty 6.0

    Tabular VAEs show ~50% lower causal circuit modularity than image VAEs, with beta-VAE CES collapsing to 0.043 versus 0.133 due to reconstruction degradation, challenging direct transfer of image interpretability techniques.

  3. Generative Augmentation of Imbalanced Flight Records for Flight Diversion Prediction: A Multi-objective Optimisation Framework

    cs.LG 2026-04 unverdicted novelty 5.0

    Hyperparameter-optimized generative models augment scarce flight diversion records and substantially improve prediction accuracy over real data alone.

  4. Synthesizing real-world distributions from high-dimensional Gaussian Noise with Fully Connected Neural Network

    cs.LG 2026-04 unverdicted novelty 5.0

    Fully connected neural network with randomized loss synthesizes real-world tabular data distributions from Gaussian noise faster than state-of-the-art deep generative models.

  5. Synthetic Flight Data Generation Using Generative Models

    cs.LG 2026-04 unverdicted novelty 3.0

    Synthetic flight data generated by TVAE and Gaussian Copula models supports flight delay prediction models with accuracy comparable to real data.