Geometric GAN

Jae Hyun Lim , Jong Chul Ye

Authors on Pith no claims yet

classification 📊 stat.ML cond-mat.dis-nncs.AIcs.CVcs.LG

keywords geometrichyperplaneseparatinggenerativeadversarialdiscriminatorgeneratorupdate

read the original abstract

Generative Adversarial Nets (GANs) represent an important milestone for effective generative models, which has inspired numerous variants seemingly different from each other. One of the main contributions of this paper is to reveal a unified geometric structure in GAN and its variants. Specifically, we show that the adversarial generative model training can be decomposed into three geometric steps: separating hyperplane search, discriminator parameter update away from the separating hyperplane, and the generator update along the normal vector direction of the separating hyperplane. This geometric intuition reveals the limitations of the existing approaches and leads us to propose a new formulation called geometric GAN using SVM separating hyperplane that maximizes the margin. Our theoretical analysis shows that the geometric GAN converges to a Nash equilibrium between the discriminator and generator. In addition, extensive numerical results show that the superior performance of geometric GAN.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning Stratigraphically Consistent Relative Geologic Time from 3D Seismic Data via Sinusoidal Mapping
physics.geo-ph 2026-05 unverdicted novelty 7.0

RGT-Est reformulates relative geologic time learning via sinusoidal mapping and joint pointwise-perceptual-adversarial losses to enforce stratigraphic consistency, outperforming prior AI methods on field data especial...
Learning Stratigraphically Consistent Relative Geologic Time from 3D Seismic Data via Sinusoidal Mapping
physics.geo-ph 2026-05 unverdicted novelty 7.0

RGT-Est transforms RGT estimation into a sinusoidal space with joint losses to capture fine horizons and global stratigraphic order from seismic data, outperforming prior AI methods especially with sparse horizon priors.
Large Scale GAN Training for High Fidelity Natural Image Synthesis
cs.LG 2018-09 accept novelty 7.0

BigGANs achieve state-of-the-art class-conditional synthesis on ImageNet 128x128 with Inception Score 166.5 and FID 7.4 by scaling GANs and applying orthogonal regularization plus truncation.
Lightweight Unpaired Smartphone ISP Transfer with Semantic Pseudo-Pairing
cs.CV 2026-05 conditional novelty 6.0

Semantic pseudo-pairing via DINOv2 embeddings and fused Gromov-Wasserstein optimal transport enables training a 7K-parameter CNN for unpaired smartphone ISP, achieving 22.569 PSNR on the NTIRE 2026 challenge test set.
Continuous Adversarial Flow Models
cs.LG 2026-04 unverdicted novelty 6.0

Continuous adversarial flow models replace MSE in flow matching with adversarial training via a discriminator, improving guidance-free FID on ImageNet from 8.26 to 3.63 for SiT and similar gains for JiT and text-to-im...
Exploring Clustering Capability of Inpainting Model Embeddings for Pattern-based Individual Identification
cs.CV 2026-05 unverdicted novelty 5.0

Inpainting auxiliary task improves clustering of embeddings for individual zebrafish identification based on skin patterns.
Woosh: A Sound Effects Foundation Model
cs.SD 2026-04 accept novelty 5.0

Woosh is a new publicly released foundation model optimized for high-quality sound effect generation from text or video, showing competitive or better results than open alternatives like Stable Audio Open.