hub Mixed citations

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

Alec Radford, Luke Metz, Soumith Chintala · 2015 · cs.LG · arXiv 1511.06434

Mixed citation behavior. Most common role is background (67%).

51 Pith papers citing it

Background 67% of classified citations

open full Pith review browse 51 citing papers arXiv PDF

abstract

In recent years, supervised learning with convolutional networks (CNNs) has seen huge adoption in computer vision applications. Comparatively, unsupervised learning with CNNs has received less attention. In this work we hope to help bridge the gap between the success of CNNs for supervised learning and unsupervised learning. We introduce a class of CNNs called deep convolutional generative adversarial networks (DCGANs), that have certain architectural constraints, and demonstrate that they are a strong candidate for unsupervised learning. Training on various image datasets, we show convincing evidence that our deep convolutional adversarial pair learns a hierarchy of representations from object parts to scenes in both the generator and discriminator. Additionally, we use the learned features for novel tasks - demonstrating their applicability as general image representations.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 4 method 2

citation-polarity summary

background 4 use method 2

representative citing papers

Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks

cs.CL · 2023-05-02 · unverdicted · novelty 8.0

ciwGAN and fiwGAN models trained on isolated words spontaneously generate concatenated multi-word outputs and display early compositionality precursors.

Toy Models of Superposition

cs.LG · 2022-09-21 · accept · novelty 8.0

Toy models demonstrate that polysemanticity arises when neural networks store more sparse features than neurons via superposition, producing a phase transition tied to polytope geometry and increased adversarial vulnerability.

Generative Language Modeling for Automated Theorem Proving

cs.LG · 2020-09-07 · unverdicted · novelty 8.0

GPT-f, a transformer-based prover for Metamath, generated new short proofs that were accepted into the main library—the first such contribution from a deep-learning system.

AGAN: Towards Automated Design of Generative Adversarial Networks

cs.LG · 2019-06-25 · unverdicted · novelty 8.0

AGAN is the first neural architecture search method for GANs that discovers architectures outperforming state-of-the-art on CIFAR-10 unsupervised image generation and competitive on supervised tasks.

Density estimation using Real NVP

cs.LG · 2016-05-27 · accept · novelty 8.0

Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.

Discriminative Span as a Predictor of Synthetic Data Utility via Classifier Reconstruction

cs.CV · 2026-05-10 · unverdicted · novelty 7.0 · 2 refs

A relative projection error metric in foundation-model embedding space predicts the downstream utility of synthetic positive samples for binary classifiers.

Active Learning for Conditional Generative Compressed Sensing

cs.LG · 2026-05-06 · unverdicted · novelty 7.0

Prompts can be split into separate roles for sampling design and recovery modeling in generative compressed sensing, with stable recovery bounds for matched prompts and an explicit penalty for mismatch, validated on Stable Diffusion.

Physics-informed, Generative Adversarial Design of Funicular Shells

cs.CE · 2026-04-17 · unverdicted · novelty 7.0

A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.

SurFITR: A Dataset for Surveillance Image Forgery Detection and Localisation

cs.CV · 2026-04-08 · conditional · novelty 7.0

SurFITR is a new collection of 137k+ surveillance-style forged images that causes existing detectors to degrade while enabling substantial gains when used for training in both in-domain and cross-domain settings.

Toward Generative Quantum Utility via Correlation-Complexity Map

cs.LG · 2026-03-06 · unverdicted · novelty 7.0

A pre-training diagnostic map based on spectral correlation resemblance to IQP circuits and excess structural complexity identifies suitable datasets like turbulence data for quantum generative models, yielding competitive low-resource performance.

ASTRA: Let Arbitrary Subjects Transform in Video Editing

cs.CV · 2025-10-01 · unverdicted · novelty 7.0

ASTRA is a plug-and-play training-free method for precise multi-subject video editing that uses prompt-guided multimodal alignment and prior-based mask retargeting to avoid attention dilution and boundary issues.

Progressive Growing of GANs for Improved Quality, Stability, and Variation

cs.NE · 2017-10-27 · accept · novelty 7.0

Progressive growing stabilizes GAN training to produce high-resolution images of unprecedented quality and achieves a record unsupervised inception score of 8.80 on CIFAR10.

Mixed Precision Training

cs.AI · 2017-10-10 · accept · novelty 7.0

Mixed precision training uses FP16 for most computations, FP32 master weights for accumulation, and loss scaling to enable accurate training of large DNNs with halved memory usage.

Vision Foundation Models as Generalist Tokenizers for Image Generation

cs.CV · 2026-05-18 · unverdicted · novelty 6.0

VFMTok builds a generalist image tokenizer on frozen VFMs using adaptive quantization and semantic alignment, delivering gFID 1.36 for autoregressive and 1.25 for continuous generation on ImageNet with 3x faster convergence.

Neural Fields for NV-Center Inverse Sensing

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

NeTMY neural fields with annealed encoding, multiscale optimization, and spectrum-fidelity losses achieve superior localization and distributional accuracy in NV-center inverse sensing by using a tensor power-summed dipolar operator that exposes and mitigates center-collapse failures.

Enabling Federated Inference via Unsupervised Consensus Embedding

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

CE-FI maps heterogeneous model representations to a shared embedding space via unsupervised training on unlabeled data, enabling privacy-preserving federated inference that outperforms solo models on image classification benchmarks.

A Dual Perspective on Synthetic Trajectory Generators: Utility Framework and Privacy Vulnerabilities

cs.AI · 2026-04-21 · unverdicted · novelty 6.0

A new framework evaluates utility of synthetic mobility trajectories while a membership inference attack reveals privacy vulnerabilities in generative models thought to be safe.

Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Models

cs.CV · 2026-04-20 · unverdicted · novelty 6.0

Embedding Arithmetic performs vector operations in the embedding space of T2I models to mitigate bias at inference time, outperforming baselines on diversity while preserving coherence via a new Concept Coherence Score.

FatigueFusion: Latent Space Fusion for Fatigue-Driven Motion Synthesis

cs.GR · 2026-04-11 · unverdicted · novelty 6.0

FatigueFusion fuses fatigue features in latent space using algorithmic, data-driven, and PINN modules to synthesize novel fatigued motions from non-fatigued joint sequences in an end-to-end pipeline.

gen2seg: Generative Models Enable Generalizable Instance Segmentation

cs.CV · 2025-05-21 · unverdicted · novelty 6.0

Finetuning generative models on limited instance segmentation data produces zero-shot generalization to unseen object categories and styles, matching or exceeding supervised baselines like SAM on ambiguous boundaries.

"Noisier" Noise Contrastive Eestimation is (Almost) Maximum Likelihood

cs.LG · 2024-05-27 · unverdicted · novelty 6.0

Scaling noise magnitude in NCE aligns gradients with MLE, enabling a practical approximation that improves performance on CIFAR-10 and ImageNet image modeling with fewer training steps.

MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework

cs.AI · 2023-08-01 · unverdicted · novelty 6.0

MetaGPT embeds human SOPs into LLM prompts to create role-specialized agent teams that produce more coherent solutions on collaborative software engineering tasks than prior chat-based multi-agent systems.

VideoGPT: Video Generation using VQ-VAE and Transformers

cs.CV · 2021-04-20 · accept · novelty 6.0

VideoGPT generates competitive natural videos by learning discrete latents with VQ-VAE and modeling them autoregressively with a transformer.

Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning

cs.CV · 2019-07-12 · unverdicted · novelty 6.0

DASCN uses a unified primal-dual GAN architecture to generate semantics-consistent visual features for generalized zero-shot learning, claiming state-of-the-art gains.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Physics-informed, Generative Adversarial Design of Funicular Shells cs.CE · 2026-04-17 · unverdicted · none · ref 28 · internal anchor
A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.
One-Step Generative Modeling via Wasserstein Gradient Flows cs.LG · 2026-05-12 · unreviewed · ref 48 · internal anchor

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer