pith. sign in

hub Mixed citations

Knowledge Distillation in Iterative Generative Models for Improved Sampling Speed

Mixed citation behavior. Most common role is background (67%).

27 Pith papers citing it
Background 67% of classified citations
abstract

Iterative generative models, such as noise conditional score networks and denoising diffusion probabilistic models, produce high quality samples by gradually denoising an initial noise vector. However, their denoising process has many steps, making them 2-3 orders of magnitude slower than other generative models such as GANs and VAEs. In this paper, we establish a novel connection between knowledge distillation and image generation with a technique that distills a multi-step denoising process into a single step, resulting in a sampling speed similar to other single-step generative models. Our Denoising Student generates high quality samples comparable to GANs on the CIFAR-10 and CelebA datasets, without adversarial training. We demonstrate that our method scales to higher resolutions through experiments on 256 x 256 LSUN. Code and checkpoints are available at https://github.com/tcl9876/Denoising_Student

hub tools

citation-role summary

background 6 method 2 baseline 1

citation-polarity summary

representative citing papers

Consistency Models

cs.LG · 2023-03-02 · conditional · novelty 8.0

Consistency models achieve fast one-step generation with SOTA FID of 3.55 on CIFAR-10 and 6.20 on ImageNet 64x64 by directly mapping noise to data, outperforming prior distillation techniques.

Generative Pseudo-Force Fields for Molecular Generation

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

Proposes generative pseudo-force fields trained on quadratic pseudo-potentials from noisy equilibria as a time-step-agnostic diffusion variant for efficient molecular conformation generation with high validity on QM9.

One Step Diffusion via Shortcut Models

cs.LG · 2024-10-16 · conditional · novelty 7.0

Shortcut models enable high-quality single or few-step sampling in diffusion models with one network and training phase by conditioning on desired step size.

Diffusion Models Beat GANs on Image Synthesis

cs.LG · 2021-05-11 · accept · novelty 7.0

Diffusion models with architecture improvements and classifier guidance achieve superior FID scores to GANs on unconditional and conditional ImageNet image synthesis.

Fast Image Super-Resolution via Consistency Rectified Flow

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

FlowSR enables single-step image super-resolution by learning a rectified flow from LR to HR with consistency distillation, HR regularization, and dual fast-slow timestep scheduling.

MixFlow: Mixed Source Distributions Improve Rectified Flows

cs.CV · 2026-04-10 · unverdicted · novelty 6.0

Mixing unconditional Gaussian noise with a κ-conditioned source during training of rectified flows reduces path curvature, yielding 12% better FID scores and faster sampling than standard rectified flows.

A Unified View of Score-Based and Drifting Models

cs.LG · 2026-03-08 · unverdicted · novelty 6.0

Drifting with Gaussian kernels exactly matches score-matching on smoothed distributions via Tweedie's formula, while Laplace kernels approximate this closely in high dimensions.

MAGI-1: Autoregressive Video Generation at Scale

cs.CV · 2025-05-19 · unverdicted · novelty 6.0

MAGI-1 is a 24B-parameter autoregressive video world model that predicts denoised frame chunks sequentially with increasing noise to enable causal, scalable, streaming generation up to 4M token contexts.

Improved Techniques for Training Consistency Models

cs.LG · 2023-10-22 · accept · novelty 6.0

Improved consistency training techniques achieve FID scores of 2.51 on CIFAR-10 and 3.25 on ImageNet 64x64 in one sampling step, outperforming prior consistency training and distillation methods.

BADiff: Bandwidth Adaptive Diffusion Model

cs.CV · 2025-10-24 · unverdicted · novelty 5.0

BADiff introduces joint training of diffusion models with quality conditioning derived from bandwidth to enable adaptive early-stop sampling that preserves appropriate perceptual quality.

Elucidating the SNR-t Bias of Diffusion Probabilistic Models

cs.CV · 2026-04-17 · unverdicted · novelty 4.0

Diffusion models have an SNR-timestep mismatch during inference that the authors mitigate with per-frequency differential correction, raising generation quality across IDDPM, ADM, DDIM and others.

citing papers explorer

Showing 27 of 27 citing papers.