Deep unsupervised learning using nonequilibrium thermodynamics. ICML, 2015

Jascha Sohl-Dickstein, Eric Weiss, Niru Maheswaranathan, Surya Ganguli · 2015

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think

cs.LG · 2026-04-25 · unverdicted · novelty 6.0

V-GRPO makes ELBO surrogates stable and efficient for online RL alignment of denoising models, delivering SOTA text-to-image performance with 2-3x speedups over MixGRPO and DiffusionNFT.

citing papers explorer

Showing 1 of 1 citing paper.

V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think

cs.LG · 2026-04-25 · unverdicted · none · ref 35
