hub

Generative modeling by estimating gradients of the data distribution.Advances in neural information processing systems, 32

Yang Song, Stefano Ermon · 2019

22 Pith papers cite this work. Polarity classification is still indexing.

22 Pith papers citing it

browse 22 citing papers

hub tools

JSON dossier citing papers JSON

citation-role summary

background 2 method 1

citation-polarity summary

background 2 use method 1

representative citing papers

Training-Free Generative Sampling via Moment-Matched Score Smoothing

stat.ML · 2026-05-14 · unverdicted · novelty 7.0

MM-SOLD is a training-free particle sampler whose large-particle limit converges to a moment-matched Gibbs distribution obtained by exponentially tilting a score-smoothed target.

JEDI: Joint Embedding Diffusion World Model for Online Model-Based Reinforcement Learning

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

JEDI is the first online end-to-end latent diffusion world model that trains latents from denoising loss rather than reconstruction, achieving competitive Atari100k results with 43% less VRAM and over 3x faster sampling than pixel diffusion baselines.

Constraint-Aware Flow Matching: Decision Aligned End-to-End Training for Constrained Sampling

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Constraint-Aware Flow Matching integrates constraint projections into the flow matching training objective to align model dynamics with constrained sampling and reduce distributional shift.

Arena as Offline Reward: Efficient Fine-Grained Preference Optimization for Diffusion Models

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

ArenaPO infers Gaussian capability distributions from pairwise preferences and applies truncated-normal latent inference to derive fine-grained offline rewards for preference optimization of text-to-image diffusion models.

Semantically Structured Mixture-of-Experts for Compositional Robotic Manipulation

cs.RO · 2026-05-22 · unverdicted · novelty 6.0

SMoDP routes action chunks in a diffusion policy to semantically specialized experts via a VLM-supervised skill predictor and dual contrastive alignment, achieving better efficiency and compositional transfer than baselines.

Provably Learning Diffusion Models under the Manifold Hypothesis: Collapse and Refine

cs.LG · 2026-05-16 · unverdicted · novelty 6.0

SiLD is a score-matching framework that learns both manifold projection and intrinsic density from a single objective, with proven sample complexity depending only on intrinsic dimension.

Registers Matter for Pixel-Space Diffusion Transformers

cs.CV · 2026-05-15 · unverdicted · novelty 6.0

Register tokens enhance pixel-space DiT training and output quality via cleaner high-noise feature maps, and a dual-stream design adds further gains with little overhead.

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

cs.CV · 2026-05-15 · unverdicted · novelty 6.0

Flash-GRPO introduces iso-temporal grouping and temporal gradient rectification to enable single-step GRPO training that outperforms full-trajectory methods on video diffusion alignment under low compute budgets.

Slowly Annealed Langevin Dynamics: Theory and Applications to Training-Free Guided Generation

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

Slowly Annealed Langevin Dynamics provides non-asymptotic KL-based convergence guarantees for tracking moving targets and enables training-free guided generation via a velocity-aware correction that accounts for pretrained marginals.

Physical Fidelity Reconstruction via Improved Consistency-Distilled Flow Matching for Dynamical Systems

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

Distilled one-step consistency model from optimal-transport flow-matching teacher reconstructs high-fidelity dynamical system flows from low-fidelity data with 12x speedup, half the parameters, and 23.1% better SSIM than scratch-trained baselines.

Active Learning for Communication Structure Optimization in LLM-Based Multi-Agent Systems

cs.MA · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

An ensemble-based information-theoretic active learning method using ensemble Kalman inversion selects valuable tasks to optimize communication structures in LLM multi-agent systems more reliably than random sampling under limited training budgets.

Taming Outlier Tokens in Diffusion Transformers

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

Outlier tokens in DiTs are addressed with Dual-Stage Registers, which reduce artifacts and improve image generation on ImageNet and text-to-image tasks.

Language Diffusion Models are Associative Memories Capable of Retrieving Unseen Data

cs.LG · 2026-04-29 · unverdicted · novelty 6.0

Uniform-based discrete diffusion models behave as associative memories that retrieve unseen data, with a dataset-size-driven memorization-to-generalization transition detectable via conditional entropy of token predictions.

Learning biophysical models of gene regulation with probability flow matching

q-bio.MN · 2026-04-27 · unverdicted · novelty 6.0

Probability Flow Matching learns biophysically consistent stochastic processes for gene regulation from time-resolved single-cell measurements, where only the biophysical versions accurately capture lineage transitions, fate specification, and perturbation responses despite similar data fit.

Efficient Diffusion Distillation via Embedding Loss

cs.CV · 2026-04-24 · unverdicted · novelty 6.0

Embedding Loss aligns feature distributions via MMD in random network embeddings to boost one-step diffusion distillation, reaching SOTA FID of 1.475 on CIFAR-10 unconditional generation.

Energy-Guided Generative Modeling for Low-Energy Molecular Structure Discovery

cs.LG · 2025-12-27 · unverdicted · novelty 6.0

EnFlow integrates flow-based conformer generation with energy landscape modeling to enable joint ensemble generation and ground-state identification using only 1-2 ODE steps.

Efficient Score Pre-computation for Diffusion Models via Cross-Matrix Krylov Projection

cs.CV · 2025-11-19 · unverdicted · novelty 6.0

Cross-matrix Krylov projection reuses shared subspaces from seed matrices to accelerate score pre-computation in diffusion models, delivering 15.8-43.7% time savings and up to 115x speedup versus DDPM baselines.

DanceGRPO: Unleashing GRPO on Visual Generation

cs.CV · 2025-05-12 · unverdicted · novelty 6.0

DanceGRPO applies GRPO to visual generation tasks to achieve stable policy optimization across diffusion models, rectified flows, multiple tasks, and diverse reward models, outperforming prior RL methods.

The Amazing Stability of Flow Matching

cs.CV · 2026-04-17 · unverdicted · novelty 5.0

Flow matching generative models preserve sample quality, diversity, and latent representations despite pruning 50% of the CelebA-HQ dataset or altering architecture and training configurations.

Learning World Models for Interactive Video Generation

cs.CV · 2025-05-28 · unverdicted · novelty 5.0

The work introduces video retrieval augmented generation (VRAG) with explicit global state conditioning to reduce compounding errors and improve spatiotemporal consistency in interactive video world models.

Accelerating Redshift-Conditioned Galaxy Image Synthesis with One-step Generative Modeling

astro-ph.IM · 2026-05-17 · unverdicted · novelty 4.0

One-step pixel-MeanFlow models recover key galaxy morphology statistics at orders-of-magnitude lower computational cost than standard DDPM sampling while remaining weaker on fine-grained structure.

Technical Note on Relating Scores of Tilted Distributions

math.ST · 2026-04-29 · unverdicted · novelty 4.0

Extends score relations for tilted distributions to constant negative diagonal tilts by linking denoisers via Tweedie's formula, yielding location and time shifts in the score operator.

citing papers explorer

Showing 22 of 22 citing papers.

Training-Free Generative Sampling via Moment-Matched Score Smoothing stat.ML · 2026-05-14 · unverdicted · none · ref 3
MM-SOLD is a training-free particle sampler whose large-particle limit converges to a moment-matched Gibbs distribution obtained by exponentially tilting a score-smoothed target.
JEDI: Joint Embedding Diffusion World Model for Online Model-Based Reinforcement Learning cs.LG · 2026-05-13 · unverdicted · none · ref 57
JEDI is the first online end-to-end latent diffusion world model that trains latents from denoising loss rather than reconstruction, achieving competitive Atari100k results with 43% less VRAM and over 3x faster sampling than pixel diffusion baselines.
Constraint-Aware Flow Matching: Decision Aligned End-to-End Training for Constrained Sampling cs.LG · 2026-05-12 · unverdicted · none · ref 51
Constraint-Aware Flow Matching integrates constraint projections into the flow matching training objective to align model dynamics with constrained sampling and reduce distributional shift.
Arena as Offline Reward: Efficient Fine-Grained Preference Optimization for Diffusion Models cs.CV · 2026-05-07 · unverdicted · none · ref 31
ArenaPO infers Gaussian capability distributions from pairwise preferences and applies truncated-normal latent inference to derive fine-grained offline rewards for preference optimization of text-to-image diffusion models.
Semantically Structured Mixture-of-Experts for Compositional Robotic Manipulation cs.RO · 2026-05-22 · unverdicted · none · ref 35
SMoDP routes action chunks in a diffusion policy to semantically specialized experts via a VLM-supervised skill predictor and dual contrastive alignment, achieving better efficiency and compositional transfer than baselines.
Provably Learning Diffusion Models under the Manifold Hypothesis: Collapse and Refine cs.LG · 2026-05-16 · unverdicted · none · ref 51
SiLD is a score-matching framework that learns both manifold projection and intrinsic density from a single objective, with proven sample complexity depending only on intrinsic dimension.
Registers Matter for Pixel-Space Diffusion Transformers cs.CV · 2026-05-15 · unverdicted · none · ref 19
Register tokens enhance pixel-space DiT training and output quality via cleaner high-noise feature maps, and a dual-stream design adds further gains with little overhead.
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization cs.CV · 2026-05-15 · unverdicted · none · ref 23
Flash-GRPO introduces iso-temporal grouping and temporal gradient rectification to enable single-step GRPO training that outperforms full-trajectory methods on video diffusion alignment under low compute budgets.
Slowly Annealed Langevin Dynamics: Theory and Applications to Training-Free Guided Generation cs.LG · 2026-05-08 · unverdicted · none · ref 29
Slowly Annealed Langevin Dynamics provides non-asymptotic KL-based convergence guarantees for tracking moving targets and enables training-free guided generation via a velocity-aware correction that accounts for pretrained marginals.
Physical Fidelity Reconstruction via Improved Consistency-Distilled Flow Matching for Dynamical Systems cs.LG · 2026-05-07 · unverdicted · none · ref 8
Distilled one-step consistency model from optimal-transport flow-matching teacher reconstructs high-fidelity dynamical system flows from low-fidelity data with 12x speedup, half the parameters, and 23.1% better SSIM than scratch-trained baselines.
Active Learning for Communication Structure Optimization in LLM-Based Multi-Agent Systems cs.MA · 2026-05-07 · unverdicted · none · ref 38 · 2 links
An ensemble-based information-theoretic active learning method using ensemble Kalman inversion selects valuable tasks to optimize communication structures in LLM multi-agent systems more reliably than random sampling under limited training budgets.
Taming Outlier Tokens in Diffusion Transformers cs.CV · 2026-05-06 · unverdicted · none · ref 28
Outlier tokens in DiTs are addressed with Dual-Stage Registers, which reduce artifacts and improve image generation on ImageNet and text-to-image tasks.
Language Diffusion Models are Associative Memories Capable of Retrieving Unseen Data cs.LG · 2026-04-29 · unverdicted · none · ref 3
Uniform-based discrete diffusion models behave as associative memories that retrieve unseen data, with a dataset-size-driven memorization-to-generalization transition detectable via conditional entropy of token predictions.
Learning biophysical models of gene regulation with probability flow matching q-bio.MN · 2026-04-27 · unverdicted · none · ref 77
Probability Flow Matching learns biophysically consistent stochastic processes for gene regulation from time-resolved single-cell measurements, where only the biophysical versions accurately capture lineage transitions, fate specification, and perturbation responses despite similar data fit.
Efficient Diffusion Distillation via Embedding Loss cs.CV · 2026-04-24 · unverdicted · none · ref 2
Embedding Loss aligns feature distributions via MMD in random network embeddings to boost one-step diffusion distillation, reaching SOTA FID of 1.475 on CIFAR-10 unconditional generation.
Energy-Guided Generative Modeling for Low-Energy Molecular Structure Discovery cs.LG · 2025-12-27 · unverdicted · none · ref 15
EnFlow integrates flow-based conformer generation with energy landscape modeling to enable joint ensemble generation and ground-state identification using only 1-2 ODE steps.
Efficient Score Pre-computation for Diffusion Models via Cross-Matrix Krylov Projection cs.CV · 2025-11-19 · unverdicted · none · ref 1
Cross-matrix Krylov projection reuses shared subspaces from seed matrices to accelerate score pre-computation in diffusion models, delivering 15.8-43.7% time savings and up to 115x speedup versus DDPM baselines.
DanceGRPO: Unleashing GRPO on Visual Generation cs.CV · 2025-05-12 · unverdicted · none · ref 30
DanceGRPO applies GRPO to visual generation tasks to achieve stable policy optimization across diffusion models, rectified flows, multiple tasks, and diverse reward models, outperforming prior RL methods.
The Amazing Stability of Flow Matching cs.CV · 2026-04-17 · unverdicted · none · ref 29
Flow matching generative models preserve sample quality, diversity, and latent representations despite pruning 50% of the CelebA-HQ dataset or altering architecture and training configurations.
Learning World Models for Interactive Video Generation cs.CV · 2025-05-28 · unverdicted · none · ref 5
The work introduces video retrieval augmented generation (VRAG) with explicit global state conditioning to reduce compounding errors and improve spatiotemporal consistency in interactive video world models.
Accelerating Redshift-Conditioned Galaxy Image Synthesis with One-step Generative Modeling astro-ph.IM · 2026-05-17 · unverdicted · none · ref 50
One-step pixel-MeanFlow models recover key galaxy morphology statistics at orders-of-magnitude lower computational cost than standard DDPM sampling while remaining weaker on fine-grained structure.
Technical Note on Relating Scores of Tilted Distributions math.ST · 2026-04-29 · unverdicted · none · ref 7
Extends score relations for tilted distributions to constant negative diagonal tilts by linking denoisers via Tweedie's formula, yielding location and time shifts in the score operator.

Generative modeling by estimating gradients of the data distribution.Advances in neural information processing systems, 32

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer