hub Mixed citations

International Conference on Learning Representations , year=

Score-Based Generative Modeling through Stochastic Differential Equations , author=

Mixed citation behavior. Most common role is background (67%).

27 Pith papers citing it

Background 67% of classified citations

browse 27 citing papers

hub tools

JSON dossier citing papers JSON

citation-role summary

background 4 method 2

citation-polarity summary

background 4 use method 2

representative citing papers

Quotient-Space Diffusion Models

cs.LG · 2026-04-23 · unverdicted · novelty 8.0

Quotient-space diffusion models generate correct symmetric distributions by removing redundancy on the quotient space, simplifying learning and improving results on small molecules and proteins under SE(3) symmetry.

Building Normalizing Flows with Stochastic Interpolants

cs.LG · 2022-09-30 · conditional · novelty 8.0

Normalizing flows are constructed by learning the velocity of a stochastic interpolant via a quadratic loss derived from its probability current, yielding an efficient ODE-based alternative to diffusion models.

Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics

stat.ML · 2026-05-18 · unverdicted · novelty 7.0

Higher-order Langevin dynamics reduce memorization in diffusion models by making the data dynamics follow a low-pass-filtered score whose smoothness grows with model order.

Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space

cs.LG · 2026-05-17 · unverdicted · novelty 7.0

Introduces adjoint-equation framework establishing dimension-free convergence bounds in any IPM for discrete diffusion models under masked and uniform priors.

Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement

cs.CL · 2026-05-14 · unverdicted · novelty 7.0

DiHAL uses geometry proxies to pick where to replace the lower layers of a pretrained transformer with a diffusion bridge for hidden-state reconstruction, improving over token-level diffusion baselines on 8B models.

Sobolev Regularized MMD Gradient Flow

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Sobolev regularization on the witness function enables global convergence of MMD gradient flows for both sampling and generative modeling without isoperimetric assumptions.

Kernel-Gradient Drifting Models

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

Kernel-gradient drifting reformulates drifting models via kernel gradients to yield identifiable one-step generation with smoothed score matching and KL descent on Euclidean, Riemannian, and discrete spaces.

Metropolis-Adjusted Diffusion Models

stat.ML · 2026-05-10 · unverdicted · novelty 7.0

Metropolis-adjusted Langevin correctors using score-based acceptance probabilities, including an exact Bernoulli factory method and a Simpson's rule approximation, reduce sampling bias in diffusion models and improve FID scores.

Sinkhorn Treatment Effects: A Causal Optimal Transport Measure

stat.ML · 2026-05-08 · unverdicted · novelty 7.0

The Sinkhorn treatment effect is a new entropic optimal transport measure of divergence between counterfactual distributions that admits first- and second-order pathwise differentiability, debiased estimators, and asymptotically valid tests for distributional treatment effects.

Stochastic Transition-Map Distillation for Fast Probabilistic Inference

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

STMD distills the full transition map of diffusion sampling SDEs into a conditional Mean Flow model to enable fast one- or few-step stochastic sampling without teacher models or bi-level optimization.

TRACE: Transport Alignment Conformal Prediction via Diffusion and Flow Matching Models

stat.ML · 2026-05-08 · unverdicted · novelty 7.0

TRACE creates valid conformal prediction sets for complex generative models by scoring outputs via averaged denoising or velocity errors along stochastic transport paths instead of likelihoods.

Towards Closing the Autoregressive Gap in Language Modeling via Entropy-Gated Continuous Bitstream Diffusion

cs.CL · 2026-05-07 · unverdicted · novelty 7.0

A 130M-parameter continuous bitstream diffusion model with entropy-gated Langevin sampling achieves GenPPL 59.76 on LM1B and 27.06 on OWT, closing the gap to autoregressive models at matched entropy with 256 NFEs.

Stochastic Schr\"odinger Diffusion Models for Pure-State Ensemble Generation

stat.ML · 2026-05-05 · unverdicted · novelty 7.0

SSDMs introduce an intrinsic score-based diffusion framework on the Fubini-Study manifold to sample quantum pure-state ensembles without classical re-preparation.

Beyond Continuity: Simulation-free Reconstruction of Discrete Branching Dynamics from Single-cell Snapshots

cs.LG · 2026-05-01 · unverdicted · novelty 7.0

Unbalanced Schrödinger Bridge (USB) provides a tractable, simulation-free solution to the Branching Schrödinger Bridge problem for modeling discrete birth-death dynamics at single-cell resolution from snapshot data.

Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

cs.LG · 2024-06-06 · conditional · novelty 7.0

Absorbing discrete diffusion models the conditional distributions of clean data; reparameterizing yields a time-independent RADD that unifies with AO-ARMs and reaches SOTA perplexity among diffusion models on zero-shot language benchmarks.

Diffusion Domain Expansion: Learning to Coordinate Pre-trained Diffusion Models

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

DDE introduces a compact coordinator network that combines denoised outputs from pre-trained diffusion models to enable generation in larger domains and complex conditioning settings.

Self-Supervised On-Policy Distillation for Reasoning Language Models

cs.LG · 2026-05-17 · unverdicted · novelty 6.0

SSOPD converts intra-group correct-wrong contrast into process supervision by distilling a teacher distribution from the shortest correct completion into prefixes of the longest wrong completion, improving GRPO on AIME and HMMT benchmarks.

Discrete Flow Matching for Offline-to-Online Reinforcement Learning

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

DRIFT enables stable offline-to-online fine-tuning of CTMC policies in discrete RL via advantage-weighted discrete flow matching, path-space regularization, and candidate-set approximation.

Couple to Control: Joint Initial Noise Design in Diffusion Models

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

Coupled initial noises in diffusion models, with designed dependence but unchanged marginal Gaussians, improve generated image diversity on Stable Diffusion variants while preserving quality and alignment.

Consistent Diffusion Language Models

cs.LG · 2026-04-30 · unverdicted · novelty 6.0

CDLM trains denoisers to be path-invariant across stochastic posterior bridges in discrete diffusion, unifying prior methods and achieving new SOTA few-step text generation performance.

Cold-Start Forecasting of New Product Life-Cycles via Conditional Diffusion Models

cs.LG · 2026-04-22 · unverdicted · novelty 6.0

CDLF applies conditional diffusion models to produce probabilistic life-cycle forecasts for new products by conditioning on static descriptors and reference trajectories from similar items.

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

cs.CL · 2024-10-23 · conditional · novelty 6.0

Adapting autoregressive models via continual pre-training yields diffusion language models from 127M to 7B parameters that outperform prior diffusion models and compete with their autoregressive counterparts on language, reasoning, and commonsense benchmarks.

Rectified Flow: A Marginal Preserving Approach to Optimal Transport

stat.ML · 2022-09-29 · unverdicted · novelty 6.0

A single-objective rectified flow variant uses neural ODEs trained by regression to monotonically decrease a fixed convex transport cost while preserving marginal distributions.

RNA-FM: Flow-Matching Generative Model for Genome-wide RNA-Seq Prediction

cs.CV · 2026-05-12 · unverdicted · novelty 5.0

RNA-FM is a flow-matching generative model that predicts genome-wide bulk RNA-seq expression from WSIs by learning a conditional velocity field, outperforming deterministic baselines.

citing papers explorer

Showing 27 of 27 citing papers.

Quotient-Space Diffusion Models cs.LG · 2026-04-23 · unverdicted · none · ref 48
Quotient-space diffusion models generate correct symmetric distributions by removing redundancy on the quotient space, simplifying learning and improving results on small molecules and proteins under SE(3) symmetry.
Building Normalizing Flows with Stochastic Interpolants cs.LG · 2022-09-30 · conditional · none · ref 69
Normalizing flows are constructed by learning the velocity of a stochastic interpolant via a quadratic loss derived from its probability current, yielding an efficient ODE-based alternative to diffusion models.
Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics stat.ML · 2026-05-18 · unverdicted · none · ref 5
Higher-order Langevin dynamics reduce memorization in diffusion models by making the data dynamics follow a low-pass-filtered score whose smoothness grows with model order.
Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space cs.LG · 2026-05-17 · unverdicted · none · ref 17
Introduces adjoint-equation framework establishing dimension-free convergence bounds in any IPM for discrete diffusion models under masked and uniform priors.
Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement cs.CL · 2026-05-14 · unverdicted · none · ref 4
DiHAL uses geometry proxies to pick where to replace the lower layers of a pretrained transformer with a diffusion bridge for hidden-state reconstruction, improving over token-level diffusion baselines on 8B models.
Sobolev Regularized MMD Gradient Flow cs.LG · 2026-05-12 · unverdicted · none · ref 65
Sobolev regularization on the witness function enables global convergence of MMD gradient flows for both sampling and generative modeling without isoperimetric assumptions.
Kernel-Gradient Drifting Models cs.LG · 2026-05-11 · unverdicted · none · ref 53
Kernel-gradient drifting reformulates drifting models via kernel gradients to yield identifiable one-step generation with smoothed score matching and KL descent on Euclidean, Riemannian, and discrete spaces.
Metropolis-Adjusted Diffusion Models stat.ML · 2026-05-10 · unverdicted · none · ref 39
Metropolis-adjusted Langevin correctors using score-based acceptance probabilities, including an exact Bernoulli factory method and a Simpson's rule approximation, reduce sampling bias in diffusion models and improve FID scores.
Sinkhorn Treatment Effects: A Causal Optimal Transport Measure stat.ML · 2026-05-08 · unverdicted · none · ref 84
The Sinkhorn treatment effect is a new entropic optimal transport measure of divergence between counterfactual distributions that admits first- and second-order pathwise differentiability, debiased estimators, and asymptotically valid tests for distributional treatment effects.
Stochastic Transition-Map Distillation for Fast Probabilistic Inference cs.LG · 2026-05-08 · unverdicted · none · ref 16
STMD distills the full transition map of diffusion sampling SDEs into a conditional Mean Flow model to enable fast one- or few-step stochastic sampling without teacher models or bi-level optimization.
TRACE: Transport Alignment Conformal Prediction via Diffusion and Flow Matching Models stat.ML · 2026-05-08 · unverdicted · none · ref 71
TRACE creates valid conformal prediction sets for complex generative models by scoring outputs via averaged denoising or velocity errors along stochastic transport paths instead of likelihoods.
Towards Closing the Autoregressive Gap in Language Modeling via Entropy-Gated Continuous Bitstream Diffusion cs.CL · 2026-05-07 · unverdicted · none · ref 2
A 130M-parameter continuous bitstream diffusion model with entropy-gated Langevin sampling achieves GenPPL 59.76 on LM1B and 27.06 on OWT, closing the gap to autoregressive models at matched entropy with 256 NFEs.
Stochastic Schr\"odinger Diffusion Models for Pure-State Ensemble Generation stat.ML · 2026-05-05 · unverdicted · none · ref 4
SSDMs introduce an intrinsic score-based diffusion framework on the Fubini-Study manifold to sample quantum pure-state ensembles without classical re-preparation.
Beyond Continuity: Simulation-free Reconstruction of Discrete Branching Dynamics from Single-cell Snapshots cs.LG · 2026-05-01 · unverdicted · none · ref 100
Unbalanced Schrödinger Bridge (USB) provides a tractable, simulation-free solution to the Branching Schrödinger Bridge problem for modeling discrete birth-death dynamics at single-cell resolution from snapshot data.
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data cs.LG · 2024-06-06 · conditional · none · ref 38
Absorbing discrete diffusion models the conditional distributions of clean data; reparameterizing yields a time-independent RADD that unifies with AO-ARMs and reaches SOTA perplexity among diffusion models on zero-shot language benchmarks.
Diffusion Domain Expansion: Learning to Coordinate Pre-trained Diffusion Models cs.LG · 2026-05-22 · unverdicted · none · ref 2
DDE introduces a compact coordinator network that combines denoised outputs from pre-trained diffusion models to enable generation in larger domains and complex conditioning settings.
Self-Supervised On-Policy Distillation for Reasoning Language Models cs.LG · 2026-05-17 · unverdicted · none · ref 87
SSOPD converts intra-group correct-wrong contrast into process supervision by distilling a teacher distribution from the shortest correct completion into prefixes of the longest wrong completion, improving GRPO on AIME and HMMT benchmarks.
Discrete Flow Matching for Offline-to-Online Reinforcement Learning cs.LG · 2026-05-12 · unverdicted · none · ref 64
DRIFT enables stable offline-to-online fine-tuning of CTMC policies in discrete RL via advantage-weighted discrete flow matching, path-space regularization, and candidate-set approximation.
Couple to Control: Joint Initial Noise Design in Diffusion Models cs.LG · 2026-05-11 · unverdicted · none · ref 13
Coupled initial noises in diffusion models, with designed dependence but unchanged marginal Gaussians, improve generated image diversity on Stable Diffusion variants while preserving quality and alignment.
Consistent Diffusion Language Models cs.LG · 2026-04-30 · unverdicted · none · ref 26
CDLM trains denoisers to be path-invariant across stochastic posterior bridges in discrete diffusion, unifying prior methods and achieving new SOTA few-step text generation performance.
Cold-Start Forecasting of New Product Life-Cycles via Conditional Diffusion Models cs.LG · 2026-04-22 · unverdicted · none · ref 51
CDLF applies conditional diffusion models to produce probabilistic life-cycle forecasts for new products by conditioning on static descriptors and reference trajectories from similar items.
Scaling Diffusion Language Models via Adaptation from Autoregressive Models cs.CL · 2024-10-23 · conditional · none · ref 17
Adapting autoregressive models via continual pre-training yields diffusion language models from 127M to 7B parameters that outperform prior diffusion models and compete with their autoregressive counterparts on language, reasoning, and commonsense benchmarks.
Rectified Flow: A Marginal Preserving Approach to Optimal Transport stat.ML · 2022-09-29 · unverdicted · none · ref 4
A single-objective rectified flow variant uses neural ODEs trained by regression to monotonically decrease a fixed convex transport cost while preserving marginal distributions.
RNA-FM: Flow-Matching Generative Model for Genome-wide RNA-Seq Prediction cs.CV · 2026-05-12 · unverdicted · none · ref 9
RNA-FM is a flow-matching generative model that predicts genome-wide bulk RNA-seq expression from WSIs by learning a conditional velocity field, outperforming deterministic baselines.
APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment cs.CV · 2026-05-08 · unverdicted · none · ref 38
APEX is an assumption-free image quality metric using Sliced Wasserstein Distance on CLIP and DINOv2 embeddings that claims superior robustness to degradations and cross-dataset stability.
LASER: Learning Active Sensing for Continuum Field Reconstruction cs.LG · 2026-04-21 · unverdicted · none · ref 88
LASER trains a reinforcement learning policy inside a latent dynamics model to choose sensor placements that improve reconstruction of continuum fields under sparsity.
On the Robustness of Distribution Support under Diffusion Guidance cs.LG · 2026-05-08 · unreviewed · ref 21

International Conference on Learning Representations , year=

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer