arXiv preprint arXiv:2510.03149 , year=

Lecture notes · 2025 · arXiv 2510.03149

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

VGB for Masked Diffusion Model: Efficient Test-time Scaling for Reward Satisfaction and Sample Editing

cs.LG · 2026-06-26 · unverdicted · novelty 7.0

MDM-VGB augments masked diffusion with backtracking-style reward-guided remasking to achieve quadratic-complexity high-reward generation and sample editing, with proofs of noise robustness.

The Power of Test-Time Training for Approximate Sampling

cs.DS · 2026-06-09 · unverdicted · novelty 7.0

Establishes a quadratic lower bound on query complexity for sampling from large classes of distributions given approximate density oracles, answers an open question on optimality of random walks, and shows circumvention for bounded classes as an abstraction of TTT.

The tractability landscape of diffusion alignment: regularization, rewards, and computational primitives

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

The choice of closeness measure in diffusion reward alignment determines the computational primitives and tractable reward classes, with linear exponential tilts sufficing for KL with convex rewards and proximal oracles for Wasserstein with concave or low-dimensional Lipschitz rewards.

citing papers explorer

Showing 3 of 3 citing papers.

VGB for Masked Diffusion Model: Efficient Test-time Scaling for Reward Satisfaction and Sample Editing cs.LG · 2026-06-26 · unverdicted · none · ref 23
MDM-VGB augments masked diffusion with backtracking-style reward-guided remasking to achieve quadratic-complexity high-reward generation and sample editing, with proofs of noise robustness.
The Power of Test-Time Training for Approximate Sampling cs.DS · 2026-06-09 · unverdicted · none · ref 12
Establishes a quadratic lower bound on query complexity for sampling from large classes of distributions given approximate density oracles, answers an open question on optimality of random walks, and shows circumvention for bounded classes as an abstraction of TTT.
The tractability landscape of diffusion alignment: regularization, rewards, and computational primitives cs.LG · 2026-05-12 · unverdicted · none · ref 26
The choice of closeness measure in diffusion reward alignment determines the computational primitives and tractable reward classes, with linear exponential tilts sufficing for KL with convex rewards and proximal oracles for Wasserstein with concave or low-dimensional Lipschitz rewards.

arXiv preprint arXiv:2510.03149 , year=

fields

years

verdicts

representative citing papers

citing papers explorer