An adjoint-equation framework yields dimension-free convergence bounds in any integral probability metric (IPM) for discrete diffusion models under masked or uniform priors, using a single rate-matrix regularity assumption.
Structured denoising diffusion models in discrete state-spaces
3 Pith papers cite this work. Polarity classification is still indexing.
Verdicts: 3 unverdicted
Representative citing papers
RSPO interprets reward advantages as targets for relative log-ratios in dLLMs, calibrating noisy estimates to stabilize RLVR training and achieve strong gains on planning tasks with competitive math reasoning performance.
A method trains discrete diffusion policies for combinatorial RL by matching to a PMD-regularized target distribution, reporting SOTA performance and sample efficiency on DNA generation, macro-action, and multi-agent benchmarks.