DUEL : Exact likelihood for masked diffusion via deterministic unmasking

Gilad Turok, Chris De Sa, Volodymyr Kuleshov · 2026 · arXiv 2603.01367

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

TUBE: Tangent Upper Bound on Evidence for Discrete Diffusion Language Models

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

TUBE is a new upper bound on evidence for discrete diffusion models that shows block MDMs and AO-ARMs have strictly lower likelihood than exact ARMs.

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models

cs.LG · 2026-05-28 · unverdicted · novelty 6.0

GDSD reduces RL for dLLMs to likelihood-free self-distillation via a normalization-free logit-matching objective, outperforming ELBO methods with more stable training on LLaDA-8B and Dream-7B.

citing papers explorer

Showing 2 of 2 citing papers after filters.

TUBE: Tangent Upper Bound on Evidence for Discrete Diffusion Language Models cs.LG · 2026-05-22 · unverdicted · none · ref 45
TUBE is a new upper bound on evidence for discrete diffusion models that shows block MDMs and AO-ARMs have strictly lower likelihood than exact ARMs.
GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models cs.LG · 2026-05-28 · unverdicted · none · ref 51
GDSD reduces RL for dLLMs to likelihood-free self-distillation via a normalization-free logit-matching objective, outperforming ELBO methods with more stable training on LLaDA-8B and Dream-7B.

DUEL : Exact likelihood for masked diffusion via deterministic unmasking

fields

years

verdicts

representative citing papers

citing papers explorer