Any-order flexible length masked diffusion. arXiv preprint arXiv:2509.01025

URL https://arxiv · 2025 · arXiv 2509.01025

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Discrete Tilt Matching

cs.LG · 2026-04-20 · unverdicted · novelty 7.0

Discrete Tilt Matching recasts dLLM fine-tuning as state-level matching of tilted local unmasking posteriors, producing a stable weighted cross-entropy loss that improves Sudoku and Countdown performance when applied to LLaDA-8B-Instruct.

Generative Modeling from Black-box Corruptions via Self-Consistent Stochastic Interpolants

cs.LG · 2025-12-11 · unverdicted · novelty 7.0

SCSI iteratively refines a self-consistent transport map to invert black-box corruptions and enable generative modeling of clean data.

Edit-Based Refinement for Parallel Masked Diffusion Language Models

cs.CL · 2026-05-10 · unverdicted · novelty 6.0

ME-DLM augments parallel masked diffusion models with edit-distance-supervised refinements to raise quality on coding and math benchmarks while using far fewer diffusion steps.

CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credit

cs.CL · 2025-10-07

citing papers explorer

Showing 4 of 4 citing papers.

Discrete Tilt Matching · cs.LG · 2026-04-20 · unverdicted · none · ref 16
Discrete Tilt Matching recasts dLLM fine-tuning as state-level matching of tilted local unmasking posteriors, producing a stable weighted cross-entropy loss that improves Sudoku and Countdown performance when applied to LLaDA-8B-Instruct.
Generative Modeling from Black-box Corruptions via Self-Consistent Stochastic Interpolants · cs.LG · 2025-12-11 · unverdicted · none · ref 12
SCSI iteratively refines a self-consistent transport map to invert black-box corruptions and enable generative modeling of clean data.
Edit-Based Refinement for Parallel Masked Diffusion Language Models · cs.CL · 2026-05-10 · unverdicted · none · ref 18
ME-DLM augments parallel masked diffusion models with edit-distance-supervised refinements to raise quality on coding and math benchmarks while using far fewer diffusion steps.
CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credit · cs.CL · 2025-10-07 · unreviewed · ref 8
