Unlocking Prompt Infilling Capability for Diffusion Language Models

· 2026 · cs.CL · arXiv 2604.03677

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Masked diffusion language models (dLMs) generate text through bidirectional denoising, yet this capability remains locked for infilling prompts. This limitation is an artifact of the current supervised finetuning (SFT) convention of applying response-only masking. To unlock this capability, we extend full-sequence masking during SFT, where both prompts and responses are masked jointly. Once unlocked, the model infills masked portions of a prompt template conditioned on few-shot examples. We show that such model-infilled prompts match or surpass manually designed templates, transfer effectively across models, and are complementary to existing prompt optimization methods. Our results suggest that training practices, not architectural limitations, are the primary bottleneck preventing masked diffusion language models from infilling effective prompts

representative citing papers

Extracting Training Data from Diffusion Language Models via Infilling

cs.CL · 2026-05-22 · unverdicted · novelty 7.0

Infilling extraction on diffusion language models extracts up to three times more verbatim sequences than prefix methods and achieves higher recall on redacted emails than autoregressive models.

citing papers explorer

Showing 1 of 1 citing paper.

Extracting Training Data from Diffusion Language Models via Infilling cs.CL · 2026-05-22 · unverdicted · none · ref 15 · internal anchor
Infilling extraction on diffusion language models extracts up to three times more verbatim sequences than prefix methods and achieves higher recall on redacted emails than autoregressive models.

Unlocking Prompt Infilling Capability for Diffusion Language Models

fields

years

verdicts

representative citing papers

citing papers explorer