arXiv preprint arXiv:2509.13866 , year=

Masked Diffusion Models as Energy Minimization , author= · 2025 · arXiv 2509.13866

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models

cs.CL · 2026-06-09 · unverdicted · novelty 7.0

Prefilling-dLLM partitions prefixes into chunks, caches KV representations, and applies sparse top-K selection during decoding to cut dLLM inference complexity to quadratic in decode length only.

Fixed-Point Masked Generative Modeling

cs.LG · 2026-05-29 · unverdicted · novelty 6.0

FP-MGMs with consistency loss and three-state reuse (CoFRe) reduce parameters by up to 38.8% and improve low-budget perplexity and FID versus standard masked generative models on text and images.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models cs.CL · 2026-06-09 · unverdicted · none · ref 13
Prefilling-dLLM partitions prefixes into chunks, caches KV representations, and applies sparse top-K selection during decoding to cut dLLM inference complexity to quadratic in decode length only.
Fixed-Point Masked Generative Modeling cs.LG · 2026-05-29 · unverdicted · none · ref 9
FP-MGMs with consistency loss and three-state reuse (CoFRe) reduce parameters by up to 38.8% and improve low-budget perplexity and FID versus standard masked generative models on text and images.

arXiv preprint arXiv:2509.13866 , year=

fields

years

verdicts

representative citing papers

citing papers explorer