Final score for the sampled embeddings is obtained as an average of these values

Scoring Head:The transformer outputs for each token are projected to a scalar value · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Early Decisions Matter: Proximity Bias and Initial Trajectory Shaping in Non-Autoregressive Diffusion Language Models

cs.CL · 2026-04-12 · unverdicted · novelty 5.0

Non-autoregressive diffusion language models have an inherent proximity bias in token unmasking that causes spatial error propagation, which a minimal planner and annealing strategy can mitigate for better reasoning performance.

citing papers explorer

Showing 1 of 1 citing paper.

Early Decisions Matter: Proximity Bias and Initial Trajectory Shaping in Non-Autoregressive Diffusion Language Models cs.CL · 2026-04-12 · unverdicted · none · ref 14
Non-autoregressive diffusion language models have an inherent proximity bias in token unmasking that causes spatial error propagation, which a minimal planner and annealing strategy can mitigate for better reasoning performance.

Final score for the sampled embeddings is obtained as an average of these values

fields

years

verdicts

representative citing papers

citing papers explorer