pith. sign in

← back to paper

Review history

arxiv: 2605.14530 · 2 revisions

Mitigating Mask Prior Drift and Positional Attention Collapse in Large Diffusion Vision-Language Models

  1. 2026-05-20 UNVERDICTED LOW v0.9.0 novelty 6.0
    83188 ms 5743 in 1191 out 2026-05-20T21:42:05.895629+00:00
  2. 2026-05-15 UNVERDICTED LOW v0.9.0 novelty 7.0
    49534 ms 5513 in 1301 out 2026-05-15T02:12:13.889671+00:00