Multimodal latent language modeling with next-token diffusion

8 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

years

2026: 6 · 2025: 2

roles

background 2

polarities

background 2

representative citing papers

Leveraging Latent Visual Reasoning in Silence

cs.CV · 2026-05-18 · conditional · novelty 6.0

Latent visual reasoning improves multimodal models via training effects even without using latent tokens at inference, enabled by an attention-based RL reward that promotes interaction with text tokens.
