The paper identifies task-composition and fusion bottlenecks as the main barriers in multimodal reasoning, with experiments showing extra modalities help only when they supply independent reasoning paths.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
ACCEPT 1representative citing papers
citing papers explorer
-
Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning
The paper identifies task-composition and fusion bottlenecks as the main barriers in multimodal reasoning, with experiments showing extra modalities help only when they supply independent reasoning paths.