The paper identifies task-composition and fusion bottlenecks as the main barriers in multimodal reasoning, with experiments showing extra modalities help only when they supply independent reasoning paths.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
representative citing papers
Introduces self-captioning and a Multimodal Interaction Gate to amplify redundant multimodal interactions, reporting 38.3% reduction in visual-induced errors and 16.8% consistency improvement.
citing papers explorer
No citing papers match the current filters.