Mmbench: Is your multi-modal model an all-around player?, in: ECCV (6), Springer

Liu, Y

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs

cs.CV · 2025-03-04 · unverdicted · novelty 5.0

Modality-mutual attention (MMA) is introduced to replace causal attention in MLLMs, enabling mutual attention between image and text tokens and claiming SOTA results on 12 multimodal benchmarks with no extra parameters.

citing papers explorer

Showing 1 of 1 citing paper.

Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs cs.CV · 2025-03-04 · unverdicted · none · ref 30
Modality-mutual attention (MMA) is introduced to replace causal attention in MLLMs, enabling mutual attention between image and text tokens and claiming SOTA results on 12 multimodal benchmarks with no extra parameters.

Mmbench: Is your multi-modal model an all-around player?, in: ECCV (6), Springer

fields

years

verdicts

representative citing papers

citing papers explorer