Feature-level Interaction Explanations in Multimodal Transformers

· 2026 · cs.LG · arXiv 2603.13326

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Multimodal Transformers often produce predictions without clarifying how different modalities jointly support a decision. Most existing multimodal explainable AI (MXAI) methods extend unimodal saliency to multimodal backbones, highlighting important tokens or patches within each modality, but they rarely pinpoint which cross-modal feature pairs provide complementary evidence (synergy) or serve as reliable backups (redundancy). We present Feature-level I2MoE (FL-I2MoE), a structured Mixture-of-Experts layer that operates directly on token/patch sequences from frozen pretrained encoders and explicitly separates unique, synergistic, and redundant evidence at the feature level. We further develop an expert-wise explanation pipeline that combines attribution with top-K% masking to assess faithfulness, and we introduce Monte Carlo interaction probes to quantify pairwise behavior: the Shapley Interaction Index (SII) to score synergistic pairs and a redundancy-gap score to capture substitutable (redundant) pairs. Across three benchmarks (MMIMDb, ENRICO, and MMHS150K), FL-I2MoE yields more interactionspecific and concentrated importance patterns than a dense Transformer with the same encoders. Finally, pair-level masking shows that removing pairs ranked by SII or redundancy-gap degrades performance more than masking randomly chosen pairs under the same budget, supporting that the identified interactions are causally relevant. Code is available at https://github.com/dut0817/FL-I2MoE.

representative citing papers

Does Role Specialization Matter for Explanation Faithfulness in Mixture-of-Experts?

cs.LG · 2026-06-28 · unverdicted · novelty 4.0

Representation decorrelation regularization in MoE models improves explanation faithfulness on multimodal benchmarks while preserving task performance.

citing papers explorer

Showing 1 of 1 citing paper.

Does Role Specialization Matter for Explanation Faithfulness in Mixture-of-Experts? cs.LG · 2026-06-28 · unverdicted · none · ref 15 · internal anchor
Representation decorrelation regularization in MoE models improves explanation faithfulness on multimodal benchmarks while preserving task performance.

Feature-level Interaction Explanations in Multimodal Transformers

fields

years

verdicts

representative citing papers

citing papers explorer