Feature-level Interaction Explanations in Multimodal Transformers

Housam Khalifa Bashier Babiker; Mi-Young Kim; Randy Goebel; Yeji Kim

arxiv: 2603.13326 · v2 · pith:RX34T5AKnew · submitted 2026-03-04 · 💻 cs.LG · cs.AI

Feature-level Interaction Explanations in Multimodal Transformers

Yeji Kim , Housam Khalifa Bashier Babiker , Mi-Young Kim , Randy Goebel This is my paper

classification 💻 cs.LG cs.AI

keywords pairsmultimodalfl-i2moeinteractionmaskingencodersevidencefeature

0 comments

read the original abstract

Multimodal Transformers often produce predictions without clarifying how different modalities jointly support a decision. Most existing multimodal explainable AI (MXAI) methods extend unimodal saliency to multimodal backbones, highlighting important tokens or patches within each modality, but they rarely pinpoint which cross-modal feature pairs provide complementary evidence (synergy) or serve as reliable backups (redundancy). We present Feature-level I2MoE (FL-I2MoE), a structured Mixture-of-Experts layer that operates directly on token/patch sequences from frozen pretrained encoders and explicitly separates unique, synergistic, and redundant evidence at the feature level. We further develop an expert-wise explanation pipeline that combines attribution with top-K% masking to assess faithfulness, and we introduce Monte Carlo interaction probes to quantify pairwise behavior: the Shapley Interaction Index (SII) to score synergistic pairs and a redundancy-gap score to capture substitutable (redundant) pairs. Across three benchmarks (MMIMDb, ENRICO, and MMHS150K), FL-I2MoE yields more interactionspecific and concentrated importance patterns than a dense Transformer with the same encoders. Finally, pair-level masking shows that removing pairs ranked by SII or redundancy-gap degrades performance more than masking randomly chosen pairs under the same budget, supporting that the identified interactions are causally relevant. Code is available at https://github.com/dut0817/FL-I2MoE.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Does Role Specialization Matter for Explanation Faithfulness in Mixture-of-Experts?
cs.LG 2026-06 unverdicted novelty 4.0

Representation decorrelation regularization in MoE models improves explanation faithfulness on multimodal benchmarks while preserving task performance.