Multimodal contrastive learning using multilinear products is fragile to single bad modalities, and a gated version improves top-1 retrieval accuracy on synthetic and real trimodal data.
Garomsa, Anna Zapaishchykova, Tafadzwa L
5 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 5roles
background 1polarities
background 1representative citing papers
MICViT outperforms CNN and transformer baselines on brain age prediction from multimodal 3D MRI by combining modality-specific and cross-modal local/global attention across three heterogeneous datasets.
NeuroBridge integrates self-supervised MRI pretraining with hippocampal tasks and gated fusion to reach 88.17% AD vs. CN accuracy on ADNI and 82.78% on OASIS, claiming gains over single-task methods with cross-cohort generalization.
Empirical comparison of graded MRI preprocessing levels for MAE and JEPA pretraining on brain scans shows moderate levels (P2) are often sufficient, with limited additional utility from stronger preprocessing on downstream tasks.
Bayesian meta-learner predicts individualized Alzheimer's disease progression distributions from MRI and trajectories, competitive on ADNI data and less overconfident for long-term scores than deterministic versions.
citing papers explorer
-
Hidden in the Multiplicative Interaction: Uncovering Fragility in Multimodal Contrastive Learning
Multimodal contrastive learning using multilinear products is fragile to single bad modalities, and a gated version improves top-1 retrieval accuracy on synthetic and real trimodal data.