hub

MOSI: Multimodal Corpus of Sentiment Intensity and Subjectivity Analysis in Online Opinion Videos

Amir Zadeh, Rowan Zellers, Eli Pincus, Louis-Philippe Morency · 2016 · cs.CL · arXiv 1606.06259

17 Pith papers cite this work. Polarity classification is still indexing.

17 Pith papers citing it

open full Pith review browse 17 citing papers arXiv PDF

abstract

People are sharing their opinions, stories and reviews through online video sharing websites every day. Studying sentiment and subjectivity in these opinion videos is experiencing a growing attention from academia and industry. While sentiment analysis has been successful for text, it is an understudied research question for videos and multimedia content. The biggest setbacks for studies in this direction are lack of a proper dataset, methodology, baselines and statistical analysis of how information from different modality sources relate to each other. This paper introduces to the scientific community the first opinion-level annotated corpus of sentiment and subjectivity analysis in online videos called Multimodal Opinion-level Sentiment Intensity dataset (MOSI). The dataset is rigorously annotated with labels for subjectivity, sentiment intensity, per-frame and per-opinion annotated visual features, and per-milliseconds annotated audio features. Furthermore, we present baselines for future studies in this direction as well as a new multimodal fusion approach that jointly models spoken words and visual gestures.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 1 dataset 1

citation-polarity summary

background 2

representative citing papers

Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate

cs.LG · 2025-05-26 · unverdicted · novelty 7.0

ConfSMoE adds expert-opinion imputation and detaches softmax routing scores to ground-truth task confidence to relieve expert collapse in SMoE without extra load-balance losses, evaluated on four real-world datasets.

Deep Multimodal Learning with Missing Modality: A Survey

cs.CV · 2024-09-12 · unverdicted · novelty 7.0

This survey provides the first comprehensive overview of deep multimodal learning methods designed to remain robust when some input modalities are absent.

McNdroid: A Longitudinal Multimodal Benchmark for Robust Drift Detection in Android Malware

cs.CR · 2026-05-07 · unverdicted · novelty 7.0

McNdroid is a new longitudinal multimodal benchmark showing that Android malware detectors degrade over time but multimodal approaches maintain better performance across long temporal gaps.

Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

A large-scale benchmark finds that recent multimodal domain generalization methods give only marginal gains over a plain ERM baseline, with no method winning consistently and all degrading sharply under corruption or missing modalities.

EmoTrans: A Benchmark for Understanding, Reasoning, and Predicting Emotion Transitions in Multimodal LLMs

cs.CV · 2026-04-25 · unverdicted · novelty 7.0

EmoTrans is a new video benchmark with four progressive tasks that measures how well current multimodal LLMs handle dynamic emotion transitions rather than static recognition.

SynIB: Informational Bottleneck for Maximizing Synergy in Multimodal Learning

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

SynIB is an information-theoretic objective that adds a penalty for unimodal confidence to standard task loss, improving accuracy on synergy-dependent examples by up to 7.8% across synthetic XOR tasks and five real-world multimodal benchmarks.

Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

cs.AI · 2026-03-02 · unverdicted · novelty 6.0

Nano-EmoX is a compact 2.2B multimodal model that unifies six core affective tasks across perception, understanding, and interaction levels via a curriculum framework, achieving competitive benchmark performance.

Fusion or Confusion? Multimodal Complexity Is Not All You Need

cs.LG · 2025-12-28 · unverdicted · novelty 6.0

Complex multimodal architectures do not reliably outperform unimodal baselines or a simple multimodal baseline under standardized evaluation.

The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment

cs.CV · 2025-11-26 · unverdicted · novelty 6.0

Contrastive Fusion (ConFu) adds a fused-modality contrastive term to jointly align individual modalities and their combinations, enabling capture of higher-order dependencies like XOR relations while preserving pairwise alignments.

Enhance-then-Balance Modality Collaboration for Robust Multimodal Sentiment Analysis

cs.CL · 2026-04-14 · unverdicted · novelty 6.0

EBMC framework enhances weaker modalities via semantic disentanglement and cross-modal boosting, then balances them with energy-guided coordination and instance-aware trust distillation for improved MSA performance and missing-modality robustness.

MULTIBENCH++: A Unified and Comprehensive Multimodal Fusion Benchmarking Across Specialized Domains

cs.LG · 2025-11-09 · unverdicted · novelty 5.0

MULTIBENCH++ is a new large-scale benchmark integrating over 30 datasets across 15 modalities and 20 tasks, accompanied by an open-source automated evaluation pipeline that establishes new performance baselines for multimodal fusion.

Mitigating Multimodal Inconsistency via Cognitive Dual-Pathway Reasoning for Intent Recognition

cs.MM · 2026-05-10 · unverdicted · novelty 5.0

CDPR uses an intuition pathway for cross-modal consensus and a reasoning pathway for quantifying and mitigating inconsistencies to improve multimodal intent recognition.

Controlling Decision Drift in Multimodal Sentiment Analysis with Missing Modalities

cs.CV · 2026-05-16 · unverdicted · novelty 4.0

A two-level reference alignment framework uses complete-modality samples and prototype voting to reduce decision drift and improve robustness in multimodal sentiment analysis under missing modalities.

Modality-Aware Contrastive and Uncertainty-Regularized Emotion Recognition

cs.MM · 2026-05-07 · unverdicted · novelty 4.0

MCUR improves multimodal emotion recognition across heterogeneous modality setups by combining modality-combination contrastive learning with sample-wise uncertainty regularization, yielding F1 gains of 2.2-4.37% on MOSI, MOSEI, and IEMOCAP.

A Conflict-Aware Penalty and Statistical Loss Framework for Balancing Modalities and Enhancing Stability in Multimodal Sentiment Analysis

cs.AI · 2026-05-27 · unverdicted · novelty 3.0

Introduces CP and SL to balance modalities and stabilize training in MSA, reporting SOTA results on CMU-MOSI with component ablations.

Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding

cs.CV · 2025-08-28 · unverdicted · novelty 3.0

A literature survey on abstract concept recognition in videos that catalogs prior tasks and datasets while advocating for foundation models and reuse of decades of community experience.

Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis

cs.IR · 2019-07-15 · unverdicted · novelty 3.0

One-step DCCA fusing BERT text with audio and video embeddings outperforms prior multi-modal methods for sentiment classification on two benchmarks and a new Debate Emotion dataset.

citing papers explorer

Showing 17 of 17 citing papers.

Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate cs.LG · 2025-05-26 · unverdicted · none · ref 36 · internal anchor
ConfSMoE adds expert-opinion imputation and detaches softmax routing scores to ground-truth task confidence to relieve expert collapse in SMoE without extra load-balance losses, evaluated on four real-world datasets.
Deep Multimodal Learning with Missing Modality: A Survey cs.CV · 2024-09-12 · unverdicted · none · ref 75 · internal anchor
This survey provides the first comprehensive overview of deep multimodal learning methods designed to remain robust when some input modalities are absent.
McNdroid: A Longitudinal Multimodal Benchmark for Robust Drift Detection in Android Malware cs.CR · 2026-05-07 · unverdicted · none · ref 87
McNdroid is a new longitudinal multimodal benchmark showing that Android malware detectors degrade over time but multimodal approaches maintain better performance across long temporal gaps.
Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study cs.CV · 2026-05-07 · unverdicted · none · ref 47
A large-scale benchmark finds that recent multimodal domain generalization methods give only marginal gains over a plain ERM baseline, with no method winning consistently and all degrading sharply under corruption or missing modalities.
EmoTrans: A Benchmark for Understanding, Reasoning, and Predicting Emotion Transitions in Multimodal LLMs cs.CV · 2026-04-25 · unverdicted · none · ref 38
EmoTrans is a new video benchmark with four progressive tasks that measures how well current multimodal LLMs handle dynamic emotion transitions rather than static recognition.
SynIB: Informational Bottleneck for Maximizing Synergy in Multimodal Learning cs.LG · 2026-05-12 · unverdicted · none · ref 98 · internal anchor
SynIB is an information-theoretic objective that adds a penalty for unimodal confidence to standard task loss, improving accuracy on synergy-dependent examples by up to 7.8% across synthetic XOR tasks and five real-world multimodal benchmarks.
Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy cs.AI · 2026-03-02 · unverdicted · none · ref 66 · internal anchor
Nano-EmoX is a compact 2.2B multimodal model that unifies six core affective tasks across perception, understanding, and interaction levels via a curriculum framework, achieving competitive benchmark performance.
Fusion or Confusion? Multimodal Complexity Is Not All You Need cs.LG · 2025-12-28 · unverdicted · none · ref 55 · internal anchor
Complex multimodal architectures do not reliably outperform unimodal baselines or a simple multimodal baseline under standardized evaluation.
The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment cs.CV · 2025-11-26 · unverdicted · none · ref 45 · internal anchor
Contrastive Fusion (ConFu) adds a fused-modality contrastive term to jointly align individual modalities and their combinations, enabling capture of higher-order dependencies like XOR relations while preserving pairwise alignments.
Enhance-then-Balance Modality Collaboration for Robust Multimodal Sentiment Analysis cs.CL · 2026-04-14 · unverdicted · none · ref 62
EBMC framework enhances weaker modalities via semantic disentanglement and cross-modal boosting, then balances them with energy-guided coordination and instance-aware trust distillation for improved MSA performance and missing-modality robustness.
MULTIBENCH++: A Unified and Comprehensive Multimodal Fusion Benchmarking Across Specialized Domains cs.LG · 2025-11-09 · unverdicted · none · ref 65 · internal anchor
MULTIBENCH++ is a new large-scale benchmark integrating over 30 datasets across 15 modalities and 20 tasks, accompanied by an open-source automated evaluation pipeline that establishes new performance baselines for multimodal fusion.
Mitigating Multimodal Inconsistency via Cognitive Dual-Pathway Reasoning for Intent Recognition cs.MM · 2026-05-10 · unverdicted · none · ref 43
CDPR uses an intuition pathway for cross-modal consensus and a reasoning pathway for quantifying and mitigating inconsistencies to improve multimodal intent recognition.
Controlling Decision Drift in Multimodal Sentiment Analysis with Missing Modalities cs.CV · 2026-05-16 · unverdicted · none · ref 33 · internal anchor
A two-level reference alignment framework uses complete-modality samples and prototype voting to reduce decision drift and improve robustness in multimodal sentiment analysis under missing modalities.
Modality-Aware Contrastive and Uncertainty-Regularized Emotion Recognition cs.MM · 2026-05-07 · unverdicted · none · ref 36
MCUR improves multimodal emotion recognition across heterogeneous modality setups by combining modality-combination contrastive learning with sample-wise uncertainty regularization, yielding F1 gains of 2.2-4.37% on MOSI, MOSEI, and IEMOCAP.
A Conflict-Aware Penalty and Statistical Loss Framework for Balancing Modalities and Enhancing Stability in Multimodal Sentiment Analysis cs.AI · 2026-05-27 · unverdicted · none · ref 8 · internal anchor
Introduces CP and SL to balance modalities and stabilize training in MSA, reporting SOTA results on CMU-MOSI with component ablations.
Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding cs.CV · 2025-08-28 · unverdicted · none · ref 111 · internal anchor
A literature survey on abstract concept recognition in videos that catalogs prior tasks and datasets while advocating for foundation models and reuse of decades of community experience.
Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis cs.IR · 2019-07-15 · unverdicted · none · ref 16 · internal anchor
One-step DCCA fusing BERT text with audio and video embeddings outperforms prior multi-modal methods for sentiment classification on two benchmarks and a new Debate Emotion dataset.

MOSI: Multimodal Corpus of Sentiment Intensity and Subjectivity Analysis in Online Opinion Videos

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer