Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions

· 2022 · cs.CL · arXiv 2211.03524

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open full Pith review browse 4 citing papers arXiv PDF

abstract

Modern Review Helpfulness Prediction systems are dependent upon multiple modalities, typically texts and images. Unfortunately, those contemporary approaches pay scarce attention to polish representations of cross-modal relations and tend to suffer from inferior optimization. This might cause harm to model's predictions in numerous cases. To overcome the aforementioned issues, we propose Multimodal Contrastive Learning for Multimodal Review Helpfulness Prediction (MRHP) problem, concentrating on mutual information between input modalities to explicitly elaborate cross-modal relations. In addition, we introduce Adaptive Weighting scheme for our contrastive learning approach in order to increase flexibility in optimization. Lastly, we propose Multimodal Interaction module to address the unalignment nature of multimodal data, thereby assisting the model in producing more reasonable multimodal representations. Experimental results show that our method outperforms prior baselines and achieves state-of-the-art results on two publicly available benchmark datasets for MRHP problem.

representative citing papers

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

cs.CV · 2024-12-10 · unverdicted · novelty 6.0

Motion-aware contrastive learning on mask tubes improves temporal panoptic scene graph generation over pooling-based methods on video and 4D datasets.

Multi-Scale Contrastive Learning for Video Temporal Grounding

cs.CV · 2024-12-10 · unverdicted · novelty 6.0

A multi-scale and cross-scale contrastive learning framework uses intra-encoder stage features and a new sampling process to link short-range and long-range video moments for temporal grounding.

Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction

cs.CL · 2023-05-22 · unverdicted · novelty 5.0

Introduces listwise attention, listwise loss, and GBDT predictor to improve multimodal review helpfulness ranking over prior FCNN and pairwise approaches.

DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding

cs.CV · 2023-12-05 · unverdicted · novelty 4.0

DemaFormer pairs energy-based modeling with a damped-EMA Transformer to localize video moments matching language queries and reports gains over baselines on four datasets.

citing papers explorer

Showing 4 of 4 citing papers.

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation cs.CV · 2024-12-10 · unverdicted · none · ref 28 · internal anchor
Motion-aware contrastive learning on mask tubes improves temporal panoptic scene graph generation over pooling-based methods on video and 4D datasets.
Multi-Scale Contrastive Learning for Video Temporal Grounding cs.CV · 2024-12-10 · unverdicted · none · ref 43 · internal anchor
A multi-scale and cross-scale contrastive learning framework uses intra-encoder stage features and a new sampling process to link short-range and long-range video moments for temporal grounding.
Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction cs.CL · 2023-05-22 · unverdicted · none · ref 19 · internal anchor
Introduces listwise attention, listwise loss, and GBDT predictor to improve multimodal review helpfulness ranking over prior FCNN and pairwise approaches.
DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding cs.CV · 2023-12-05 · unverdicted · none · ref 29 · internal anchor
DemaFormer pairs energy-based modeling with a damped-EMA Transformer to localize video moments matching language queries and reports gains over baselines on four datasets.

Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions

fields

years

verdicts

representative citing papers

citing papers explorer