What makes train- ing multi-modal classification networks hard? InProceed- ings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12695–12705, 2020

Weiyao Wang, Du Tran, Matt Feiszli · 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Robust Deepfake Detection: Mitigating Spatial Attention Drift via Calibrated Complementary Ensembles

cs.CV · 2026-04-28 · unverdicted · novelty 4.0

A multi-stream ensemble using DINOv2 and CLIP backbones trained with extreme degradations achieves stable deepfake detection and fourth place in the NTIRE 2026 challenge.

citing papers explorer

Showing 1 of 1 citing paper.

Robust Deepfake Detection: Mitigating Spatial Attention Drift via Calibrated Complementary Ensembles cs.CV · 2026-04-28 · unverdicted · none · ref 35
A multi-stream ensemble using DINOv2 and CLIP backbones trained with extreme degradations achieves stable deepfake detection and fourth place in the NTIRE 2026 challenge.

What makes train- ing multi-modal classification networks hard? InProceed- ings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12695–12705, 2020

fields

years

verdicts

representative citing papers

citing papers explorer