Av-deepfake1m++: A large-scale audio-visual deepfake benchmark with real-world perturbations

Zhixi Cai, Kartik Kuckreja, Shreya Ghosh, Akanksha Chuchra, Muhammad Haris Khan, Usman Tariq, Tom Gedeon, Abhinav Dhall, “Av-deepfake1m++: A large-scale audio-visual deepfake benchmark with real-world perturbations,” in Proceedings of t · 2025

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Towards multi-modal forgery representation learning for AI-generated video detection and localization

cs.CV · 2026-05-08 · unverdicted · novelty 5.0

A multi-modal model with LMM semantic, ST visual, and PS audio branches enables simultaneous detection and fine-grained temporal localization of partial AI video forgeries, outperforming prior methods.

citing papers explorer

Showing 1 of 1 citing paper.

Towards multi-modal forgery representation learning for AI-generated video detection and localization

cs.CV · 2026-05-08 · unverdicted · none · ref 19
