SAGA: Source Attribution of Generative AI Videos

· 2025 · cs.CV · arXiv 2511.12834

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

The proliferation of generative AI has led to hyper-realistic synthetic videos, escalating misuse risks and outstripping binary real/fake detectors. We introduce SAGA (Source Attribution of Generative AI videos), the first comprehensive framework to address the urgent need for AI-generated video source attribution at a large scale. Unlike traditional detection, SAGA identifies the specific generative model used. It uniquely provides multi-granular attribution across five levels: authenticity, generation task (e.g., T2V/I2V), model version, development team, and the precise generator, offering far richer forensic insights. Our novel video transformer architecture, leveraging features from a robust vision foundation model, effectively captures spatio-temporal artifacts. Critically, we introduce a data-efficient pretrain-and-attribute strategy, enabling SAGA to achieve state-of-the-art attribution using only 0.5\% of source-labeled data per class, matching fully supervised performance. Furthermore, we propose Temporal Attention Signatures (T-Sigs), a novel interpretability method that visualizes learned temporal differences, offering the first explanation for why different video generators are distinguishable. Extensive experiments on public datasets, including cross-domain scenarios, demonstrate that SAGA sets a new benchmark for synthetic video provenance, providing crucial, interpretable insights for forensic and regulatory applications.

representative citing papers

Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models

cs.CV · 2026-05-18 · unverdicted · novelty 7.0

Introduces the first passive source attribution benchmark for 22 generative 3D models and a Transformer achieving 97.22% accuracy under full supervision and 77.17% with 1% training data.

Video as Natural Augmentation: Towards Unified AI-Generated Image and Video Detection

cs.CV · 2026-05-21 · unverdicted · novelty 5.0

VINA trains a single detector on images plus video frames using a cross-modal supervised contrastive objective, yielding bidirectional gains and SOTA results on 14 image, video, and in-the-wild benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models cs.CV · 2026-05-18 · unverdicted · none · ref 18 · internal anchor
Introduces the first passive source attribution benchmark for 22 generative 3D models and a Transformer achieving 97.22% accuracy under full supervision and 77.17% with 1% training data.
Video as Natural Augmentation: Towards Unified AI-Generated Image and Video Detection cs.CV · 2026-05-21 · unverdicted · none · ref 57 · internal anchor
VINA trains a single detector on images plus video frames using a cross-modal supervised contrastive objective, yielding bidirectional gains and SOTA results on 14 image, video, and in-the-wild benchmarks.

SAGA: Source Attribution of Generative AI Videos

fields

years

verdicts

representative citing papers

citing papers explorer