wav2vec 2.0: A framework for self- supervised learning of speech representations

Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, Michael Auli · 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Generate Your Talking Avatar from Video Reference

cs.CV · 2026-04-30 · unverdicted · novelty 6.0

TAVR generates high-fidelity talking avatars from cross-scene video references via token selection and three-stage training (same-scene pretraining, cross-scene fine-tuning, identity RL), outperforming baselines on a new 158-pair benchmark.

citing papers explorer

Showing 1 of 1 citing paper.

Generate Your Talking Avatar from Video Reference cs.CV · 2026-04-30 · unverdicted · none · ref 1
TAVR generates high-fidelity talking avatars from cross-scene video references via token selection and three-stage training (same-scene pretraining, cross-scene fine-tuning, identity RL), outperforming baselines on a new 158-pair benchmark.

wav2vec 2.0: A framework for self- supervised learning of speech representations

fields

years

verdicts

representative citing papers

citing papers explorer