pith. sign in

Speech-Driven Facial Reenactment Using Conditional Generative Adversarial Networks

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

We present a novel approach to generating photo-realistic images of a face with accurate lip sync, given an audio input. By using a recurrent neural network, we achieved mouth landmarks based on audio features. We exploited the power of conditional generative adversarial networks to produce highly-realistic face conditioned on a set of landmarks. These two networks together are capable of producing a sequence of natural faces in sync with an input audio track.

fields

cs.CV 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

citing papers explorer

Showing 1 of 1 citing paper.