pith. sign in

arxiv: 1805.04136 · v1 · pith:JTYWXGATnew · submitted 2018-05-10 · 💻 cs.CV

Unsupervised Deep Representations for Learning Audience Facial Behaviors

classification 💻 cs.CV
keywords audiencebehaviorsdeepfacialgenerativelatentlearningnetwork
0
0 comments X p. Extension
pith:JTYWXGAT Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{JTYWXGAT}

Prints a linked pith:JTYWXGAT badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

In this paper, we present an unsupervised learning approach for analyzing facial behavior based on a deep generative model combined with a convolutional neural network (CNN). We jointly train a variational auto-encoder (VAE) and a generative adversarial network (GAN) to learn a powerful latent representation from footage of audiences viewing feature-length movies. We show that the learned latent representation successfully encodes meaningful signatures of behaviors related to audience engagement (smiling & laughing) and disengagement (yawning). Our results provide a proof of concept for a more general methodology for annotating hard-to-label multimedia data featuring sparse examples of signals of interest.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.