pith. sign in

arxiv: 1711.04598 · v1 · pith:STF3GDCNnew · submitted 2017-11-13 · 💻 cs.CV

Convolutional neural networks pretrained on large face recognition datasets for emotion classification from video

classification 💻 cs.CV
keywords recognitionemotionfacenetworksaccuracyclassificationconvolutionaldatasets
0
0 comments X
read the original abstract

In this paper we describe a solution to our entry for the emotion recognition challenge EmotiW 2017. We propose an ensemble of several models, which capture spatial and audio features from videos. Spatial features are captured by convolutional neural networks, pretrained on large face recognition datasets. We show that usage of strong industry-level face recognition networks increases the accuracy of emotion recognition. Using our ensemble we improve on the previous best result on the test set by about 1 %, achieving a 60.03 % classification accuracy without any use of visual temporal information.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.