AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge

2016 · cs.CV · arXiv 1605.01600

2 Pith papers cite this work.


abstract

The Audio/Visual Emotion Challenge and Workshop (AVEC 2016) "Depression, Mood and Emotion" will be the sixth competition event aimed at comparison of multimedia processing and machine learning methods for automatic audio, visual and physiological depression and emotion analysis, with all participants competing under strictly the same conditions. The goal of the Challenge is to provide a common benchmark test set for multi-modal information processing and to bring together the depression and emotion recognition communities, as well as the audio, video and physiological processing communities, to compare the relative merits of the various approaches to depression and emotion recognition under well-defined and strictly comparable conditions and establish to what extent fusion of the approaches is possible and beneficial. This paper presents the challenge guidelines, the common data used, and the performance of the baseline system on the two tasks.

representative citing papers

Bag-of-Audio-Words based on Autoencoder Codebook for Continuous Emotion Prediction

eess.AS · 2019-07-06 · unverdicted · novelty 6.0

Autoencoder-based codebook for Bag-of-Audio-Words raises CCC for arousal from 0.225 to 0.322 and valence from 0.244 to 0.368 on AVEC 2017 audio data versus standard BoW.

Emotion Recognition Using Fusion of Audio and Video Features

cs.LG · 2019-06-25 · unverdicted · novelty 4.0

Feature-level or decision-level fusion of CNN video features and audio descriptors via SVR achieves CCC 0.749 (arousal) and 0.565 (valence) on RECOLA after preprocessing and post-processing.
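
Both summaries above report results as CCC, which in the AVEC challenges conventionally denotes the concordance correlation coefficient. This page does not define the metric, so the following is only a minimal sketch assuming Lin's standard definition, CCC = 2·cov(x, y) / (var(x) + var(y) + (mean(x) − mean(y))²), with population (divide-by-n) statistics. The function name `concordance_cc` is hypothetical and is not taken from either citing paper.

```python
def concordance_cc(pred, gold):
    """Concordance correlation coefficient of two equal-length sequences.

    Assumes Lin's definition with population (divide-by-n) statistics.
    """
    n = len(pred)
    if n != len(gold) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")

    mean_p = sum(pred) / n
    mean_g = sum(gold) / n

    # Population variances and covariance.
    var_p = sum((p - mean_p) ** 2 for p in pred) / n
    var_g = sum((g - mean_g) ** 2 for g in gold) / n
    cov = sum((p - mean_p) * (g - mean_g) for p, g in zip(pred, gold)) / n

    # The squared mean difference penalises a constant offset,
    # which plain Pearson correlation would ignore.
    denom = var_p + var_g + (mean_p - mean_g) ** 2
    if denom == 0:
        raise ValueError("CCC is undefined for two identical constant sequences")
    return 2 * cov / denom


if __name__ == "__main__":
    # Perfectly correlated but shifted by 1: Pearson would give 1.0, CCC is lower.
    print(concordance_cc([1, 2, 3], [2, 3, 4]))
```

Unlike Pearson correlation, this measure drops below 1 when predictions are correlated with the gold labels but biased in mean or scale, which is the usual reason given for preferring it in continuous arousal and valence prediction.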

citing papers explorer

Bag-of-Audio-Words based on Autoencoder Codebook for Continuous Emotion Prediction

eess.AS · 2019-07-06 · unverdicted · none · ref 19 · internal anchor

Emotion Recognition Using Fusion of Audio and Video Features

cs.LG · 2019-06-25 · unverdicted · none · ref 17 · internal anchor
