Transformer for Emotion Recognition
classification
cs.HC, cs.LG, cs.SD, eess.AS
keywords
context, according, account, architecture, arousal, challenge, context-dependent, describes
This paper describes the UMONS solution for the OMG-Emotion Challenge. We explore a context-dependent architecture where the arousal and valence of an utterance are predicted according to its surrounding context (i.e. the preceding and following utterances of the video). We report an improvement when taking into account context for both unimodal and multimodal predictions.
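To make the idea of context-dependent prediction concrete, here is a minimal sketch of how each utterance might be grouped with its preceding and following utterances before being passed to a model. It is only an illustration under stated assumptions: the function name `context_windows`, the window size `k`, the zero padding at video boundaries, and the toy feature vectors are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def context_windows(
    features: Sequence[Sequence[float]], k: int = 1
) -> List[List[List[float]]]:
    """Group each utterance with its k preceding and k following utterances.

    `features` holds one feature vector per utterance, in video order.
    Positions that fall outside the video are filled with zero vectors
    (an assumption made for this sketch, not a detail from the paper).
    """
    if not features:
        return []
    dim = len(features[0])
    windows = []
    for i in range(len(features)):
        window = []
        for j in range(i - k, i + k + 1):
            if 0 <= j < len(features):
                window.append(list(features[j]))
            else:
                window.append([0.0] * dim)  # zero padding at the boundaries
        windows.append(window)
    return windows


# Toy video with three utterances, each described by a 2-dimensional vector.
video = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
windows = context_windows(video, k=1)
```

Each entry of `windows` could then be fed to a sequence model that predicts the arousal and valence of the centre utterance; the choice of model is left open here.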