pith. sign in

arxiv: 1905.11760 · v1 · pith:2WGRJK2Wnew · submitted 2019-05-28 · 💻 cs.SD · cs.LG· eess.AS

Two-level Explanations in Music Emotion Recognition

classification 💻 cs.SD cs.LGeess.AS
keywords emotionexplanationsaudiofeaturesmusicperceptualpredictionrecognition
0
0 comments X
read the original abstract

Current ML models for music emotion recognition, while generally working quite well, do not give meaningful or intuitive explanations for their predictions. In this work, we propose a 2-step procedure to arrive at spectrogram-level explanations that connect certain aspects of the audio to interpretable mid-level perceptual features, and these to the actual emotion prediction. That makes it possible to focus on specific musical reasons for a prediction (in terms of perceptual features), and to trace these back to patterns in the audio that can be interpreted visually and acoustically.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.