arxiv: 1903.10839 · v1 · pith:UG6FPMDNnew · submitted 2019-03-26 · cs.SD · cs.LG · eess.AS

Musical Tempo and Key Estimation using Convolutional Neural Networks with Directional Filters

classification cs.SD cs.LG eess.AS
keywords networks architectures convolutional directional estimation filters musical neural

In this article we explore how the different semantics of spectrograms' time and frequency axes can be exploited for musical tempo and key estimation using Convolutional Neural Networks (CNNs). By addressing both tasks with the same network architectures, ranging from shallow, domain-specific approaches to deep variants with directional filters, we show that axis-aligned architectures perform about as well as common VGG-style networks developed for computer vision, while being less vulnerable to confounding factors and requiring fewer model parameters.
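
To make the parameter argument concrete, the following minimal sketch compares the parameter count of a square convolutional filter with that of directional (axis-aligned) filters. It is an illustration only, not the architecture from the paper: the helper `conv2d_params`, the channel counts, and the kernel length are all hypothetical, and the spectrogram is assumed to be laid out with frequency along rows and time along columns.

```python
def conv2d_params(in_channels, out_channels, kernel_height, kernel_width, bias=True):
    """Count the trainable parameters of a single 2D convolutional layer."""
    weights = in_channels * out_channels * kernel_height * kernel_width
    return weights + (out_channels if bias else 0)


# Hypothetical layer sizes, chosen only for illustration (not taken from the paper).
IN_CH, OUT_CH, K = 16, 16, 5

# Square K x K filter: mixes the time and frequency axes, as in VGG-style networks.
square = conv2d_params(IN_CH, OUT_CH, K, K)

# Directional filters: each one spans a single axis of the spectrogram.
# Assumed layout: rows = frequency bins, columns = time frames.
along_time = conv2d_params(IN_CH, OUT_CH, 1, K)  # 1 x K: time axis only
along_freq = conv2d_params(IN_CH, OUT_CH, K, 1)  # K x 1: frequency axis only

print("square:", square)
print("along time:", along_time)
print("along frequency:", along_freq)
```

Under these assumed sizes, a directional layer needs roughly one fifth of the weights of the square layer, because its kernel covers K cells instead of K × K. How the savings add up in a full network depends on depth and channel widths, which this sketch does not model.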
