pith. sign in

arxiv: 1611.09827 · v2 · pith:SQPDOUYCnew · submitted 2016-11-29 · 📊 stat.ML · cs.LG· cs.SD

Learning Features of Music from Scratch

classification 📊 stat.ML cs.LGcs.SD
keywords learningmusicend-to-endevaluationfeaturesmachinemusicnetneural
0
0 comments X
read the original abstract

This paper introduces a new large-scale music dataset, MusicNet, to serve as a source of supervision and evaluation of machine learning methods for music research. MusicNet consists of hundreds of freely-licensed classical music recordings by 10 composers, written for 11 instruments, together with instrument/note annotations resulting in over 1 million temporal labels on 34 hours of chamber music performances under various studio and microphone conditions. The paper defines a multi-label classification task to predict notes in musical recordings, along with an evaluation protocol, and benchmarks several machine learning architectures for this task: i) learning from spectrogram features; ii) end-to-end learning with a neural net; iii) end-to-end learning with a convolutional neural net. These experiments show that end-to-end models trained for note prediction learn frequency selective filters as a low-level representation of audio.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Heterogeneity-Aware Dataset Scheduling for Efficient Audio Large Language Model Training

    cs.SD 2026-05 unverdicted novelty 5.0

    GST uses gradient-based affinity metrics to form dataset groups and applies progressive scheduling, achieving 30-40% faster convergence than uniform mixture training on 14 AudioQA datasets while matching or exceeding ...