pith. sign in

arxiv: 1612.05050 · v1 · pith:JFME4PDFnew · submitted 2016-12-15 · 💻 cs.LG · cs.CV

Towards Score Following in Sheet Music Images

classification 💻 cs.LG cs.CV
keywords musicsheetaudiocorrespondingimageslearnsneuralnotes
0
0 comments X
read the original abstract

This paper addresses the matching of short music audio snippets to the corresponding pixel location in images of sheet music. A system is presented that simultaneously learns to read notes, listens to music and matches the currently played music to its corresponding notes in the sheet. It consists of an end-to-end multi-modal convolutional neural network that takes as input images of sheet music and spectrograms of the respective audio snippets. It learns to predict, for a given unseen audio snippet (covering approximately one bar of music), the corresponding position in the respective score line. Our results suggest that with the use of (deep) neural networks -- which have proven to be powerful image processing models -- working with sheet music becomes feasible and a promising future research direction.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.