
arxiv: 1904.08842 · v1 · pith:ANWGNG3Jnew · submitted 2019-04-18 · 💻 cs.SD · cs.HC · cs.IR · cs.LG · eess.AS

Inspecting and Interacting with Meaningful Music Representations using VAE

classification: 💻 cs.SD, cs.HC, cs.IR, cs.LG, eess.AS
keywords: music, representations, generation, latent, pitch, meaningful, process, rhythm

Variational Autoencoders (VAEs) have already achieved great results on image generation and have recently made promising progress on music generation. However, the generation process is still quite difficult to control, in the sense that the learned latent representations lack meaningful music semantics. It would be much more useful if people could modify certain music features, such as rhythm and pitch contour, via latent representations to test different composition ideas. In this paper, we propose a new method, which we name disentanglement by augmentation, to inspect the pitch and rhythm interpretations of the latent representations. Based on the interpretable representations, an intuitive graphical user interface is designed for users to better direct the music creation process by manipulating the pitch contours and rhythmic complexity.
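The abstract does not spell out how the latent dimensions are inspected. As a rough illustration only, and not the paper's actual procedure, one plausible reading of "disentanglement by augmentation" is to apply a controlled augmentation (here, a pitch transposition) to each melody and measure which latent dimensions move. Everything in the sketch below is an assumption made for illustration: `latent_sensitivity`, `toy_encode`, and `transpose` are hypothetical names, and the toy encoder is a hand-written stand-in for a trained VAE encoder.

```python
import numpy as np

def latent_sensitivity(encode, melodies, augment):
    """Mean absolute change of each latent dimension when `augment` is applied."""
    z_orig = np.stack([encode(m) for m in melodies])
    z_aug = np.stack([encode(augment(m)) for m in melodies])
    return np.abs(z_aug - z_orig).mean(axis=0)

# Hypothetical stand-in for a trained VAE encoder (assumption, not the paper's model):
# dimension 0 tracks mean pitch, dimension 1 tracks note-onset density.
def toy_encode(melody):
    pitches, onsets = melody
    return np.array([pitches.mean(), onsets.mean()])

# Pitch augmentation: shift every note by a fixed number of semitones,
# leaving the rhythm (onset pattern) untouched.
def transpose(melody, semitones=2):
    pitches, onsets = melody
    return pitches + semitones, onsets

rng = np.random.default_rng(0)
melodies = [
    (rng.integers(60, 72, size=16).astype(float),  # MIDI pitches
     rng.integers(0, 2, size=16).astype(float))    # 1 = note onset, 0 = hold/rest
    for _ in range(8)
]

# Dimensions with large values respond to pitch; near-zero ones do not.
pitch_sens = latent_sensitivity(toy_encode, melodies, transpose)
pitch_dims = np.argsort(pitch_sens)[::-1]
```

With a real model, `encode` would return the posterior mean of the trained encoder, and the same measurement could be repeated with a rhythm augmentation to see which dimensions respond to rhythm rather than pitch.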



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. MIDI-Sandwich: Multi-model Multi-task Hierarchical Conditional VAE-GAN networks for Symbolic Single-track Music Generation

    eess.AS 2019-07 unverdicted novelty 4.0

    MIDI-Sandwich is a hierarchical VAE-GAN architecture that generates structured 136-beat melodies by modeling local bars and global relationships on the Nottingham dataset.