pith. machine review for the scientific record.

arxiv: 1805.07848 · v2 · submitted 2018-05-21 · 💻 cs.SD · cs.AI · cs.LG · stat.ML

Recognition: unknown

A Universal Music Translation Network

classification 💻 cs.SD · cs.AI · cs.LG · stat.ML
keywords: method, music, musical, dataset, domains, encoder, even, training

We present a method for translating music across musical instruments, genres, and styles. This method is based on a multi-domain wavenet autoencoder, with a shared encoder and a disentangled latent space that is trained end-to-end on waveforms. Employing a diverse training dataset and large net capacity, the domain-independent encoder allows us to translate even from musical domains that were not seen during training. The method is unsupervised and does not rely on supervision in the form of matched samples between domains or musical transcriptions. We evaluate our method on NSynth, as well as on a dataset collected from professional musicians, and achieve convincing translations, even when translating from whistling, potentially enabling the creation of instrumental music by untrained humans.

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Remix the Timbre: Diffusion-Based Style Transfer Across Polyphonic Stems

    cs.SD 2026-05 unverdicted novelty 7.0

    MixtureTT performs direct per-stem timbre transfer on polyphonic mixtures via a shared diffusion transformer, outperforming single-stem baselines on SATB choral data while eliminating cascaded separation errors.