RAVE: A variational autoencoder for fast and high-quality neural audio synthesis

Antoine Caillon and Philippe Esling · 2021 · arXiv 2111.05011

6 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background: 1 · method: 1

citation-polarity summary

background: 1 · use method: 1

representative citing papers

Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators

cs.SD · 2026-05-21 · unverdicted · novelty 7.0

Live Music Diffusion Models adapt bidirectional diffusion for interactive music generation via KV caching and ARC-Forcing, recovering and exceeding discrete autoregressive efficiency while enabling post-training alignment without RL.

Remix the Timbre: Diffusion-Based Style Transfer Across Polyphonic Stems

cs.SD · 2026-05-10 · unverdicted · novelty 7.0

MixtureTT performs direct per-stem timbre transfer on polyphonic mixtures via a shared diffusion transformer, outperforming single-stem baselines on SATB choral data while eliminating cascaded separation errors.

Latent Fourier Transform

cs.SD · 2026-04-20 · unverdicted · novelty 7.0

LatentFT uses latent-space Fourier transforms and frequency masking in diffusion autoencoders to enable timescale-specific manipulation of musical structure in generative models.

Opening the Design Space: Two Years of Performance with Intelligent Musical Instruments

cs.SD · 2026-04-26 · conditional · novelty 6.0

A portable single-board-computer AI music platform and five case studies demonstrate that remapping inputs, interleaving fast and slow controls, small artist datasets, and cheap hardware can open new artist-centered design spaces for intelligent instruments.

Drivetrain simulation using variational autoencoders

cs.LG · 2025-01-29 · unverdicted · novelty 5.0

Variational autoencoders generate jerk signals from torque inputs in electric drivetrains and outperform physics-based baselines without detailed parametrization.

Huí Sù: Co-constructing a Dual Feedback Apparatus

cs.SD · 2026-04-28 · unverdicted · novelty 3.0

A musical performance co-produces sound through dual feedback loops between a RAVE-based neural audio instrument and a recurrent neural control system, exploring shared agency with human performers.

citing papers explorer


Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators · cs.SD · 2026-05-21 · unverdicted · none · ref 4
Remix the Timbre: Diffusion-Based Style Transfer Across Polyphonic Stems · cs.SD · 2026-05-10 · unverdicted · none · ref 35
Latent Fourier Transform · cs.SD · 2026-04-20 · unverdicted · none · ref 6
Opening the Design Space: Two Years of Performance with Intelligent Musical Instruments · cs.SD · 2026-04-26 · conditional · none · ref 6
Drivetrain simulation using variational autoencoders · cs.LG · 2025-01-29 · unverdicted · none · ref 7
Huí Sù: Co-constructing a Dual Feedback Apparatus · cs.SD · 2026-04-28 · unverdicted · none · ref 2
