Controllable music production with diffusion models and guidance gradients

2 Pith papers cite this work. Polarity classification is still indexing.

fields: cs.SD (2)

years: 2026 (2)

verdicts: unverdicted (2)

representative citing papers

Latent Fourier Transform

cs.SD · 2026-04-20 · unverdicted · novelty 7.0

LatentFT uses latent-space Fourier transforms and frequency masking in diffusion autoencoders to enable timescale-specific manipulation of musical structure in generative models.

Break-the-Beat! Controllable MIDI-to-Drum Audio Synthesis

cs.SD · 2026-05-14 · unverdicted · novelty 6.0

Break-the-Beat! renders drum MIDI as audio that matches the timbre of a reference clip by fine-tuning a text-to-audio model with a content encoder and hybrid conditioning on a new paired dataset.

citing papers explorer

  • Latent Fourier Transform · cs.SD · 2026-04-20 · unverdicted · none · ref 25

  • Break-the-Beat! Controllable MIDI-to-Drum Audio Synthesis · cs.SD · 2026-05-14 · unverdicted · none · ref 24
