Activation steering with Gram-Schmidt orthogonalization enables disentangled, deterministic control of pitch and duration attributes in the Multitrack Music Transformer without retraining.
Genre Controlled Music Generation via Activation Steering
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Computational Music Generation is evolving towards non-conventional styles, demanding methods that enable precise and controllable blending of diverse music elements. In this work, we present a method for fine grained control using inference-time interventions on an autoregressive generative transformer, MusicGen. Through our approach, we achieve genre control by steering the residual stream using weights of a linear probe on it. By framing activation steering as a human-controllable interaction, our work highlights how interpretable model behaviors can empower in co-creative music generation.Audio samples demonstrating our method are available on our demo page.
fields
cs.SD 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation
Activation steering with Gram-Schmidt orthogonalization enables disentangled, deterministic control of pitch and duration attributes in the Multitrack Music Transformer without retraining.