For MusicTI [6], we trained the style encoder following the au- thor’s protocol; for MusicGen [9], content audio served as the melody guide with text-based style descriptions

EXPERIMENTAL RESULTS We benchmark our model against state-of-the-art baselines by retraining their official codes with default configurations · arXiv 5350.1570

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-spectrograms

cs.SD · 2024-11-24 · conditional · novelty 7.0

Stylus achieves training-free music style transfer on Mel-spectrograms by repurposing image diffusion models via style-key injection in self-attention plus phase-preserving reconstruction, outperforming baselines by 34.1% in content preservation and 25.7% in perceptual quality per 2,925 human raters

citing papers explorer

Showing 1 of 1 citing paper.

Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-spectrograms cs.SD · 2024-11-24 · conditional · none · ref 4
Stylus achieves training-free music style transfer on Mel-spectrograms by repurposing image diffusion models via style-key injection in self-attention plus phase-preserving reconstruction, outperforming baselines by 34.1% in content preservation and 25.7% in perceptual quality per 2,925 human raters

For MusicTI [6], we trained the style encoder following the au- thor’s protocol; for MusicGen [9], content audio served as the melody guide with text-based style descriptions

fields

years

verdicts

representative citing papers

citing papers explorer