Muse: Towards reproducible long- form song generation with fine-grained style control,

Changhao Jiang, Jiahao Chen, Zhenghao Xiang, Zhixiong Yang, Hanchen Wang, Jiabao Zhuang, Xinmeng Che, Jiajun Sun, Hui Li, Yifei Cao, et al · 2026 · arXiv 2601.03973

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

LeVo 2: Stable and Melodious Song Generation via Hierarchical Representation Modeling and Progressive Post-Training

cs.SD · 2026-06-29 · unverdicted · novelty 5.0

LeVo 2 presents a hierarchical LLM-Diffusion model with progressive post-training stages to generate full-length songs that balance semantic planning, track-specific acoustics, and musicality.

SketchSong: Hierarchical Song Generation with Sketch Planning and Fine-Grained Multi-Track Modeling

cs.SD · 2026-06-02 · unverdicted · novelty 5.0

SketchSong uses temporal sketch planning with high-level tokens and explicit modeling of four tracks (vocals, bass, drums, other) to generate more coherent songs than baselines.

citing papers explorer

Showing 2 of 2 citing papers after filters.

LeVo 2: Stable and Melodious Song Generation via Hierarchical Representation Modeling and Progressive Post-Training cs.SD · 2026-06-29 · unverdicted · none · ref 19
LeVo 2 presents a hierarchical LLM-Diffusion model with progressive post-training stages to generate full-length songs that balance semantic planning, track-specific acoustics, and musicality.
SketchSong: Hierarchical Song Generation with Sketch Planning and Fine-Grained Multi-Track Modeling cs.SD · 2026-06-02 · unverdicted · none · ref 9
SketchSong uses temporal sketch planning with high-level tokens and explicit modeling of four tracks (vocals, bass, drums, other) to generate more coherent songs than baselines.

Muse: Towards reproducible long- form song generation with fine-grained style control,

fields

years

verdicts

representative citing papers

citing papers explorer