Musicflow: Cascaded flow matching for text guided music generation.arXiv preprint arXiv:2410.20478,

KR Prajwal, Bowen Shi, Matthew Lee, Apoorv Vyas, Andros Tjandra, Mahi Luthra, Baishan Guo, Huiyu Wang, Triantafyllos Afouras, David Kant, et al · 2024 · arXiv 2410.20478

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions

cs.SD · 2026-06-01 · unverdicted · novelty 6.0

JenBridge pretrains a flow-matching Transformer on text-audio data then adapts it with video conditioning and an LLM director to select transitions, claiming better coherence than prior methods on a new LVS benchmark.

SketchSong: Hierarchical Song Generation with Sketch Planning and Fine-Grained Multi-Track Modeling

cs.SD · 2026-06-02 · unverdicted · novelty 5.0

SketchSong uses temporal sketch planning with high-level tokens and explicit modeling of four tracks (vocals, bass, drums, other) to generate more coherent songs than baselines.

citing papers explorer

Showing 2 of 2 citing papers.

JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions cs.SD · 2026-06-01 · unverdicted · none · ref 18
JenBridge pretrains a flow-matching Transformer on text-audio data then adapts it with video conditioning and an LLM director to select transitions, claiming better coherence than prior methods on a new LVS benchmark.
SketchSong: Hierarchical Song Generation with Sketch Planning and Fine-Grained Multi-Track Modeling cs.SD · 2026-06-02 · unverdicted · none · ref 20
SketchSong uses temporal sketch planning with high-level tokens and explicit modeling of four tracks (vocals, bass, drums, other) to generate more coherent songs than baselines.

Musicflow: Cascaded flow matching for text guided music generation.arXiv preprint arXiv:2410.20478,

fields

years

verdicts

representative citing papers

citing papers explorer