TuneJury: An open metric for improving music generation preference alignment
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.SD (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Improving Text-to-Music Generation with Human Preference Rewards
A text-to-music model is improved by conditioning on and selecting with a human preference reward, where expert iteration on top outputs contributes the largest measured gains on 100 Song Describer prompts.