SongEval: A benchmark dataset for song aesthetics evaluation

2025 · arXiv 2505.10793

6 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

Polyphonia: Zero-Shot Timbre Transfer in Polyphonic Music with Acoustic-Informed Attention Calibration

cs.SD · 2026-05-11 · unverdicted · novelty 7.0

Polyphonia improves target alignment in zero-shot, stem-specific timbre transfer for polyphonic music by 15.5% through acoustic-informed attention calibration, which uses probabilistic priors to set coarse boundaries.

MIDI-Informed Singing Accompaniment Generation in a Compositional Song Pipeline

cs.SD · 2026-02-24 · unverdicted · novelty 7.0

MIDI-SAG generates consistent long-form singing accompaniments by feeding symbolic MIDI timing, chords, and structure labels into a compositional pipeline built from pre-trained modules.

APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music

cs.SD · 2026-05-05 · unverdicted · novelty 6.0

APEX jointly predicts engagement-based popularity and five aesthetic quality dimensions for AI-generated music, improving human preference prediction on out-of-distribution generative systems.

SongBench: A Fine-Grained Multi-Aspect Benchmark for Song Quality Assessment

eess.AS · 2026-04-16 · unverdicted · novelty 6.0

SongBench is a new fine-grained benchmark for song quality assessment with seven dimensions and an expert-annotated dataset of 11,717 samples showing high correlation with professional ratings.

LaDA-Band: Language Diffusion Models for Vocal-to-Accompaniment Generation

cs.SD · 2026-04-13 · unverdicted · novelty 6.0

LaDA-Band applies discrete masked diffusion with dual-track conditioning and progressive training to generate accompaniment tracks from vocals, improving acoustic authenticity, global coherence, and dynamic orchestration over prior baselines.

Zero-Effort Image-to-Music Generation: An Interpretable RAG-based VLM Approach

cs.SD · 2025-09-26 · unverdicted · novelty 6.0

A zero-training VLM framework generates music from images via ABC notation, multi-modal RAG, and self-refinement while providing text and visual explanations for the outputs.

citing papers explorer


Polyphonia: Zero-Shot Timbre Transfer in Polyphonic Music with Acoustic-Informed Attention Calibration · cs.SD · 2026-05-11 · unverdicted · none · ref 77
MIDI-Informed Singing Accompaniment Generation in a Compositional Song Pipeline · cs.SD · 2026-02-24 · unverdicted · none · ref 56
APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music · cs.SD · 2026-05-05 · unverdicted · none · ref 10
SongBench: A Fine-Grained Multi-Aspect Benchmark for Song Quality Assessment · eess.AS · 2026-04-16 · unverdicted · none · ref 20
LaDA-Band: Language Diffusion Models for Vocal-to-Accompaniment Generation · cs.SD · 2026-04-13 · unverdicted · none · ref 55
Zero-Effort Image-to-Music Generation: An Interpretable RAG-based VLM Approach · cs.SD · 2025-09-26 · unverdicted · none · ref 25
