Songtrans: An unified song transcription and alignment method for lyrics and notes

· 2024 · arXiv 2409.14619

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

VocalParse: Towards Unified and Scalable Singing Voice Transcription with Large Audio Language Models

cs.SD · 2026-05-06 · unverdicted · novelty 6.0

VocalParse applies interleaved and Chain-of-Thought prompting to a Large Audio Language Model to jointly transcribe lyrics, melody and word-note alignments, achieving state-of-the-art results on multiple singing datasets.

Listening Like a Judge: A Music-Aware Framework for Automatic Singing Performance Evaluation

cs.SD · 2026-06-24 · unverdicted · novelty 5.0

MusicJudge is a modality-guided framework that performs block-aligned multimodal analysis for singing quality assessment by coupling lyrics with pitch-rhythm fidelity via multi-signal matching and Modality-Guided LoRA fine-tuning.

citing papers explorer

Showing 2 of 2 citing papers.

VocalParse: Towards Unified and Scalable Singing Voice Transcription with Large Audio Language Models cs.SD · 2026-05-06 · unverdicted · none · ref 38
VocalParse applies interleaved and Chain-of-Thought prompting to a Large Audio Language Model to jointly transcribe lyrics, melody and word-note alignments, achieving state-of-the-art results on multiple singing datasets.
Listening Like a Judge: A Music-Aware Framework for Automatic Singing Performance Evaluation cs.SD · 2026-06-24 · unverdicted · none · ref 13
MusicJudge is a modality-guided framework that performs block-aligned multimodal analysis for singing quality assessment by coupling lyrics with pitch-rhythm fidelity via multi-signal matching and Modality-Guided LoRA fine-tuning.