MakeSinger: A semi-supervised training method for data-efficient singing voice synthesis via classifier-free diffusion guidance.arXiv preprint arXiv:2406.05965, 2024

Semin Kim, Myeonghun Jeong, Hyeonseung Lee, Minchan Kim, Byoung Jin Choi, Nam Soo Kim · 2024 · arXiv 2406.05965

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

UniVoice: A Unified Model for Speech and Singing Voice Generation

cs.SD · 2026-06-04 · unverdicted · novelty 5.0

UniVoice is a conditional flow matching model with a Diffusion Transformer backbone that unifies TTS and SVS via modality-specific encoders and a null melody token for speech, achieving 5.26% speech PER and 16.22% singing PER.

citing papers explorer

Showing 1 of 1 citing paper after filters.

UniVoice: A Unified Model for Speech and Singing Voice Generation cs.SD · 2026-06-04 · unverdicted · none · ref 16
UniVoice is a conditional flow matching model with a Diffusion Transformer backbone that unifies TTS and SVS via modality-specific encoders and a null melody token for speech, achieving 5.26% speech PER and 16.22% singing PER.

MakeSinger: A semi-supervised training method for data-efficient singing voice synthesis via classifier-free diffusion guidance.arXiv preprint arXiv:2406.05965, 2024

fields

years

verdicts

representative citing papers

citing papers explorer