MakeSinger: A semi-supervised training method for data-efficient singing voice synthesis via classifier-free diffusion guidance. arXiv preprint arXiv:2406.05965, 2024

1 Pith paper cites this work. Polarity classification is still being indexed.

1 Pith paper citing it

fields: cs.SD (1)

years: 2026 (1)

verdicts: unverdicted (1)


representative citing papers

UniVoice: A Unified Model for Speech and Singing Voice Generation

cs.SD · 2026-06-04 · unverdicted · novelty 5.0

UniVoice is a conditional flow matching model with a Diffusion Transformer backbone that unifies TTS and SVS via modality-specific encoders and a null melody token for speech, achieving 5.26% speech PER and 16.22% singing PER.

citing papers explorer


  • UniVoice: A Unified Model for Speech and Singing Voice Generation cs.SD · 2026-06-04 · unverdicted · none · ref 16
