StyleTTS-ZS: Efficient high-quality zero-shot text-to-speech synthesis with distilled time-varying style diffusion

· 2025

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance

cs.SD · 2025-09-24 · unverdicted · novelty 6.0

CoMelSinger introduces a discrete token-based zero-shot SVS framework on MaskGCT with coarse-to-fine contrastive learning and an SVT module to improve melody control and reduce prosody leakage.

citing papers explorer

Showing 1 of 1 citing paper.

CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance

cs.SD · 2025-09-24 · unverdicted · none · ref 69

CoMelSinger introduces a discrete token-based zero-shot SVS framework on MaskGCT with coarse-to-fine contrastive learning and an SVT module to improve melody control and reduce prosody leakage.
