arXiv:2203.11389 (2022)

Huang, et al · 2022 · arXiv 2203.11389

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing

eess.AS · 2026-04-10 · unverdicted · novelty 6.0

PS-TTS and PS-Comet TTS use isochrony via language model paraphrasing plus phonetic synchronization with DTW on vowel distances to achieve better lip-sync and semantic preservation in automated dubbing than standard TTS or voice actors on tested language pairs.

Voice Mapping of Text-to-Speech Systems: A Metric-Based Approach for Voice Quality Assessment

eess.AS · 2026-04-21 · unverdicted · novelty 3.0

Voice range indicates TTS model capability with VITS highest, Glow-TTS best at soft phonation, and CPPs of 7-8 dB marking natural quality while values over 10 dB sound robotic.

citing papers explorer

Showing 2 of 2 citing papers.

PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing eess.AS · 2026-04-10 · unverdicted · none · ref 53
PS-TTS and PS-Comet TTS use isochrony via language model paraphrasing plus phonetic synchronization with DTW on vowel distances to achieve better lip-sync and semantic preservation in automated dubbing than standard TTS or voice actors on tested language pairs.
Voice Mapping of Text-to-Speech Systems: A Metric-Based Approach for Voice Quality Assessment eess.AS · 2026-04-21 · unverdicted · none · ref 15
Voice range indicates TTS model capability with VITS highest, Glow-TTS best at soft phonation, and CPPs of 7-8 dB marking natural quality while values over 10 dB sound robotic.

arXiv:2203.11389 (2022)

fields

years

verdicts

representative citing papers

citing papers explorer