Prosodic Features Control by Symbols as Input of Sequence-to-Sequence Acoustic Modeling for Neural TTS

Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano · 2021 · DOI 10.1587/transinf.2020edp7104

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Sarashina2.2-TTS: Tackling Kanji Polyphony in Japanese Speech Generation via Data Scaling and Targeted Data Synthesis

cs.SD · 2026-06-24 · unverdicted · novelty 7.0

Sarashina2.2-TTS achieves SOTA kanji reading accuracy via data scaling and Joyo-kanji-targeted synthesis, introduces the Joyo Kanji Yomi Benchmark and Kana-CER metric, and shows stable cross-lingual performance.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Sarashina2.2-TTS: Tackling Kanji Polyphony in Japanese Speech Generation via Data Scaling and Targeted Data Synthesis

cs.SD · 2026-06-24 · unverdicted · none · ref 17
