Towards building text-to-speech systems for the next billion users

· 2023

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

PSP: An Interpretable Per-Dimension Accent Benchmark for Indic Text-to-Speech

cs.SD · 2026-04-28 · unverdicted · novelty 7.0

PSP decomposes TTS accent into retroflex collapse rate, aspiration fidelity, vowel-length fidelity, Tamil-zha fidelity, FAD, and prosodic signature divergence, revealing that commercial systems vary in accent fidelity beyond WER scores.

Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost

cs.SD · 2026-04-28 · unverdicted · novelty 6.0

A combination of phoneme romanization, targeted LoRA adaptation, and voice-prompt recovery enables commercial-class Indic TTS from a non-Indic base without acoustic retraining or commercial data.

citing papers explorer

Showing 2 of 2 citing papers.

PSP: An Interpretable Per-Dimension Accent Benchmark for Indic Text-to-Speech cs.SD · 2026-04-28 · unverdicted · none · ref 18
PSP decomposes TTS accent into retroflex collapse rate, aspiration fidelity, vowel-length fidelity, Tamil-zha fidelity, FAD, and prosodic signature divergence, revealing that commercial systems vary in accent fidelity beyond WER scores.
Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost cs.SD · 2026-04-28 · unverdicted · none · ref 15
A combination of phoneme romanization, targeted LoRA adaptation, and voice-prompt recovery enables commercial-class Indic TTS from a non-Indic base without acoustic retraining or commercial data.

Towards building text-to-speech systems for the next billion users

fields

years

verdicts

representative citing papers

citing papers explorer