Libritts-p: A corpus with speaking style and speaker identity prompts for text-to-speech and style captioning,

· 2024 · arXiv 2406.07969

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

FineCombo-TTS: Collaborative and Precise Controllable Speech Synthesis Using Text Descriptions and Reference Speech

cs.SD · 2026-06-17 · unverdicted · novelty 6.0

FineCombo-TTS learns a unified acoustic representation with a CFM-based Speech Variance Predictor for flexible precise TTS control from reference audio and text descriptions, supported by the new FineEdit paired dataset.

citing papers explorer

Showing 1 of 1 citing paper.

FineCombo-TTS: Collaborative and Precise Controllable Speech Synthesis Using Text Descriptions and Reference Speech cs.SD · 2026-06-17 · unverdicted · none · ref 33
FineCombo-TTS learns a unified acoustic representation with a CFM-based Speech Variance Predictor for flexible precise TTS control from reference audio and text descriptions, supported by the new FineEdit paired dataset.

Libritts-p: A corpus with speaking style and speaker identity prompts for text-to-speech and style captioning,

fields

years

verdicts

representative citing papers

citing papers explorer