DisCo-Speech: Controllable Zero-Shot Speech Generation with A Disentangled Speech Codec

2025 · arXiv 2512.13251

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

AugCodec: A Low-Bitrate Disentangled Neural Speech Codec via Data Augmentation

cs.SD · 2026-06-20 · unverdicted · novelty 5.0

AugCodec disentangles speech into semantic, speaker, and prosody tokens via tailored data augmentations, achieving 12.5 Hz operation with three streams and outperforming prior codecs on LibriSpeech reconstruction and disentanglement metrics.

citing papers explorer

Showing 1 of 1 citing paper after filters.

AugCodec: A Low-Bitrate Disentangled Neural Speech Codec via Data Augmentation

cs.SD · 2026-06-20 · unverdicted · none · ref 17
