MSR-Codec: A low-bitrate multi-stream residual codec for high-fidelity speech generation with information disentanglement

2025 · arXiv 2509.13068

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

One-Step Token-to-Waveform Generation with MeanFlow in Latent Space

eess.AS · 2026-06-16 · unverdicted · novelty 7.0

MeanFlow applied in latent space enables true one-step Token2Wav generation with up to 17x RTF improvement and negligible quality loss versus multi-step baselines.

SDP-Codec: A Speaker-Decoupled Speech Codec with Pitch Injection for Low-Bitrate Coding and Zero-Shot Voice Conversion

cs.SD · 2026-06-19 · unverdicted · novelty 5.0

SDP-Codec decouples speaker attributes from content and prosody via pitch injection in a single-stage pipeline, delivering competitive reconstruction, strong zero-shot voice conversion, and the lowest speaker-probing accuracy at comparable bitrates.

citing papers explorer

Showing 1 of 1 citing paper after filters.

SDP-Codec: A Speaker-Decoupled Speech Codec with Pitch Injection for Low-Bitrate Coding and Zero-Shot Voice Conversion

cs.SD · 2026-06-19 · unverdicted · none · ref 37
