Continuous speech synthesis using per-token latent diffusion.arXiv preprint arXiv:2410.16048,

Arnon Turetzky, Nimrod Shabtay, Slava Shechtman, Hagai Aronowitz, David Haws, Ron Hoory, Avihu Dekel · arXiv 2410.16048

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

MELD: Mel-Spectrogram-Based Speech Language Modeling with Discrete Latent Variables

eess.AS · 2026-05-28 · unverdicted · novelty 4.0

MELD jointly optimizes a discrete latent variable encoder on mel-spectrograms with an autoregressive speech LM, claiming gains over codec and mel baselines on zero-shot TTS/STT plus fewer autoregressive artifacts.

citing papers explorer

Showing 1 of 1 citing paper after filters.

MELD: Mel-Spectrogram-Based Speech Language Modeling with Discrete Latent Variables eess.AS · 2026-05-28 · unverdicted · none · ref 6
MELD jointly optimizes a discrete latent variable encoder on mel-spectrograms with an autoregressive speech LM, claiming gains over codec and mel baselines on zero-shot TTS/STT plus fewer autoregressive artifacts.

Continuous speech synthesis using per-token latent diffusion.arXiv preprint arXiv:2410.16048,

fields

years

verdicts

representative citing papers

citing papers explorer