Ultravoice: Scaling fine-grained style-controlled speech conversations for spoken dialogue models. arXiv preprint arXiv:2510.22588

Tu, W · arXiv 2510.22588

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Bridging What the Model Thinks and How It Speaks: Self-Aware Speech Language Models for Expressive Speech Generation

cs.CL · 2026-04-13 · unverdicted · novelty 6.0

SA-SLM uses a variational information bottleneck for intent-aware bridging and self-criticism for realization-aware alignment to close the semantic-acoustic gap. After training on 800 hours of data, it outperforms open-source models and approaches GPT-4o-Audio expressiveness on EchoMind.

citing papers explorer

Showing 1 of 1 citing paper.

Bridging What the Model Thinks and How It Speaks: Self-Aware Speech Language Models for Expressive Speech Generation

cs.CL · 2026-04-13 · unverdicted · none · ref 23
