Globe: A high-quality English corpus with global accents for zero-shot speaker adaptive text-to-speech

2024 · arXiv 2406.14875

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

RIVET: Robust Idempotent Voice Attribute Editing

cs.SD · 2026-06-17 · unverdicted · novelty 4.0

RIVET enforces an idempotency objective during training of voice attribute editing models to improve robustness to noisy labels, outperforming standard training on controlled noise and the GLOBE dataset.

SPARCLE: SPeaker-aware Aligned Representations via Contrastive Language Embeddings

cs.CL · 2026-05-01 · unverdicted · novelty 4.0

SPARCLE builds speaker-aware grapheme representations by contrastively aligning characters with Wav2Vec2 acoustic embeddings conditioned on speaker identity, replacing G2P for TTS and halving WER in low-resource cases.

citing papers explorer

SPARCLE: SPeaker-aware Aligned Representations via Contrastive Language Embeddings cs.CL · 2026-05-01 · unverdicted · none · ref 32
