Text-to-speech data augmentation for low resource speech recognition

· 2022 · arXiv 2204.00291

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Improving Code-Switching ASR with Code-Mixing Guided Synthetic Speech

cs.SD · 2026-06-14 · unverdicted · novelty 6.0

A code-mixing guided preference-learning method for TTS produces synthetic data that lowers mixed error rate when fine-tuning Whisper on the SEAME Mandarin-English corpus.

Deepfake audio as a data augmentation technique for training automatic speech to text transcription models

cs.SD · 2023-09-22 · unverdicted · novelty 3.0

The authors propose and test a data augmentation framework based on deepfake audio to improve training of speech-to-text transcription models.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Improving Code-Switching ASR with Code-Mixing Guided Synthetic Speech cs.SD · 2026-06-14 · unverdicted · none · ref 18
A code-mixing guided preference-learning method for TTS produces synthetic data that lowers mixed error rate when fine-tuning Whisper on the SEAME Mandarin-English corpus.
Deepfake audio as a data augmentation technique for training automatic speech to text transcription models cs.SD · 2023-09-22 · unverdicted · none · ref 5
The authors propose and test a data augmentation framework based on deepfake audio to improve training of speech-to-text transcription models.

Text-to-speech data augmentation for low resource speech recognition

fields

years

verdicts

representative citing papers

citing papers explorer