The authors propose and test a data augmentation framework based on deepfake audio to improve training of speech-to-text transcription models.
Text-to-speech data augmentation for low resource speech recognition,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.SD 1years
2023 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Deepfake audio as a data augmentation technique for training automatic speech to text transcription models
The authors propose and test a data augmentation framework based on deepfake audio to improve training of speech-to-text transcription models.