A framework using speech-text alignment to generate expressive pseudo-audio prompts for improved text-only domain adaptation in LLM-based ASR.
Multi-speaker sequence-to-sequence speech synthesis for data augmentation in acoustic-to-word speech recognition
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.SD 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Refining Pseudo-Audio Prompts with Speech-Text Alignment for Text-Only Domain Adaptation in LLM-Based ASR
A framework using speech-text alignment to generate expressive pseudo-audio prompts for improved text-only domain adaptation in LLM-based ASR.