Direct prompting scales more consistently than CoT prompting for speech-to-text translation as the amount of S2TT data increases.
To do so, we generate pseudo-labeled S2TT data (S2TT pl) and evaluate a series of models trained with varying amounts of it.
1 Pith paper cites this work.
Fields: cs.CL (1). Years: 2025 (1). Verdicts: CONDITIONAL (1).
Representative citing paper:
Revisiting Direct Speech-to-Text Translation with Speech LLMs: Better Scaling than CoT Prompting?