Evaluation of WhisperIPA and ZIPA reveals persistent performance gaps across languages, accents, gender, ethnicity, and age even after allowing for similar phoneme substitutions.
OWSM - CTC : An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Evaluating Bias in Phoneme-Based Automatic Speech Recognition Systems: An Analysis of IPA Transcription Models
Evaluation of WhisperIPA and ZIPA reveals persistent performance gaps across languages, accents, gender, ethnicity, and age even after allowing for similar phoneme substitutions.