arxiv: 2302.14062 · v2 · pith:6G6MV3CSnew · submitted 2023-02-27 · 💻 cs.SD · cs.AI · cs.CL · cs.LG · eess.AS

Explanations for Automatic Speech Recognition

classification 💻 cs.SD cs.AI cs.CL cs.LG eess.AS
keywords explanations audio baseline existing help interpretable quality system
We address quality assessment for neural-network-based ASR by providing explanations that increase our understanding of the system and ultimately help build trust in it. Compared to simple classification labels, explaining transcriptions is more challenging: judging their correctness is not straightforward, and transcriptions, being variable-length sequences, are not handled by existing interpretable machine learning models. We provide an explanation for an ASR transcription as a subset of audio frames that is both a minimal and sufficient cause of the transcription. To do this, we adapt existing explainable AI (XAI) techniques from image classification: Statistical Fault Localisation (SFL) and Causal. Additionally, we use an adapted version of Local Interpretable Model-Agnostic Explanations (LIME) for ASR as a baseline in our experiments. We evaluate the quality of the explanations generated by the proposed techniques over three different ASR systems (Google API, the baseline model of Sphinx, and Deepspeech) and 100 audio samples from the Commonvoice dataset.
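To make the idea of a "minimal and sufficient" subset of audio frames concrete, here is a minimal sketch of an SFL-style procedure under stated assumptions. It is not the paper's implementation: the names `mask`, `rank_frames`, `explain`, and `toy_asr` are hypothetical, masking is assumed to mean replacing a frame with silence (0.0), the ranking formula is an Ochiai-style measure chosen for illustration, and the greedy loop only approximates minimality. The ASR system is treated as a black-box function from frames to text.

```python
import math
import random
from typing import Callable, List

Frames = List[float]
Asr = Callable[[Frames], str]  # hypothetical black-box ASR: frames -> text


def mask(frames: Frames, keep: List[bool]) -> Frames:
    """Silence (zero out) every frame whose keep flag is False."""
    return [f if k else 0.0 for f, k in zip(frames, keep)]


def rank_frames(frames: Frames, transcribe: Asr,
                n_mutants: int = 200, keep_prob: float = 0.5,
                seed: int = 0) -> List[int]:
    """Rank frame indices by how strongly keeping them preserves the output."""
    rng = random.Random(seed)
    target = transcribe(frames)
    n = len(frames)
    kept_pass = [0] * n    # frame kept, transcription unchanged
    kept_fail = [0] * n    # frame kept, transcription changed
    masked_pass = [0] * n  # frame masked, transcription unchanged
    for _ in range(n_mutants):
        keep = [rng.random() < keep_prob for _ in range(n)]
        passed = transcribe(mask(frames, keep)) == target
        for i, kept in enumerate(keep):
            if kept and passed:
                kept_pass[i] += 1
            elif kept:
                kept_fail[i] += 1
            elif passed:
                masked_pass[i] += 1

    def score(i: int) -> float:
        # Ochiai-style measure (an illustrative choice, not the paper's).
        denom = math.sqrt((kept_pass[i] + masked_pass[i])
                          * (kept_pass[i] + kept_fail[i]))
        return kept_pass[i] / denom if denom else 0.0

    return sorted(range(n), key=score, reverse=True)


def explain(frames: Frames, transcribe: Asr) -> List[int]:
    """Greedily add top-ranked frames until they alone reproduce the output."""
    target = transcribe(frames)
    keep = [False] * len(frames)
    chosen: List[int] = []
    for i in rank_frames(frames, transcribe):
        keep[i] = True
        chosen.append(i)
        if transcribe(mask(frames, keep)) == target:
            break  # sufficient subset found
    return sorted(chosen)


def toy_asr(frames: Frames) -> str:
    """Toy stand-in for an ASR system: only frames 2 and 5 matter."""
    return "yes" if frames[2] != 0.0 and frames[5] != 0.0 else ""


frames = [1.0] * 8
explanation = explain(frames, toy_asr)
print(explanation)
```

In this toy setting the transcription depends only on frames 2 and 5, so any sufficient subset must contain both. With a real ASR system, exact string equality is likely too strict a notion of "same transcription", and a similarity threshold would be a natural substitute.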

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Towards Dys-XAI: Influence-Based Explanations for Dysarthria Severity Assessment

    cs.AI 2026-06 unverdicted novelty 6.0

    Introduces an instance-level influence-based XAI method for dysarthria severity assessment that explains predictions by computing per-utterance influence scores from training samples and validates them via controlled ...