A Text-To-Text Alignment Algorithm for Better Evaluation of Modern Speech Recognition Systems

2025 · cs.CL · arXiv 2509.24478

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

Modern neural networks have greatly improved performance across speech recognition benchmarks. However, gains are often driven by frequent words with limited semantic weight, which can obscure meaningful differences in word error rate, the primary evaluation metric. Errors in rare terms, named entities, and domain-specific vocabulary are more consequential, but remain hidden by aggregate metrics. This highlights the need for finer-grained error analysis, which depends on accurate alignment between reference and model transcripts. However, conventional alignment methods are not designed for such precision. We propose a novel alignment algorithm that couples dynamic programming with beam search scoring. Compared to traditional text alignment methods, our approach provides more accurate alignment of individual errors, enabling reliable error analysis. The algorithm is made available via PyPI.
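
For context, the kind of conventional alignment the abstract contrasts against can be sketched as a word-level Levenshtein alignment computed with dynamic programming. This is only a minimal illustration of that baseline, not the paper's algorithm: the beam search scoring it proposes is not reproduced here, and the function names `align_words` and `word_error_rate` are hypothetical, chosen for this example.

```python
from typing import List, Tuple


def align_words(ref: List[str], hyp: List[str]) -> List[Tuple[str, str, str]]:
    """Baseline Levenshtein alignment; returns (operation, ref_word, hyp_word) triples."""
    n, m = len(ref), len(hyp)
    # cost[i][j] = minimum number of edits turning ref[:i] into hyp[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diagonal = cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            cost[i][j] = min(diagonal, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Backtrace from the bottom-right cell to recover one optimal alignment.
    ops: List[Tuple[str, str, str]] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and cost[i][j] == cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1]):
            op = "match" if ref[i - 1] == hyp[j - 1] else "substitute"
            ops.append((op, ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            ops.append(("delete", ref[i - 1], ""))
            i -= 1
        else:
            ops.append(("insert", "", hyp[j - 1]))
            j -= 1
    ops.reverse()
    return ops


def word_error_rate(ref: List[str], hyp: List[str]) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    if not ref:
        raise ValueError("reference must contain at least one word")
    errors = sum(op != "match" for op, _, _ in align_words(ref, hyp))
    return errors / len(ref)


if __name__ == "__main__":
    reference = "the cat sat on the mat".split()
    hypothesis = "the cat sat on mat".split()
    for op, r, h in align_words(reference, hypothesis):
        print(f"{op:<10} {r!r:<8} {h!r}")
    print("WER:", word_error_rate(reference, hypothesis))
```

When several alignments share the same minimum cost, the backtrace above simply picks the first one it finds, so the error assigned to a particular word can be arbitrary. That ambiguity is the sort of imprecision in individual error alignment that the abstract identifies as a limitation of conventional methods.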

representative citing papers

A Text-To-Text Alignment Algorithm for Better Evaluation of Modern Speech Recognition Systems

cs.CL · 2025-09-29 · unverdicted · novelty 5.0

A novel alignment algorithm using dynamic programming and beam search provides more accurate matching of individual errors between reference and model transcripts for improved speech recognition evaluation.

citing papers explorer

Showing 1 of 1 citing paper.

A Text-To-Text Alignment Algorithm for Better Evaluation of Modern Speech Recognition Systems

cs.CL · 2025-09-29 · unverdicted · none · ref 2 · internal anchor

A novel alignment algorithm using dynamic programming and beam search provides more accurate matching of individual errors between reference and model transcripts for improved speech recognition evaluation.
