Disfluency Detection using a Bidirectional LSTM
We introduce a new approach for disfluency detection using a Bidirectional Long Short-Term Memory neural network (BLSTM). In addition to the word sequence, the model takes as input pattern-match features that were developed to reduce sensitivity to vocabulary size in training, which leads to improved performance over the word sequence alone. The BLSTM takes advantage of explicit repair states in addition to the standard reparandum states. The final output leverages integer linear programming to incorporate constraints of disfluency structure. In experiments on the Switchboard corpus, the model achieves state-of-the-art performance on both the standard disfluency detection task and the correction detection task. Analysis shows that the model is better at detecting non-repetition disfluencies, which tend to be much harder to detect.
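The abstract does not spell out the pattern-match features, but the idea of vocabulary-independent repetition cues can be sketched as follows. This is an illustrative assumption, not the paper's exact feature set: for each token, flag whether the same word recurs within a short window, since repetitions are a strong disfluency cue and the flags carry no vocabulary-specific information.

```python
def pattern_match_features(words, window=3):
    """Illustrative (assumed) pattern-match features for disfluency detection:
    per-token boolean flags for word repetition within a nearby window.
    These flags depend only on surface repetition, not on the identity of
    the word, so they are insensitive to training-vocabulary size."""
    feats = []
    for i, w in enumerate(words):
        ahead = words[i + 1 : i + 1 + window]          # next few words
        behind = words[max(0, i - window) : i]         # previous few words
        feats.append({
            "repeat_ahead": w in ahead,                # word recurs soon after
            "repeat_behind": w in behind,              # word recurred recently
            "next_word_same": i + 1 < len(words) and words[i + 1] == w,
        })
    return feats

# Hypothetical example with a repetition disfluency ("I want I want to go"):
# the first "I" and "want" are flagged as repeating ahead, marking the
# likely reparandum region for the tagger.
example = pattern_match_features("i want i want to go".split())
```

In the paper's setup, features like these would be concatenated with the word embedding at each time step before being fed to the BLSTM.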
Forward citations
Cited by 1 Pith paper
Mind the Pause: Disfluency-Aware Objective Tuning for Multilingual Speech Correction with LLMs
A sequence-tagger-guided LLM with contrastive objective corrects disfluencies in Hindi, Bengali, and Marathi ASR transcripts, outperforming removal-only baselines.