
arxiv: 1904.04388 · v1 · pith:7Z7YPXVAnew · submitted 2019-04-08 · 💻 cs.CL · cs.AI

Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection

keywords: cues, acoustic, detection, disfluency, innovations, integrating, prosodic, prosody

Disfluencies in spontaneous speech are known to be associated with prosodic disruptions. However, most algorithms for disfluency detection use only word transcripts. Integrating prosodic cues has proved difficult because of the many sources of variability affecting the acoustic correlates. This paper introduces a new approach to extracting acoustic-prosodic cues using text-based distributional prediction of acoustic cues to derive vector z-score features (innovations). We explore both early and late fusion techniques for integrating text and prosody, showing gains over a high-accuracy text-only model.
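The "innovation" idea in the abstract can be illustrated with a minimal sketch. It assumes a text-based model has already produced, for each acoustic cue of a word, a predicted mean and standard deviation; the innovation is then the z-score of the observed cue under that prediction. The function name, the `eps` floor, and the example numbers are hypothetical and are not taken from the paper, which may compute these features differently.

```python
from typing import List, Sequence


def innovation_features(
    observed: Sequence[float],
    predicted_mean: Sequence[float],
    predicted_std: Sequence[float],
    eps: float = 1e-6,  # hypothetical floor to avoid division by zero
) -> List[float]:
    """Return per-cue z-scores of observed acoustic cues.

    Each value measures how far an observed cue (e.g. pause length or
    pitch) lies from what the text-based prediction expected, in units
    of the predicted standard deviation.
    """
    if not (len(observed) == len(predicted_mean) == len(predicted_std)):
        raise ValueError("all inputs must have the same length")
    return [
        (obs - mean) / max(std, eps)
        for obs, mean, std in zip(observed, predicted_mean, predicted_std)
    ]


# Illustrative values only: two acoustic cues for a single word.
z = innovation_features(
    observed=[2.0, 1.0],
    predicted_mean=[1.0, 1.0],
    predicted_std=[0.5, 2.0],
)
```

A large absolute z-score marks a cue that is unexpected given the words alone, which is the kind of signal a disfluency detector could combine with text features through early or late fusion.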
