ASR Setup and Baselines We consider TIMIT [44] and Aurora-4 [45] for training ASR systems to study robustness of the proposed method to speaker, channel, and noise

Experiments 5

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Transfer Learning from Audio-Visual Grounding to Speech Recognition

cs.CL · 2019-07-09 · unverdicted · novelty 7.0

Features from audio-visual semantic grounding models improve speech recognition when used as input, with earlier layers retaining more phonetic detail and deeper layers showing greater domain invariance.

citing papers explorer

Showing 1 of 1 citing paper.

Transfer Learning from Audio-Visual Grounding to Speech Recognition cs.CL · 2019-07-09 · unverdicted · none · ref 5
Features from audio-visual semantic grounding models improve speech recognition when used as input, with earlier layers retaining more phonetic detail and deeper layers showing greater domain invariance.

ASR Setup and Baselines We consider TIMIT [44] and Aurora-4 [45] for training ASR systems to study robustness of the proposed method to speaker, channel, and noise

fields

years

verdicts

representative citing papers

citing papers explorer