Two-stream convolutional networks for action recognition in videos

· 2014

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

Transfer Learning from Audio-Visual Grounding to Speech Recognition

cs.CL · 2019-07-09 · unverdicted · novelty 7.0

Features from audio-visual semantic grounding models improve speech recognition when used as input, with earlier layers retaining more phonetic detail and deeper layers showing greater domain invariance.

citing papers explorer

Showing 1 of 1 citing paper.

Transfer Learning from Audio-Visual Grounding to Speech Recognition

cs.CL · 2019-07-09 · unverdicted · none · ref 39
