Deep Feature Flow for Video Recognition

Jifeng Dai; Lu Yuan; Xizhou Zhu; Yichen Wei; Yuwen Xiong

arxiv: 1611.07715 · v2 · pith:T2TC72EUnew · submitted 2016-11-23 · 💻 cs.CV

Deep Feature Flow for Video Recognition

Xizhou Zhu , Yuwen Xiong , Jifeng Dai , Lu Yuan , Yichen Wei This is my paper

classification 💻 cs.CV

keywords recognitiondeepflowfeaturevideoconvolutionalfastframes

0 comments

read the original abstract

Deep convolutional neutral networks have achieved great success on image recognition tasks. Yet, it is non-trivial to transfer the state-of-the-art image recognition networks to videos as per-frame evaluation is too slow and unaffordable. We present deep feature flow, a fast and accurate framework for video recognition. It runs the expensive convolutional sub-network only on sparse key frames and propagates their deep feature maps to other frames via a flow field. It achieves significant speedup as flow computation is relatively fast. The end-to-end training of the whole architecture significantly boosts the recognition accuracy. Deep feature flow is flexible and general. It is validated on two recent large scale video datasets. It makes a large step towards practical video recognition.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

FAST-ME: Foundation-aware Adaptive Stopping for Motion Estimation for Efficient IoT Video Analysis
cs.CV 2026-05 unverdicted novelty 5.0

A hybrid motion estimation framework combines optimal stopping theory with foundation model semantic scores to reduce computation while maintaining accuracy and semantic coverage in video analysis.