End-to-end people detection in crowded scenes

Mykhaylo Andriluka; Russell Stewart

arxiv: 1506.04878 · v3 · pith:JCFRE3RQnew · submitted 2015-06-16 · 💻 cs.CV

End-to-end people detection in crowded scenes

Russell Stewart , Mykhaylo Andriluka This is my paper

classification 💻 cs.CV

keywords peopleimagecrowdeddetectiondetectionsend-to-endmodelscenes

0 comments

read the original abstract

Current people detectors operate either by scanning an image in a sliding window fashion or by classifying a discrete set of proposals. We propose a model that is based on decoding an image into a set of people detections. Our system takes an image as input and directly outputs a set of distinct detection hypotheses. Because we generate predictions jointly, common post-processing steps such as non-maximum suppression are unnecessary. We use a recurrent LSTM layer for sequence generation and train our model end-to-end with a new loss function that operates on sets of detections. We demonstrate the effectiveness of our approach on the challenging task of detecting people in crowded scenes.

This paper has not been read by Pith yet.

End-to-end people detection in crowded scenes

discussion (0)