pith. sign in

arxiv: 1810.00434 · v2 · pith:4SEL5UJPnew · submitted 2018-09-30 · 💻 cs.CV · cs.LG

CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video

classification 💻 cs.CV cs.LG
keywords catdetvideodetectoradditionalcascadeddatasetdelaydetection
0
0 comments X
read the original abstract

Detecting objects in a video is a compute-intensive task. In this paper we propose CaTDet, a system to speedup object detection by leveraging the temporal correlation in video. CaTDet consists of two DNN models that form a cascaded detector, and an additional tracker to predict regions of interests based on historic detections. We also propose a new metric, mean Delay(mD), which is designed for latency-critical video applications. Experiments on the KITTI dataset show that CaTDet reduces operation count by 5.1-8.7x with the same mean Average Precision(mAP) as the single-model Faster R-CNN detector and incurs additional delay of 0.3 frame. On CityPersons dataset, CaTDet achieves 13.0x reduction in operations with 0.8% mAP loss.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.