Seq-NMS for Video Object Detection

Honghui Shi; Jianan Li; Mohammad Babaeizadeh; Pooya Khorrami; Prajit Ramachandran; Shuicheng Yan; Thomas S. Huang; Tom Le Paine; Wei Han

arxiv: 1602.08465 · v3 · pith:F74K3WGNnew · submitted 2016-02-26 · 💻 cs.CV

Seq-NMS for Video Object Detection

Wei Han , Pooya Khorrami , Tom Le Paine , Prajit Ramachandran , Mohammad Babaeizadeh , Honghui Shi , Jianan Li , Shuicheng Yan

show 1 more author

Thomas S. Huang

This is my paper

classification 💻 cs.CV

keywords objectdetectionvideoclipdetectionsframeimagemethod

0 comments

read the original abstract

Video object detection is challenging because objects that are easily detected in one frame may be difficult to detect in another frame within the same clip. Recently, there have been major advances for doing object detection in a single image. These methods typically contain three phases: (i) object proposal generation (ii) object classification and (iii) post-processing. We propose a modification of the post-processing phase that uses high-scoring object detections from nearby frames to boost scores of weaker detections within the same clip. We show that our method obtains superior results to state-of-the-art single image object detection techniques. Our method placed 3rd in the video object detection (VID) task of the ImageNet Large Scale Visual Recognition Challenge 2015 (ILSVRC2015).

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MR2-ByteTrack: CNN and Transformer-based Video Object Detection for AI-augmented Embedded Vision Sensor Nodes
cs.CV 2026-05 conditional novelty 5.0

MR2-ByteTrack maintains high accuracy in video object detection on MCUs by combining multi-resolution processing, ByteTrack for frame linking, and Rescore for confidence aggregation, achieving up to 55% energy savings...