DSSD : Deconvolutional Single Shot Detector

· 2017 · cs.CV · arXiv 1701.06659

7 Pith papers cite this work. Polarity classification is still indexing.


abstract

The main contribution of this paper is an approach for introducing additional context into state-of-the-art general object detection. To achieve this we first combine a state-of-the-art classifier (Residual-101[14]) with a fast detection framework (SSD[18]). We then augment SSD+Residual-101 with deconvolution layers to introduce additional large-scale context in object detection and improve accuracy, especially for small objects, calling our resulting system DSSD for deconvolutional single shot detector. While these two contributions are easily described at a high-level, a naive implementation does not succeed. Instead we show that carefully adding additional stages of learned transformations, specifically a module for feed-forward connections in deconvolution and a new output module, enables this new approach and forms a potential way forward for further detection research. Results are shown on both PASCAL VOC and COCO detection. Our DSSD with $513 \times 513$ input achieves 81.5% mAP on VOC2007 test, 80.0% mAP on VOC2012 test, and 33.2% mAP on COCO, outperforming a state-of-the-art method R-FCN[3] on each dataset.
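
The abstract's idea of feeding a shallower feature map into a deconvolution stage can be illustrated with a small sketch. This is a toy, single-channel version in pure Python, not the authors' implementation. The function names, the 2x2 stride-2 kernel, the all-ones weights, and the choice to merge by element-wise product followed by ReLU are all assumptions made for illustration; the abstract does not specify any of them.

```python
def deconv2x2_stride2(feature, kernel):
    """Transposed convolution with a 2x2 kernel and stride 2 on one channel.

    Each input value is scattered into a non-overlapping 2x2 output patch,
    scaled by the kernel weights, so an HxW map becomes 2Hx2W.
    """
    h, w = len(feature), len(feature[0])
    out = [[0.0] * (2 * w) for _ in range(2 * h)]
    for i in range(h):
        for j in range(w):
            for di in range(2):
                for dj in range(2):
                    out[2 * i + di][2 * j + dj] += feature[i][j] * kernel[di][dj]
    return out


def fuse(deep, shallow, kernel):
    """Upsample the deep map, then merge it with the shallow map."""
    up = deconv2x2_stride2(deep, kernel)
    if len(up) != len(shallow) or len(up[0]) != len(shallow[0]):
        raise ValueError("shallow map must be exactly twice the size of the deep map")
    # Assumed merge rule: element-wise product followed by ReLU.
    return [[max(0.0, u * s) for u, s in zip(up_row, sh_row)]
            for up_row, sh_row in zip(up, shallow)]


# Hypothetical inputs: a 1x2 deep map and a 2x4 shallow map.
deep = [[1.0, 2.0]]
shallow = [[1.0, 1.0, 0.5, 0.5], [-1.0, 1.0, 2.0, 0.0]]
kernel = [[1.0, 1.0], [1.0, 1.0]]  # placeholder weights; these would be learned
fused = fuse(deep, shallow, kernel)
```

In a real detector the kernel weights are learned and the maps have many channels; the sketch only shows how a coarse, context-rich map can be brought to the resolution of a finer one and combined with it.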

representative citing papers

ALFA: Agglomerative Late Fusion Algorithm for Object Detection

cs.CV · 2019-07-13 · unverdicted · novelty 7.0

ALFA is a late-fusion algorithm that clusters predictions from detectors like SSD and Faster R-CNN using location and score information, yielding lower error than individual detectors or prior fusion methods on PASCAL VOC 2007/2012.

SAM-Sode: Towards Faithful Explanations for Tiny Bacteria Detection

cs.CV · 2026-05-20 · unverdicted · novelty 6.0

SAM-Sode refines explanation maps for tiny bacteria detection by converting them into prompts for the SAM3 model and applying physical and geometric dual constraints to suppress background noise.

Reprojection R-CNN: A Fast and Accurate Object Detector for 360° Images

cs.CV · 2019-07-27 · unverdicted · novelty 6.0

Reprojection R-CNN is a two-stage detector for 360° images combining a distortion-aware spherical RPN on ERP with a reprojection network on perspective projections, reporting higher mAP than prior methods on two new synthetic datasets at 178 ms per image.

Towards Adversarially Robust Object Detection

cs.CV · 2019-07-24 · unverdicted · novelty 5.0

Develops a multi-task learning based adversarial training approach to improve robustness of object detectors to adversarial attacks, with experiments on PASCAL-VOC and MS-COCO.

Cascade RetinaNet: Maintaining Consistency for Single-Stage Object Detection

cs.CV · 2019-07-16 · unverdicted · novelty 5.0

Cas-RetinaNet improves RetinaNet by 2 AP on MS COCO by training cascade stages on rising IoU thresholds and adding a Feature Consistency Module to align classification confidence with localization accuracy.

YOLOv3: An Incremental Improvement

cs.CV · 2018-04-08 · accept · novelty 4.0

YOLOv3 achieves accuracy comparable to SSD and RetinaNet but runs substantially faster, with 28.2 mAP at 320x320 in 22 ms and 57.9 mAP@50 in 51 ms on Titan X.

A unified neural network for object detection, multiple object tracking and vehicle re-identification

cs.CV · 2019-07-08 · unverdicted · novelty 3.0

Faster R-CNN is extended with a track branch and trained end-to-end on concatenated video frames to unify detection and re-identification, reaching 57.79% mAP on the AIC19 vehicle dataset.

citing papers explorer

ALFA: Agglomerative Late Fusion Algorithm for Object Detection

cs.CV · 2019-07-13 · unverdicted · none · ref 4 · internal anchor

SAM-Sode: Towards Faithful Explanations for Tiny Bacteria Detection

cs.CV · 2026-05-20 · unverdicted · none · ref 12 · internal anchor

Reprojection R-CNN: A Fast and Accurate Object Detector for 360° Images

cs.CV · 2019-07-27 · unverdicted · none · ref 9 · internal anchor

Towards Adversarially Robust Object Detection

cs.CV · 2019-07-24 · unverdicted · none · ref 13 · internal anchor

Cascade RetinaNet: Maintaining Consistency for Single-Stage Object Detection

cs.CV · 2019-07-16 · unverdicted · none · ref 8 · internal anchor

YOLOv3: An Incremental Improvement

cs.CV · 2018-04-08 · accept · none · ref 3

A unified neural network for object detection, multiple object tracking and vehicle re-identification

cs.CV · 2019-07-08 · unverdicted · none · ref 16 · internal anchor
