Proposal-level spatio-temporal context aggregation for video object detection achieves 80.3% mAP on ImageNet VID, improving Faster R-CNN baseline by 5.8%.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2019 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Object Detection in Video with Spatial-temporal Context Aggregation
Proposal-level spatio-temporal context aggregation for video object detection achieves 80.3% mAP on ImageNet VID, improving Faster R-CNN baseline by 5.8%.