Learning Spatiotemporal Features with 3D Convolutional Networks
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CV (1)
Years: 2019 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
vireoJD-MM at Activity Detection in Extended Videos
The paper reports a multi-stage system for activity detection in extended videos that uses spatial object detections, temporal localization, tubelet generation variants, and late fusion of component outputs.