Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019
classification
💻 cs.CV
keywords
actionactivitynetchallengedense-captioningeventslocalizationrecognitionspatio-temporal
read the original abstract
This notebook paper presents an overview and comparative analysis of our systems designed for the following three tasks in ActivityNet Challenge 2019: trimmed action recognition, dense-captioning events in videos, and spatio-temporal action localization.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
vireoJD-MM at Activity Detection in Extended Videos
The paper reports a multi-stage system for activity detection in extended videos that uses spatial object detections, temporal localization, tubelet generation variants, and late fusion of component outputs.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.