Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019

Dong Li; Qi Cai; Ting Yao; Yehao Li; Yingwei Pan; Zhaofan Qiu

arxiv: 1906.07016 · v1 · pith:53NFL6UXnew · submitted 2019-06-14 · 💻 cs.CV

Zhaofan Qiu, Dong Li, Yehao Li, Qi Cai, Yingwei Pan, Ting Yao

classification 💻 cs.CV

keywords action · activitynet · challenge · dense-captioning · events · localization · recognition · spatio-temporal

This notebook paper presents an overview and comparative analysis of our systems designed for the following three tasks in ActivityNet Challenge 2019: trimmed action recognition, dense-captioning events in videos, and spatio-temporal action localization.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

vireoJD-MM at Activity Detection in Extended Videos
cs.CV 2019-06 unverdicted novelty 2.0

The paper reports a multi-stage system for activity detection in extended videos that uses spatial object detections, temporal localization, tubelet generation variants, and late fusion of component outputs.