arxiv: 1904.12993 · v2 · pith:GFQVEOH5new · submitted 2019-04-29 · 💻 cs.CV

A Study on Action Detection in the Wild

classification 💻 cs.CV
keywords action, dataset, categories, detection, problem, examples, propose, rare

The recent introduction of the AVA dataset for action detection has renewed interest in this problem. Several recently proposed approaches have improved performance. However, all of them have ignored the main difficulty of the AVA dataset: its realistic distribution of training and test examples. This dataset was collected by exhaustive annotation of human actions in uncurated videos. As a result, the most common categories, such as 'stand' or 'sit', contain tens of thousands of examples, whereas rare ones have only dozens. In this work we study the problem of action detection in a highly imbalanced dataset. Unlike previous work on handling long-tail category distributions, we begin by analyzing the imbalance in the test set. We demonstrate that the standard AP metric is not informative for the categories in the tail, and propose an alternative, Sampled AP. Armed with this new measure, we study the problem of transferring representations from the data-rich head to the rare tail categories and propose a simple but effective approach.

This paper has not been read by Pith yet.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization

    cs.CV · 2019-07 · unverdicted · novelty 3.0

    Technical report describing use of SlowFast Networks with correlation-preserving augmentation and random label subsampling for ActivityNet 2019 spatio-temporal action localization.