
arxiv: 1812.10000 · v1 · pith:NNLJQR2Inew · submitted 2018-12-25 · 💻 cs.CV

Similarity R-C3D for Few-shot Temporal Activity Detection

classification 💻 cs.CV
keywords few-shot · activity · detection · temporal · examples · similarity · activities · activitynet1

Many activities of interest are rare events, with only a few labeled examples available. Models for temporal activity detection that can learn from a few examples are therefore desirable. In this paper, we present a conceptually simple, general, and novel framework for few-shot temporal activity detection that detects the start and end times of the few-shot input activities in an untrimmed video. Our model is end-to-end trainable and can benefit from more few-shot examples. At test time, each proposal is assigned the label of the few-shot activity class with the maximum similarity score. Our Similarity R-C3D method outperforms previous work on three large-scale benchmarks for temporal activity detection (THUMOS14, ActivityNet1.2, and ActivityNet1.3) in the few-shot setting. Our code will be made available.
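The test-time labeling rule in the abstract (each proposal takes the few-shot class with the maximum similarity score) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say which similarity function is used, so cosine similarity is an assumption here, as are the plain feature vectors and the hypothetical names `cosine_similarity`, `assign_labels`, `proposal_features`, and `class_features`.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (assumed similarity measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def assign_labels(proposal_features, class_features):
    """Give each proposal the few-shot class with the highest similarity score.

    proposal_features: list of feature vectors, one per temporal proposal.
    class_features: dict mapping a class name to one reference feature vector
                    (hypothetical: e.g. an average over that class's few-shot examples).
    Returns a list of (class_name, score) pairs, one per proposal.
    """
    results = []
    for feature in proposal_features:
        scores = {
            name: cosine_similarity(feature, reference)
            for name, reference in class_features.items()
        }
        best = max(scores, key=scores.get)  # class with the maximum similarity
        results.append((best, scores[best]))
    return results


# Toy example with made-up 2-D features.
class_features = {"high_jump": [1.0, 0.0], "diving": [0.0, 1.0]}
proposal_features = [[0.9, 0.1], [0.2, 0.8]]
labels = assign_labels(proposal_features, class_features)
```

In this toy run the first proposal is labeled `high_jump` and the second `diving`. A real detector would also need the proposal start and end times and some handling of background proposals, which this sketch leaves out.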
