pith. sign in

arxiv: 1611.02447 · v2 · pith:3UDUUU2Ynew · submitted 2016-11-08 · 💻 cs.CV

Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks

classification 💻 cs.CV
keywords recognitionactionconvnetsdatasetconvolutionalhumanjointmaps
0
0 comments X
read the original abstract

Recently, Convolutional Neural Networks (ConvNets) have shown promising performances in many computer vision tasks, especially image-based recognition. How to effectively use ConvNets for video-based recognition is still an open problem. In this paper, we propose a compact, effective yet simple method to encode spatio-temporal information carried in $3D$ skeleton sequences into multiple $2D$ images, referred to as Joint Trajectory Maps (JTM), and ConvNets are adopted to exploit the discriminative features for real-time human action recognition. The proposed method has been evaluated on three public benchmarks, i.e., MSRC-12 Kinect gesture dataset (MSRC-12), G3D dataset and UTD multimodal human action dataset (UTD-MHAD) and achieved the state-of-the-art results.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.