NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

arxiv: 1604.02808 · v1 · pith:MJ4VTAR5new · submitted 2016-04-11 · 💻 cs.CV

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

Amir Shahroudy , Jun Liu , Tian-Tsong Ng , Gang Wang This is my paper

classification 💻 cs.CV

keywords actiondatasethumanactivityanalysisdepth-basedclassesclassification

0 comments p. Extension

pith:MJ4VTAR5 Add to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{MJ4VTAR5}

Prints a linked pith:MJ4VTAR5 badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Recent approaches in depth-based human activity analysis achieved outstanding performance and proved the effectiveness of 3D representation for classification of action classes. Currently available depth-based and RGB+D-based action recognition benchmarks have a number of limitations, including the lack of training samples, distinct class labels, camera views and variety of subjects. In this paper we introduce a large-scale dataset for RGB+D human action recognition with more than 56 thousand video samples and 4 million frames, collected from 40 distinct subjects. Our dataset contains 60 different action classes including daily, mutual, and health-related actions. In addition, we propose a new recurrent neural network structure to model the long-term temporal correlation of the features for each body part, and utilize them for better action classification. Experimental results show the advantages of applying deep learning methods over state-of-the-art hand-crafted features on the suggested cross-subject and cross-view evaluation criteria for our dataset. The introduction of this large scale dataset will enable the community to apply, develop and adapt various data-hungry learning techniques for the task of depth-based and RGB+D-based human activity analysis.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding
cs.CV 2026-01 unverdicted novelty 6.0

HERMES organizes the KV cache into a hierarchical memory to enable real-time streaming video understanding in MLLMs, achieving 10x faster TTFT and up to 11.4% accuracy gains on streaming benchmarks with 68% fewer tokens.
Explainable Fall Detection for Elderly Monitoring via Temporally Stable SHAP in Skeleton-Based Human Activity Recognition
cs.CV 2026-04 unverdicted novelty 5.0

T-SHAP stabilizes SHAP attributions temporally for LSTM fall detection, achieving 94.3% accuracy and improved faithfulness on NTU RGB+D dataset.