CHAM: action recognition using convolutional hierarchical attention model

Bailing Zhang; Jeremy S. Smith; Shiyang Yan; Wenjin Lu

arxiv: 1705.03146 · v2 · pith:VQXZIDOUnew · submitted 2017-05-09 · 💻 cs.CV

CHAM: action recognition using convolutional hierarchical attention model

Shiyang Yan , Jeremy S. Smith , Wenjin Lu , Bailing Zhang This is my paper

classification 💻 cs.CV

keywords modelattentionconvolutionalhierarchicalactionarchitecturedatasetcategories

0 comments

read the original abstract

Recently, the soft attention mechanism, which was originally proposed in language processing, has been applied in computer vision tasks like image captioning. This paper presents improvements to the soft attention model by combining a convolutional LSTM with a hierarchical system architecture to recognize action categories in videos. We call this model the Convolutional Hierarchical Attention Model (CHAM). The model applies a convolutional operation inside the LSTM cell and an attention map generation process to recognize actions. The hierarchical architecture of this model is able to explicitly reason on multi-granularities of action categories. The proposed architecture achieved improved results on three publicly available datasets: the UCF sports dataset, the Olympic sports dataset and the HMDB51 dataset.

This paper has not been read by Pith yet.

CHAM: action recognition using convolutional hierarchical attention model

discussion (0)