Chrome Extension
WeChat Mini Program
Use on ChatGLM

Skeleton-based Attention-Aware Spatial-Temporal Model for Action Detection and Recognition.

Ran Cui,Aichun Zhu, Jingran Wu, Gang Hua

IET computer vision(2020)

Cited 9|Views15
No score
Abstract
Action detection and recognition are popular subjects of research in the field of computer vision. The task of action detection can be regarded as the sum of action location and recognition. Action features described by using information concerning the human skeleton have the advantages of robustness against external factors and requiring a small amount of calculation. This study proposes a skeleton‐based action analysis model based on a recurrent neural network framework. The model learns action features by modelling static and dynamic features of skeleton joints and the importance of different video frames by introducing an attention module. For action location, conditional random field loss function is introduced to establish the context dependency of output labels. In the aspect of action recognition, the hierarchical training mechanism with triple loss models action features at coarse‐grained and fine‐grained levels. The authors’ proposed method delivers state‐of‐the‐art results on action location and recognition tasks.
More
Translated text
Key words
recurrent neural nets,image motion analysis,computer vision,image representation,image recognition,video signal processing,human skeleton,skeleton-based action analysis model,action features,static features,dynamic features,skeleton joints,action location,action recognition,triple loss models action,skeleton-based attention-aware spatial–temporal model,action detection,recurrent neural network framework,conditional random field loss function
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined