Learning spatio-temporal features for action recognition from the side of the video

Lishen Pei,Mao Ye,Xuezhuan Zhao,Tao Xiang,Tao Li

Signal, Image and Video Processing（2014）

引用 10|浏览43

暂无评分

摘要

A novel spatio-temporal feature learning approach is introduced for action recognition. First, we automatically detect and track the actor, and map the action track to a cuboid. Then, we split the cuboid into block sequences. Each block sequence is represented as a data vector by concatenating the block shape features. For each action category, we use a two-layer network to learn the distribution of the data vectors. The first layer network is constituted by multiple Restricted Boltzmann Machines (RBMs). Each RBM is trained by the data vectors that have the same spatial location. The output of the second layer RBM is the learned spatio-temporal feature. At last, we train a Support Vector Machine classifier for each class to recognize the actions. Experiments on challenging data sets confirm the effectiveness of our approach.

查看译文

关键词

Action recognition,Restricted Boltzmann Machines (RBMs),Spatio-temporal features,Support vector machine

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要