Spatio-temporal cuboid pyramid for action recognition using depth motion sequences

2016 Eighth International Conference on Advanced Computational Intelligence (ICACI) Pub Date : 2016-02-01 DOI:10.1109/ICACI.2016.7449827

Xiaopeng Ji, Jun Cheng, Wei Feng

引用次数: 11

Abstract

In this paper, we present an effective method to recognize human actions from sequences of depth maps, which are captured by a consume depth sensor. In our approach, we first project each frame of a depth sequence onto three orthogonal planes and generate the depth motion sequence (DMS) between two consecutive frames from the three projected views. Then we propose a spatio-temporal cuboid pyramid (STCP) to subdivide the DMS volumes into a set of spatial cuboids on scaled temporal levels. And a cuboid fusion scheme is presented to concatenate the histograms of oriented gradients (HOG) features extracted from the spatial cuboid. The proposed approach is evaluated on three public benchmark datasets, i.e., MSRAction3D, MSRGesture3D and MSRActionPairs dataset. The experimental results demonstrate that the proposed method achieves state-of-the-art performance.

查看原文本刊更多论文

基于深度运动序列的时空长方体金字塔动作识别

在本文中，我们提出了一种有效的方法，从深度传感器捕获的深度图序列中识别人类行为。在我们的方法中，我们首先将深度序列的每一帧投影到三个正交的平面上，并从三个投影视图中生成两个连续帧之间的深度运动序列(DMS)。然后，我们提出了一个时空长方体金字塔(STCP)，将DMS体积在时间尺度上细分为一组空间长方体。提出了一种长方体融合方案，将从空间长方体中提取的定向梯度直方图(HOG)特征进行拼接。该方法在MSRAction3D、MSRGesture3D和MSRActionPairs三个公共基准数据集上进行了评估。实验结果表明，该方法达到了最先进的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)

自引率

0.00%

发文量