Modeling spatial layout of features for real world scenario RGB-D action recognition

2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) Pub Date : 2016-08-01 DOI:10.1109/AVSS.2016.7738023

Michal Koperski, F. Brémond

引用次数: 16

Abstract

Depth information improves skeleton detection, thus skeleton based methods are the most popular methods in RGB-D action recognition. But skeleton detection working range is limited in terms of distance and view-point. Most of the skeleton based action recognition methods ignore fact that skeleton may be missing. Local points-of-interest (POIs) do not require skeleton detection. But they fail if they cannot detect enough POIs e.g. amount of motion in action is low. Most of them ignore spatial-location of features. We cope with the above problems by employing people detector instead of skeleton detector. We propose method to encode spatial-layout of features inside bounding box. We also introduce descriptor which encodes static information for actions with low amount of motion. We validate our approach on: 3 public data-sets. The results show that our method is competitive to skeleton based methods, while requiring much simpler people detection instead of skeleton detection.

查看原文本刊更多论文

真实场景RGB-D动作识别特征空间布局建模

深度信息改进了骨骼检测，因此基于骨骼的方法是RGB-D动作识别中最常用的方法。但是骨骼检测的工作范围受到距离和视点的限制。大多数基于骨架的动作识别方法都忽略了骨架可能缺失的事实。局部兴趣点(poi)不需要骨架检测。但如果它们不能检测到足够的poi，例如动作中的运动量很低，它们就会失败。它们大多忽略了特征的空间定位。为了解决以上问题，我们采用了人体探测器而不是骨骼探测器。提出了对边界框内特征的空间布局进行编码的方法。我们还引入了描述符，对低运动量动作的静态信息进行编码。我们在3个公共数据集上验证了我们的方法。结果表明，该方法与基于骨架的方法相比具有一定的竞争力，但需要更简单的人物检测而不是骨架检测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

自引率

0.00%

发文量