Kinematic-Layout-aware Random Forests for Depth-based Action Recognition

arXiv: Computer Vision and Pattern Recognition Pub Date : 2016-07-23 DOI:10.5244/C.31.13

Seungryul Baek, Zhiyuan Shi, M. Kawade, Tae-Kyun Kim

{"title":"Kinematic-Layout-aware Random Forests for Depth-based Action Recognition","authors":"Seungryul Baek, Zhiyuan Shi, M. Kawade, Tae-Kyun Kim","doi":"10.5244/C.31.13","DOIUrl":null,"url":null,"abstract":"In this paper, we tackle the problem of 24 hours-monitoring patient actions in a ward such as \"stretching an arm out of the bed\", \"falling out of the bed\", where temporal movements are subtle or significant. In the concerned scenarios, the relations between scene layouts and body kinematics (skeletons) become important cues to recognize actions; however they are hard to be secured at a testing stage. To address this problem, we propose a kinematic-layout-aware random forest which takes into account the kinematic-layout (\\ie layout and skeletons), to maximize the discriminative power of depth image appearance. We integrate the kinematic-layout in the split criteria of random forests to guide the learning process by 1) determining the switch to either the depth appearance or the kinematic-layout information, and 2) implicitly closing the gap between two distributions obtained by the kinematic-layout and the appearance, when the kinematic-layout appears useful. The kinematic-layout information is not required for the test data, thus called \"privileged information prior\". The proposed method has also been testified in cross-view settings, by the use of view-invariant features and enforcing the consistency among synthetic-view data. Experimental evaluations on our new dataset PATIENT, CAD-60 and UWA3D (multiview) demonstrate that our method outperforms various state-of-the-arts.","PeriodicalId":185904,"journal":{"name":"arXiv: Computer Vision and Pattern Recognition","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv: Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5244/C.31.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

In this paper, we tackle the problem of 24 hours-monitoring patient actions in a ward such as "stretching an arm out of the bed", "falling out of the bed", where temporal movements are subtle or significant. In the concerned scenarios, the relations between scene layouts and body kinematics (skeletons) become important cues to recognize actions; however they are hard to be secured at a testing stage. To address this problem, we propose a kinematic-layout-aware random forest which takes into account the kinematic-layout (\ie layout and skeletons), to maximize the discriminative power of depth image appearance. We integrate the kinematic-layout in the split criteria of random forests to guide the learning process by 1) determining the switch to either the depth appearance or the kinematic-layout information, and 2) implicitly closing the gap between two distributions obtained by the kinematic-layout and the appearance, when the kinematic-layout appears useful. The kinematic-layout information is not required for the test data, thus called "privileged information prior". The proposed method has also been testified in cross-view settings, by the use of view-invariant features and enforcing the consistency among synthetic-view data. Experimental evaluations on our new dataset PATIENT, CAD-60 and UWA3D (multiview) demonstrate that our method outperforms various state-of-the-arts.

查看原文本刊更多论文

基于深度动作识别的运动布局感知随机森林

在本文中，我们解决了24小时监测病房中患者行为的问题，例如“将手臂伸出床外”，“从床上掉下来”，这些时间运动是微妙或重要的。在相关场景中，场景布局与人体运动学(骨架)之间的关系成为识别动作的重要线索;然而，它们在测试阶段很难得到保证。为了解决这个问题，我们提出了一个考虑运动布局(即布局和骨架)的动态布局感知随机森林，以最大限度地提高深度图像外观的判别能力。我们将运动学布局整合到随机森林的分割准则中，以指导学习过程:1)确定向深度外观或运动学布局信息的切换;2)当运动学布局有用时，隐式地缩小由运动学布局和外观获得的两个分布之间的差距。测试数据不需要运动学布局信息，因此称为“优先特权信息”。通过使用视图不变性特征和增强合成视图数据之间的一致性，该方法在交叉视图设置中也得到了验证。在我们的新数据集PATIENT、CAD-60和UWA3D(多视图)上的实验评估表明，我们的方法优于各种最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv: Computer Vision and Pattern Recognition

自引率

0.00%

发文量