Human contour extraction from RGBD camera for action recognition
Xiaohui Huang, Jun Cheng, Xiaopeng Ji
2016 IEEE International Conference on Information and Automation (ICIA), August 2016. DOI: 10.1109/ICINFA.2016.7832114
The spatio-temporal cuboid pyramid (STCP) for action recognition using depth motion sequences [1] is affected by depth-camera error, which introduces various kinds of noise into the depth motion sequence (DMS), especially on the body surface. As a result, the dimensionality of the DMS becomes very high and the features for action recognition become less discriminative. In this paper, we present an effective noise-reduction method based on foreground segmentation. We first segment and extract the human contour in the color image using a convolutional network model. Then, the human contour is refined using depth information. Third, we project each frame of the segmented depth sequence onto three views. Finally, we extract features from the cuboids and recognize human actions. The proposed approach is evaluated on three public benchmark datasets: the UTKinect-Action, MSRActionPairs, and 3D Online Action datasets. Experimental results show that our method achieves state-of-the-art performance.
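The three-view projection step mentioned in the abstract can be sketched as follows. This is a minimal, hypothetical illustration of projecting one segmented depth frame onto front, side, and top views (a common construction for depth-motion features); the function name, depth quantization, and binary output maps are assumptions, and the paper's exact formulation may differ.

```python
import numpy as np

def project_three_views(depth_frame, max_depth=4000):
    """Project one segmented depth frame onto three orthogonal views.

    depth_frame: 2-D array of depth values in millimeters
                 (0 = background / no reading after segmentation).
    max_depth:   number of depth bins used for the side and top views.
    Returns (front, side, top); front keeps the raw depths, while
    side and top are binary occupancy maps over depth bins.
    """
    h, w = depth_frame.shape
    front = depth_frame.copy()                        # front (XY) view: the depth map itself
    side = np.zeros((h, max_depth), dtype=np.uint8)   # side (YZ) view: rows x depth bins
    top = np.zeros((max_depth, w), dtype=np.uint8)    # top (XZ) view: depth bins x cols
    ys, xs = np.nonzero(depth_frame)                  # only segmented foreground pixels
    ds = np.clip(depth_frame[ys, xs].astype(int), 0, max_depth - 1)
    side[ys, ds] = 1                                  # mark occupied (row, depth) cells
    top[ds, xs] = 1                                   # mark occupied (depth, col) cells
    return front, side, top
```

Because the projection only visits nonzero (foreground) pixels, the foreground segmentation described in the paper directly reduces the noise that would otherwise spread across the side and top views.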