{"title":"Gesture recognition from depth images using motion and shape features","authors":"Shuxin Qin, Yiping Yang, Yongshi Jiang","doi":"10.1109/IMSNA.2013.6743244","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an effective method to recognize 3D gestures from depth images which provide additional body motion and shape features. We project depth images onto three orthogonal planes and calculate the Motion History Image (MHI) of each projection to generate the 3 views MHI (3VMHI). Pyramid Histogram of Oriented Gradients (PHOG) is used to extract the features of the 3VMHI. Then, 3VMHI and PHOG are used together as a combined spacetime descriptor for gesture recognition. We provide a method to extract different gestures from a single video. Consecutive frame difference is employed to perform informative frame selection, which is able to remove uninformative frames. The experimental results on two challenging datasets demonstrate that our approach is effective and efficient.","PeriodicalId":111582,"journal":{"name":"2013 2nd International Symposium on Instrumentation and Measurement, Sensor Network and Automation (IMSNA)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 2nd International Symposium on Instrumentation and Measurement, Sensor Network and Automation (IMSNA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMSNA.2013.6743244","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
In this paper, we propose an effective method to recognize 3D gestures from depth images which provide additional body motion and shape features. We project depth images onto three orthogonal planes and calculate the Motion History Image (MHI) of each projection to generate the 3 views MHI (3VMHI). Pyramid Histogram of Oriented Gradients (PHOG) is used to extract the features of the 3VMHI. Then, 3VMHI and PHOG are used together as a combined spacetime descriptor for gesture recognition. We provide a method to extract different gestures from a single video. Consecutive frame difference is employed to perform informative frame selection, which is able to remove uninformative frames. The experimental results on two challenging datasets demonstrate that our approach is effective and efficient.