{"title":"基于GLAC特征的多视点二值编码图像人体动作识别","authors":"Mohammad Farhad Bulbul, S. Galib, Hazrat Ali","doi":"10.1109/UCET.2019.8881844","DOIUrl":null,"url":null,"abstract":"This paper presents a human action recognition framework by focusing on the auto-correlation features extracted on the binary coded motion and static information images of depth action video clips. At first, the action video clips are passed to the 3D Motion Trail Model (3DMTM) in order to generate the 2D motion and static information images. Clearly, for a depth video clip, the 3DMTM yields a set of three motion information images and a set of three static information images about the front, side and top projections of all the action video frames. Next, those images are transformed into the binary coded images by the Local Binary Pattern (LBP) operator. Finally, the Gradient Local Auto-Correlation (GLAC) algorithm is employed on those binary coded images for representing depth actions through the auto-correlation features. To classify the multiple actions with the gained features, the Extreme Learning Machine (ELM) is adopted here with the kernel trick. The introduced approach is extensively validated on the Microsoft Research Action3D (MSRAction3D) database. The experimental result demonstrates, the classification outcome of our system is better in comparison with the state-of-the-art systems.","PeriodicalId":169373,"journal":{"name":"2019 UK/ China Emerging Technologies (UCET)","volume":"127 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Human Action Recognition Using GLAC Features on Multi-view Binary Coded Images\",\"authors\":\"Mohammad Farhad Bulbul, S. Galib, Hazrat Ali\",\"doi\":\"10.1109/UCET.2019.8881844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a human action recognition framework by focusing on the auto-correlation features extracted on the binary coded motion and static information images of depth action video clips. At first, the action video clips are passed to the 3D Motion Trail Model (3DMTM) in order to generate the 2D motion and static information images. Clearly, for a depth video clip, the 3DMTM yields a set of three motion information images and a set of three static information images about the front, side and top projections of all the action video frames. Next, those images are transformed into the binary coded images by the Local Binary Pattern (LBP) operator. Finally, the Gradient Local Auto-Correlation (GLAC) algorithm is employed on those binary coded images for representing depth actions through the auto-correlation features. To classify the multiple actions with the gained features, the Extreme Learning Machine (ELM) is adopted here with the kernel trick. The introduced approach is extensively validated on the Microsoft Research Action3D (MSRAction3D) database. The experimental result demonstrates, the classification outcome of our system is better in comparison with the state-of-the-art systems.\",\"PeriodicalId\":169373,\"journal\":{\"name\":\"2019 UK/ China Emerging Technologies (UCET)\",\"volume\":\"127 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 UK/ China Emerging Technologies (UCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UCET.2019.8881844\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 UK/ China Emerging Technologies (UCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UCET.2019.8881844","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
本文针对深度动作视频片段的二值编码运动和静态信息图像提取自相关特征,提出了一种人体动作识别框架。首先,将动作视频片段传递给三维运动轨迹模型(3DMTM),生成二维运动和静态信息图像。显然,对于一个深度视频剪辑,3DMTM产生一组三幅运动信息图像和一组三幅关于所有动作视频帧的正面、侧面和顶部投影的静态信息图像。然后,通过局部二值模式(Local binary Pattern, LBP)算子将这些图像转换为二值编码图像。最后,利用梯度局部自相关(GLAC)算法对二值编码图像进行深度动作表征。为了利用获得的特征对多个动作进行分类,本文采用了极限学习机(ELM)和核技巧。该方法在Microsoft Research Action3D (MSRAction3D)数据库上得到了广泛的验证。实验结果表明,与现有的分类系统相比,该系统的分类效果更好。
Human Action Recognition Using GLAC Features on Multi-view Binary Coded Images
This paper presents a human action recognition framework by focusing on the auto-correlation features extracted on the binary coded motion and static information images of depth action video clips. At first, the action video clips are passed to the 3D Motion Trail Model (3DMTM) in order to generate the 2D motion and static information images. Clearly, for a depth video clip, the 3DMTM yields a set of three motion information images and a set of three static information images about the front, side and top projections of all the action video frames. Next, those images are transformed into the binary coded images by the Local Binary Pattern (LBP) operator. Finally, the Gradient Local Auto-Correlation (GLAC) algorithm is employed on those binary coded images for representing depth actions through the auto-correlation features. To classify the multiple actions with the gained features, the Extreme Learning Machine (ELM) is adopted here with the kernel trick. The introduced approach is extensively validated on the Microsoft Research Action3D (MSRAction3D) database. The experimental result demonstrates, the classification outcome of our system is better in comparison with the state-of-the-art systems.