Similar Gesture Recognition using Hierarchical Classification Approach in RGB Videos

2018 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2018-12-01 DOI:10.1109/DICTA.2018.8615804

Di Wu, N. Sharma, M. Blumenstein

{"title":"Similar Gesture Recognition using Hierarchical Classification Approach in RGB Videos","authors":"Di Wu, N. Sharma, M. Blumenstein","doi":"10.1109/DICTA.2018.8615804","DOIUrl":null,"url":null,"abstract":"Recognizing human actions from the video streams has become one of the very popular research areas in computer vision and deep learning in the recent years. Action recognition is wildly used in different scenarios in real life, such as surveillance, robotics, healthcare, video indexing and human-computer interaction. The challenges and complexity involved in developing a video-based human action recognition system are manifold. In particular, recognizing actions with similar gestures and describing complex actions is a very challenging problem. To address these issues, we study the problem of classifying human actions using Convolutional Neural Networks (CNN) and develop a hierarchical 3DCNN architecture for similar gesture recognition. The proposed model firstly combines similar gesture pairs into one class, and classify them along with all other class, as a stage-1 classification. In stage-2, similar gesture pairs are classified individually, which reduces the problem to binary classification. We apply and evaluate the developed models to recognize the similar human actions on the HMDB51 dataset. The result shows that the proposed model can achieve high performance in comparison to the state-of-the-art methods.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"204 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2018.8615804","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recognizing human actions from the video streams has become one of the very popular research areas in computer vision and deep learning in the recent years. Action recognition is wildly used in different scenarios in real life, such as surveillance, robotics, healthcare, video indexing and human-computer interaction. The challenges and complexity involved in developing a video-based human action recognition system are manifold. In particular, recognizing actions with similar gestures and describing complex actions is a very challenging problem. To address these issues, we study the problem of classifying human actions using Convolutional Neural Networks (CNN) and develop a hierarchical 3DCNN architecture for similar gesture recognition. The proposed model firstly combines similar gesture pairs into one class, and classify them along with all other class, as a stage-1 classification. In stage-2, similar gesture pairs are classified individually, which reduces the problem to binary classification. We apply and evaluate the developed models to recognize the similar human actions on the HMDB51 dataset. The result shows that the proposed model can achieve high performance in comparison to the state-of-the-art methods.

查看原文本刊更多论文

基于层次分类方法的RGB视频相似手势识别

从视频流中识别人类行为已成为近年来计算机视觉和深度学习领域的热门研究领域之一。动作识别广泛应用于现实生活中的不同场景，如监控、机器人、医疗保健、视频索引和人机交互。开发基于视频的人体动作识别系统所涉及的挑战和复杂性是多方面的。特别是，识别具有相似手势的动作和描述复杂的动作是一个非常具有挑战性的问题。为了解决这些问题，我们研究了使用卷积神经网络(CNN)对人类行为进行分类的问题，并开发了用于类似手势识别的分层3DCNN架构。该模型首先将相似的手势组合成一个类别，并将其与所有其他类别一起分类，作为第一阶段的分类。在第二阶段，对相似的手势对进行单独分类，将问题简化为二值分类。我们应用和评估了开发的模型来识别HMDB51数据集上类似的人类活动。结果表明，与现有方法相比，该模型具有较高的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 Digital Image Computing: Techniques and Applications (DICTA)

自引率

0.00%

发文量