{"title":"基于多任务神经网络的人机交互识别","authors":"W. Yan, Yue Gao, Qiming Liu","doi":"10.1109/ISASS.2019.8757767","DOIUrl":null,"url":null,"abstract":"Human behavior recognition is a popular research area in the field of computer vision and has been studied due to its important applications such as visual surveillance and video retrieval. In this paper, we propose a new approach for recognizing human-object interaction actions based on multitask 2D convolutional neural network, which combines human body motion, human hand motion and object recognition network. By using RGBD camera and digital gloves, refined recognition of human body and hand movements are collected and learned. In addition, a new object recognition network based on YOLOv3 is introduced which increases the accuracy of predicting human-object interaction labels. We designed eight representative actions and built our own data set containing body and accurate hand motions. In our experiment, the accuracy of recognizing interactive actions reached 93%, which shows the correctness and effectiveness of the multitasking framework we propose.","PeriodicalId":359959,"journal":{"name":"2019 3rd International Symposium on Autonomous Systems (ISAS)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Human-object Interaction Recognition Using Multitask Neural Network\",\"authors\":\"W. Yan, Yue Gao, Qiming Liu\",\"doi\":\"10.1109/ISASS.2019.8757767\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human behavior recognition is a popular research area in the field of computer vision and has been studied due to its important applications such as visual surveillance and video retrieval. In this paper, we propose a new approach for recognizing human-object interaction actions based on multitask 2D convolutional neural network, which combines human body motion, human hand motion and object recognition network. By using RGBD camera and digital gloves, refined recognition of human body and hand movements are collected and learned. In addition, a new object recognition network based on YOLOv3 is introduced which increases the accuracy of predicting human-object interaction labels. We designed eight representative actions and built our own data set containing body and accurate hand motions. In our experiment, the accuracy of recognizing interactive actions reached 93%, which shows the correctness and effectiveness of the multitasking framework we propose.\",\"PeriodicalId\":359959,\"journal\":{\"name\":\"2019 3rd International Symposium on Autonomous Systems (ISAS)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 3rd International Symposium on Autonomous Systems (ISAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISASS.2019.8757767\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 3rd International Symposium on Autonomous Systems (ISAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISASS.2019.8757767","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Human-object Interaction Recognition Using Multitask Neural Network
Human behavior recognition is a popular research area in the field of computer vision and has been studied due to its important applications such as visual surveillance and video retrieval. In this paper, we propose a new approach for recognizing human-object interaction actions based on multitask 2D convolutional neural network, which combines human body motion, human hand motion and object recognition network. By using RGBD camera and digital gloves, refined recognition of human body and hand movements are collected and learned. In addition, a new object recognition network based on YOLOv3 is introduced which increases the accuracy of predicting human-object interaction labels. We designed eight representative actions and built our own data set containing body and accurate hand motions. In our experiment, the accuracy of recognizing interactive actions reached 93%, which shows the correctness and effectiveness of the multitasking framework we propose.