{"title":"基于高效轻量级卷积神经网络的手部检测器","authors":"Duy-Linh Nguyen, M. D. Putro, K. Jo","doi":"10.23919/ICCAS50221.2020.9268320","DOIUrl":null,"url":null,"abstract":"Hand detection and recognition topic has been studied since the last century and is particularly concerned with the development of machine learning today. Inspired by the benefits of a convolution neural network (CNN), this paper proposed an efficient and lightweight architecture to detect the location of hand in the images. This network is deployed with two main blocks which are the feature extraction and the detection block. The feature extraction block starts by convolution layers, CReLU (Concatenated Rectified Linear Unit) module, and max pooling layers alternately. After that, the six inception modules are used and final by four convolution layers. The detection block is constructed by three blocks of two-sibling convolution layers using for classification and regression. The experiment was trained on the combination of EgoHands and Hand dataset. As evaluation, the detector was tested on Egohands test dataset with the results achieved 93.32% of AP (Average Precision). In addition, the speed was tested in real-time by 33.87 fps (frames per second) on Intel Core I7-4770 CPU @ 3.40 GHz.","PeriodicalId":6732,"journal":{"name":"2020 20th International Conference on Control, Automation and Systems (ICCAS)","volume":"28 1","pages":"432-436"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Hand Detector based on Efficient and Lighweight Convolutional Neural Network\",\"authors\":\"Duy-Linh Nguyen, M. D. Putro, K. Jo\",\"doi\":\"10.23919/ICCAS50221.2020.9268320\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hand detection and recognition topic has been studied since the last century and is particularly concerned with the development of machine learning today. Inspired by the benefits of a convolution neural network (CNN), this paper proposed an efficient and lightweight architecture to detect the location of hand in the images. This network is deployed with two main blocks which are the feature extraction and the detection block. The feature extraction block starts by convolution layers, CReLU (Concatenated Rectified Linear Unit) module, and max pooling layers alternately. After that, the six inception modules are used and final by four convolution layers. The detection block is constructed by three blocks of two-sibling convolution layers using for classification and regression. The experiment was trained on the combination of EgoHands and Hand dataset. As evaluation, the detector was tested on Egohands test dataset with the results achieved 93.32% of AP (Average Precision). In addition, the speed was tested in real-time by 33.87 fps (frames per second) on Intel Core I7-4770 CPU @ 3.40 GHz.\",\"PeriodicalId\":6732,\"journal\":{\"name\":\"2020 20th International Conference on Control, Automation and Systems (ICCAS)\",\"volume\":\"28 1\",\"pages\":\"432-436\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 20th International Conference on Control, Automation and Systems (ICCAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ICCAS50221.2020.9268320\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 20th International Conference on Control, Automation and Systems (ICCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICCAS50221.2020.9268320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
手部检测与识别这一课题自上个世纪以来一直被研究,尤其关注当今机器学习的发展。受卷积神经网络(CNN)优势的启发,本文提出了一种高效、轻量级的图像手部位置检测体系结构。该网络由特征提取和检测两个主要块组成。特征提取块由卷积层、CReLU (concatated Rectified Linear Unit)模块和最大池化层交替进行。然后,使用六个初始模块,最后使用四个卷积层。检测块由两个兄弟卷积层的三个块构成,用于分类和回归。实验是在EgoHands和Hand数据集的组合上进行训练的。作为评价,该检测器在Egohands测试数据集上进行了测试,结果达到了93.32%的AP (Average Precision)。此外,在英特尔酷睿I7-4770 CPU @ 3.40 GHz上实时测试了33.87 fps(每秒帧数)的速度。
Hand Detector based on Efficient and Lighweight Convolutional Neural Network
Hand detection and recognition topic has been studied since the last century and is particularly concerned with the development of machine learning today. Inspired by the benefits of a convolution neural network (CNN), this paper proposed an efficient and lightweight architecture to detect the location of hand in the images. This network is deployed with two main blocks which are the feature extraction and the detection block. The feature extraction block starts by convolution layers, CReLU (Concatenated Rectified Linear Unit) module, and max pooling layers alternately. After that, the six inception modules are used and final by four convolution layers. The detection block is constructed by three blocks of two-sibling convolution layers using for classification and regression. The experiment was trained on the combination of EgoHands and Hand dataset. As evaluation, the detector was tested on Egohands test dataset with the results achieved 93.32% of AP (Average Precision). In addition, the speed was tested in real-time by 33.87 fps (frames per second) on Intel Core I7-4770 CPU @ 3.40 GHz.