Keqiao Huang, Linyan Ling, Tianlang Tan, Jin Zhan, Zhenmeng Yue, Si Tang, Zhiyong Lin, Guiyuan Xie
{"title":"A Real Time Mask Wearing Detection Based on Lightweight CenterNet in Complex Scenes","authors":"Keqiao Huang, Linyan Ling, Tianlang Tan, Jin Zhan, Zhenmeng Yue, Si Tang, Zhiyong Lin, Guiyuan Xie","doi":"10.1145/3577117.3577130","DOIUrl":null,"url":null,"abstract":"CenterNet is a one-stage target detector based on key points with high detection accuracy. However, its backbone is Hourglass network with a large number of parameters, the recognition speed is slow and cannot be recognized in real time. In this paper, we proposes a lightweight Hourglass network based on CenterNet model for mask wearing detection. Firstly, we adopts the depth wise separable convolution network in the reverse residual block of the Hourglass network. In the upsampling and downsampling block, different stride set and two branches are used to reduce the number of model parameters and improve the detection speed. Secondly, we redefine the focal loss function which can correlate the loss values of two Hourglass networks and complement each other to improve the accuracy of difficult targets in complex environments. Finally, in order to improve the test robustness of the method, we constructed a data set of masks under different challenge scenarios. The experimental results show that the average accuracy of our method is 0.922 and the parameters are reduced to 1/25 of CenterNet, and the detection speed is increased by nearly 3 times. Our method can achieve real-time mask wearing detection in videos with better robustness, which provides practicality for deploying the network model to mobile terminals.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Advances in Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3577117.3577130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
CenterNet is a one-stage target detector based on key points with high detection accuracy. However, its backbone is Hourglass network with a large number of parameters, the recognition speed is slow and cannot be recognized in real time. In this paper, we proposes a lightweight Hourglass network based on CenterNet model for mask wearing detection. Firstly, we adopts the depth wise separable convolution network in the reverse residual block of the Hourglass network. In the upsampling and downsampling block, different stride set and two branches are used to reduce the number of model parameters and improve the detection speed. Secondly, we redefine the focal loss function which can correlate the loss values of two Hourglass networks and complement each other to improve the accuracy of difficult targets in complex environments. Finally, in order to improve the test robustness of the method, we constructed a data set of masks under different challenge scenarios. The experimental results show that the average accuracy of our method is 0.922 and the parameters are reduced to 1/25 of CenterNet, and the detection speed is increased by nearly 3 times. Our method can achieve real-time mask wearing detection in videos with better robustness, which provides practicality for deploying the network model to mobile terminals.