{"title":"Covid-19 Control: Face Mask Detection Using Deep Learning for Balanced and Unbalanced Dataset","authors":"Ademola A. Adesokan","doi":"10.5815/ijisa.2022.06.05","DOIUrl":null,"url":null,"abstract":"Facemask wearing is becoming a norm in our daily lives to curb the spread of Covid-19. Ensuring facemasks are worn correctly is a topic of concern worldwide. It could go beyond manual human control and enforcement, leading to the spread of this deadly virus and many cases globally. The main aim of wearing a facemask is to curtail the spread of the covid-19 virus, but the biggest concern of most deep learning research is about who is wearing the mask or not, and not who is incorrectly wearing the facemask while the main objective of mask wearing is to prevent the spread of the covid-19 virus. This paper compares three state-of-the- art object detection approaches: Haarcascade, Multi-task Cascaded Convolutional Networks (MTCNN), and You Only Look Once version 4 (YOLOv4) to classify who is wearing a mask, who is not wearing a mask, and most importantly, who is incorrectly wearing the mask in a real-time video stream using FPS as a benchmark to select the best model. Yolov4 got about 40 Frame Per Seconds (FPS), outperforming Haarcascade with 16 and MTCNN with 1.4. YOLOv4 was later used to compare the two datasets using Intersection over Union (IoU) and mean Average Precision (mAP) as a comparative measure; dataset2 (balanced dataset) performed better than dataset1 (unbalanced dataset). Yolov4 model on dataset2 mapped and detected images of masks worn incorrectly with one correct class label rather than giving them two label classes with uncertainty in dataset1, this work shows the advantage of having a balanced dataset for accuracy. This work would help decrease human interference in enforcing the COVID-19 face mask rules and create awareness for people who do not comply with the facemask policy of wearing it correctly. Hence, significantly reducing the spread of COVID-19.","PeriodicalId":14067,"journal":{"name":"International Journal of Intelligent Systems and Applications in Engineering","volume":"49 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Systems and Applications in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5815/ijisa.2022.06.05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0
Abstract
Facemask wearing is becoming a norm in our daily lives to curb the spread of Covid-19. Ensuring facemasks are worn correctly is a topic of concern worldwide. It could go beyond manual human control and enforcement, leading to the spread of this deadly virus and many cases globally. The main aim of wearing a facemask is to curtail the spread of the covid-19 virus, but the biggest concern of most deep learning research is about who is wearing the mask or not, and not who is incorrectly wearing the facemask while the main objective of mask wearing is to prevent the spread of the covid-19 virus. This paper compares three state-of-the- art object detection approaches: Haarcascade, Multi-task Cascaded Convolutional Networks (MTCNN), and You Only Look Once version 4 (YOLOv4) to classify who is wearing a mask, who is not wearing a mask, and most importantly, who is incorrectly wearing the mask in a real-time video stream using FPS as a benchmark to select the best model. Yolov4 got about 40 Frame Per Seconds (FPS), outperforming Haarcascade with 16 and MTCNN with 1.4. YOLOv4 was later used to compare the two datasets using Intersection over Union (IoU) and mean Average Precision (mAP) as a comparative measure; dataset2 (balanced dataset) performed better than dataset1 (unbalanced dataset). Yolov4 model on dataset2 mapped and detected images of masks worn incorrectly with one correct class label rather than giving them two label classes with uncertainty in dataset1, this work shows the advantage of having a balanced dataset for accuracy. This work would help decrease human interference in enforcing the COVID-19 face mask rules and create awareness for people who do not comply with the facemask policy of wearing it correctly. Hence, significantly reducing the spread of COVID-19.
为了遏制新冠肺炎的传播,戴口罩正成为我们日常生活中的一种常态。确保正确佩戴口罩是全世界关注的一个话题。它可能超出人工控制和强制执行的范围,导致这种致命病毒的传播和全球许多病例。戴口罩的主要目的是遏制新冠病毒的传播,但大多数深度学习研究最关心的是谁戴了口罩,而不是谁戴错了口罩,而戴口罩的主要目的是防止新冠病毒的传播。本文比较了三种最先进的目标检测方法:Haarcascade, Multi-task cascade卷积网络(MTCNN)和You Only Look Once version 4 (YOLOv4),以FPS为基准,在实时视频流中区分谁戴了面具,谁没有戴面具,最重要的是,谁错误地戴了面具,以选择最佳模型。yolo4的帧率约为每秒40帧,超过了每秒16帧的Haarcascade和每秒1.4帧的MTCNN。随后使用YOLOv4比较两个数据集,使用Intersection over Union (IoU)和mean Average Precision (mAP)作为比较度量;Dataset2(平衡数据集)的性能优于dataset1(不平衡数据集)。dataset2上的Yolov4模型用一个正确的类别标签映射和检测了不正确佩戴的面具图像,而不是在dataset1中给它们两个不确定的标签类别,这项工作显示了拥有平衡数据集的优势。这项工作将有助于减少人为干预执行COVID-19口罩规则,并提高那些不遵守正确佩戴口罩政策的人的意识。从而大大减少COVID-19的传播。