ABANet：用于图像分割的注意力边界感知网络

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems Pub Date : 2024-05-17 DOI:10.1111/exsy.13625

Sadjad Rezvani, Mansoor Fateh, Hossein Khosravi

{"title":"ABANet：用于图像分割的注意力边界感知网络","authors":"Sadjad Rezvani, Mansoor Fateh, Hossein Khosravi","doi":"10.1111/exsy.13625","DOIUrl":null,"url":null,"abstract":"Deep learning techniques have attained substantial progress in various face-related tasks, such as face recognition, face inpainting, and facial expression recognition. To prevent infection or the spread of the virus, wearing of masks in public places has been mandated following the COVID-19 epidemic, which has led to face occlusion and posed significant challenges for face recognition systems. Most prominent masked face recognition solutions rely on mask segmentation tasks. Therefore, segmentation can be used to mitigate the negative impacts of wearing a mask and improve recognition accuracy. Mask region segmentation suffers from two main problems: there is no standard type of masks that people wear, they come in different colours and designs, and there is no publicly available masked face dataset with appropriate ground truth for the mask region. In order to address these issues, we propose an encoder–decoder framework that utilizes a boundary-aware attention network combined with a new hybrid loss to provide a map, patch, and pixel-level supervision. We also introduce a dataset called MFSD, with 11,601 images and 12,758 masked faces for masked face segmentation. Furthermore, we compare the performance of different cutting-edge deep learning semantic segmentation models on the presented dataset. Experimental results on the MSFD dataset reveal that the suggested approach outperforms state-of-the-art, algorithms with 97.623% accuracy, 93.814% IoU, and 96.817% F1-score rate. Our dataset of masked faces with mask region labels and source code will be available online.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"41 9","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ABANet: Attention boundary-aware network for image segmentation\",\"authors\":\"Sadjad Rezvani, Mansoor Fateh, Hossein Khosravi\",\"doi\":\"10.1111/exsy.13625\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning techniques have attained substantial progress in various face-related tasks, such as face recognition, face inpainting, and facial expression recognition. To prevent infection or the spread of the virus, wearing of masks in public places has been mandated following the COVID-19 epidemic, which has led to face occlusion and posed significant challenges for face recognition systems. Most prominent masked face recognition solutions rely on mask segmentation tasks. Therefore, segmentation can be used to mitigate the negative impacts of wearing a mask and improve recognition accuracy. Mask region segmentation suffers from two main problems: there is no standard type of masks that people wear, they come in different colours and designs, and there is no publicly available masked face dataset with appropriate ground truth for the mask region. In order to address these issues, we propose an encoder–decoder framework that utilizes a boundary-aware attention network combined with a new hybrid loss to provide a map, patch, and pixel-level supervision. We also introduce a dataset called MFSD, with 11,601 images and 12,758 masked faces for masked face segmentation. Furthermore, we compare the performance of different cutting-edge deep learning semantic segmentation models on the presented dataset. Experimental results on the MSFD dataset reveal that the suggested approach outperforms state-of-the-art, algorithms with 97.623% accuracy, 93.814% IoU, and 96.817% F1-score rate. Our dataset of masked faces with mask region labels and source code will be available online.\",\"PeriodicalId\":51053,\"journal\":{\"name\":\"Expert Systems\",\"volume\":\"41 9\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/exsy.13625\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.13625","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

深度学习技术在人脸识别、人脸内画和人脸表情识别等各种人脸相关任务中取得了长足进展。COVID-19 流行后，为防止病毒感染或传播，公共场所强制要求佩戴口罩，这导致了人脸闭塞，给人脸识别系统带来了巨大挑战。大多数著名的面具人脸识别解决方案都依赖于面具分割任务。因此，分割可用于减轻佩戴面具带来的负面影响，提高识别准确率。面具区域分割存在两个主要问题：人们佩戴的面具没有标准类型，它们有不同的颜色和设计；没有公开可用的面具人脸数据集为面具区域提供适当的基本事实。为了解决这些问题，我们提出了一个编码器-解码器框架，利用边界感知注意力网络结合新的混合损失来提供地图、补丁和像素级监督。我们还引入了一个名为 MFSD 的数据集，该数据集包含 11,601 张图像和 12,758 张蒙面人脸，用于蒙面人脸分割。此外，我们还比较了不同前沿深度学习语义分割模型在该数据集上的表现。在 MSFD 数据集上的实验结果表明，建议的方法以 97.623% 的准确率、93.814% 的 IoU 和 96.817% 的 F1 分数超过了最先进的算法。我们的掩码人脸数据集、掩码区域标签和源代码将在网上公布。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

ABANet: Attention boundary-aware network for image segmentation

Deep learning techniques have attained substantial progress in various face-related tasks, such as face recognition, face inpainting, and facial expression recognition. To prevent infection or the spread of the virus, wearing of masks in public places has been mandated following the COVID-19 epidemic, which has led to face occlusion and posed significant challenges for face recognition systems. Most prominent masked face recognition solutions rely on mask segmentation tasks. Therefore, segmentation can be used to mitigate the negative impacts of wearing a mask and improve recognition accuracy. Mask region segmentation suffers from two main problems: there is no standard type of masks that people wear, they come in different colours and designs, and there is no publicly available masked face dataset with appropriate ground truth for the mask region. In order to address these issues, we propose an encoder–decoder framework that utilizes a boundary-aware attention network combined with a new hybrid loss to provide a map, patch, and pixel-level supervision. We also introduce a dataset called MFSD, with 11,601 images and 12,758 masked faces for masked face segmentation. Furthermore, we compare the performance of different cutting-edge deep learning semantic segmentation models on the presented dataset. Experimental results on the MSFD dataset reveal that the suggested approach outperforms state-of-the-art, algorithms with 97.623% accuracy, 93.814% IoU, and 96.817% F1-score rate. Our dataset of masked faces with mask region labels and source code will be available online.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Expert Systems 工程技术-计算机：理论方法

CiteScore

7.40

自引率

6.10%

发文量

266

审稿时长

24 months

期刊介绍： Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper. As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.