M. D. Putro, Adri Priadana, Duy-Linh Nguyen, K. Jo
{"title":"A Real-time Face Detector on CPU Using Efficient Transformer","authors":"M. D. Putro, Adri Priadana, Duy-Linh Nguyen, K. Jo","doi":"10.1109/IWIS56333.2022.9920713","DOIUrl":null,"url":null,"abstract":"Face detection is a basic vision method to find the fa-ciallocation. It is usually used in the initial step of advanced facial analysis. Therefore, this approach is required to work quickly, especially on low-cost devices to support practical applications. A deep learning architecture can robustly extract the distinctive feature by employing a lot of weighted filters. However, the model produces heavy parameters and computational complexity. A transformer is a deep learning architecture that can capture the feature position relationship, which increases the detector performance. This work in this paper proposes a new efficient transformer architecture that is implemented to face detection. It can highlight the spatial information from a similarity map by utilizing a 2D-convolutional filter. This architecture generates low computation and lightweight trainable parameters that serve the proposed face detector to run fast on an inexpensive device. As a result, this proposed network achieves high performance and competitive precision with the low-cost model. Additionally, the proposed transformer module does not significantly add computation and parameters that can run fast at 95 frames per second on a Core is CPU.","PeriodicalId":340399,"journal":{"name":"2022 International Workshop on Intelligent Systems (IWIS)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Workshop on Intelligent Systems (IWIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWIS56333.2022.9920713","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Face detection is a basic vision method to find the fa-ciallocation. It is usually used in the initial step of advanced facial analysis. Therefore, this approach is required to work quickly, especially on low-cost devices to support practical applications. A deep learning architecture can robustly extract the distinctive feature by employing a lot of weighted filters. However, the model produces heavy parameters and computational complexity. A transformer is a deep learning architecture that can capture the feature position relationship, which increases the detector performance. This work in this paper proposes a new efficient transformer architecture that is implemented to face detection. It can highlight the spatial information from a similarity map by utilizing a 2D-convolutional filter. This architecture generates low computation and lightweight trainable parameters that serve the proposed face detector to run fast on an inexpensive device. As a result, this proposed network achieves high performance and competitive precision with the low-cost model. Additionally, the proposed transformer module does not significantly add computation and parameters that can run fast at 95 frames per second on a Core is CPU.
人脸检测是一种基本的视觉识别方法。它通常用于高级面部分析的初始步骤。因此,这种方法需要快速工作,特别是在低成本设备上支持实际应用。深度学习架构可以通过使用大量的加权滤波器来鲁棒地提取特征。然而,该模型产生了大量的参数和计算复杂度。变压器是一种深度学习架构,可以捕获特征位置关系,从而提高检测器的性能。本文提出了一种新的高效变压器结构,并将其应用于人脸检测。该算法利用二维卷积滤波器突出相似图中的空间信息。该架构产生低计算量和轻量级可训练参数,使所提出的人脸检测器在廉价设备上快速运行。结果表明,该网络在低成本的基础上实现了高性能和具有竞争力的精度。此外,提议的变压器模块不会显著增加计算和参数,可以在Core is CPU上以每秒95帧的速度快速运行。