Batch Image Processing in Facial Detection Applications

Mousa Al-Qawasmi, N. Mekhiel, Kwame Bannor, Blessvin Christer Devakumar
{"title":"Batch Image Processing in Facial Detection Applications","authors":"Mousa Al-Qawasmi, N. Mekhiel, Kwame Bannor, Blessvin Christer Devakumar","doi":"10.1109/ICCIKE51210.2021.9410789","DOIUrl":null,"url":null,"abstract":"Batch image processing for facial detection involves running a facial detection algorithm, in parallel, on batches containing multiple images rather than serially on a sequence consisting of single images. Batch image processing is crucial in live-video facial detection applications where the real-time processing of many frames is required. The performance of a facial detection application can be drastically improved when facial detection is done in parallel on batches containing multiple images. In this work, we analyze the performance gain due to running a GPU-based facial detection algorithm, in parallel, on batches of images versus the performance of running the GPU-based facial detection algorithm serially on a sequence of single images. We vary the number of images in which faces are to-be detected from 128 images to 1024 images. For each of the prior mentioned image sets, we measure the performance when the number of images per detection batch is varied from 1 image per batch (sequential) to 1024 images per batch in multiples of 2. We find that the technique of batch image processing improves the performance of a face detection application by approximately 10-11x on to-be-detected image sets consisting of 128, 256, and 1024 images. This performance improvement is attributed to a reduction in the communication overhead between the host CPU and GPU occurring on the PCI bus. Moreover, the technique of batch image processing enables the utilization of more of the GPU’s resources that are left underutilized when GPU-based facial detection is done serially on a sequence of single images.","PeriodicalId":254711,"journal":{"name":"2021 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIKE51210.2021.9410789","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Batch image processing for facial detection involves running a facial detection algorithm, in parallel, on batches containing multiple images rather than serially on a sequence consisting of single images. Batch image processing is crucial in live-video facial detection applications where the real-time processing of many frames is required. The performance of a facial detection application can be drastically improved when facial detection is done in parallel on batches containing multiple images. In this work, we analyze the performance gain due to running a GPU-based facial detection algorithm, in parallel, on batches of images versus the performance of running the GPU-based facial detection algorithm serially on a sequence of single images. We vary the number of images in which faces are to-be detected from 128 images to 1024 images. For each of the prior mentioned image sets, we measure the performance when the number of images per detection batch is varied from 1 image per batch (sequential) to 1024 images per batch in multiples of 2. We find that the technique of batch image processing improves the performance of a face detection application by approximately 10-11x on to-be-detected image sets consisting of 128, 256, and 1024 images. This performance improvement is attributed to a reduction in the communication overhead between the host CPU and GPU occurring on the PCI bus. Moreover, the technique of batch image processing enables the utilization of more of the GPU’s resources that are left underutilized when GPU-based facial detection is done serially on a sequence of single images.
批处理图像处理在人脸检测中的应用
用于面部检测的批处理图像处理涉及在包含多个图像的批上并行运行面部检测算法,而不是在由单个图像组成的序列上串行运行。批量图像处理在实时视频人脸检测应用中是至关重要的,其中需要对许多帧进行实时处理。当对包含多个图像的批处理并行进行面部检测时,可以大大提高面部检测应用程序的性能。在这项工作中,我们分析了在批量图像上并行运行基于gpu的面部检测算法与在单个图像序列上串行运行基于gpu的面部检测算法的性能增益。我们将检测人脸的图像数量从128张增加到1024张。对于前面提到的每个图像集,当每个检测批的图像数量从每批1个图像(顺序)到每批1024个图像(2的倍数)变化时,我们测量性能。我们发现,批处理图像处理技术将人脸检测应用程序的性能提高了大约10-11倍,待检测图像集由128、256和1024张图像组成。这种性能改进归因于主机CPU和GPU之间在PCI总线上的通信开销的减少。此外,批处理图像处理技术能够利用更多GPU的资源,当基于GPU的面部检测在一系列单个图像上串行完成时,这些资源未得到充分利用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信