{"title":"基于CUDA的人脸检测系统的高效并行实现","authors":"Hana Ben Fredj, Souhir Sghaier, C. Souani","doi":"10.1109/ATSIP49331.2020.9231723","DOIUrl":null,"url":null,"abstract":"Face detection is a highly efficient component in diverse domains such as security surveillance. Especially, the Viola-Jones algorithm has achieved significant performances in the field of detection face. In the last years, graphics processors have fast become the mainstay to solve the problem of detection face applications and to accelerate data parallel computing. This is due to their flexibility, and in particular, to the single-instruction, multiple-data execution model exploited for streaming processors by a Graphics Processing Unit (GPU). Therefore, in this paper, the researchers develop a robust face detection implementation based on the GPU component. The implementation has been optimized by following up a strategy to use the different memory resources in GPU and the warp scheduler technique, so as to accelerate the access to the memory, with better exploitation of resources proved by GPU. The results display that the suggested method is very important and consumes less execution time compared with the standard implementation and sequential implementation.","PeriodicalId":384018,"journal":{"name":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An Efficient Parallel Implementation of Face Detection System Using CUDA\",\"authors\":\"Hana Ben Fredj, Souhir Sghaier, C. Souani\",\"doi\":\"10.1109/ATSIP49331.2020.9231723\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Face detection is a highly efficient component in diverse domains such as security surveillance. Especially, the Viola-Jones algorithm has achieved significant performances in the field of detection face. In the last years, graphics processors have fast become the mainstay to solve the problem of detection face applications and to accelerate data parallel computing. This is due to their flexibility, and in particular, to the single-instruction, multiple-data execution model exploited for streaming processors by a Graphics Processing Unit (GPU). Therefore, in this paper, the researchers develop a robust face detection implementation based on the GPU component. The implementation has been optimized by following up a strategy to use the different memory resources in GPU and the warp scheduler technique, so as to accelerate the access to the memory, with better exploitation of resources proved by GPU. The results display that the suggested method is very important and consumes less execution time compared with the standard implementation and sequential implementation.\",\"PeriodicalId\":384018,\"journal\":{\"name\":\"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"volume\":\"104 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ATSIP49331.2020.9231723\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATSIP49331.2020.9231723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Efficient Parallel Implementation of Face Detection System Using CUDA
Face detection is a highly efficient component in diverse domains such as security surveillance. Especially, the Viola-Jones algorithm has achieved significant performances in the field of detection face. In the last years, graphics processors have fast become the mainstay to solve the problem of detection face applications and to accelerate data parallel computing. This is due to their flexibility, and in particular, to the single-instruction, multiple-data execution model exploited for streaming processors by a Graphics Processing Unit (GPU). Therefore, in this paper, the researchers develop a robust face detection implementation based on the GPU component. The implementation has been optimized by following up a strategy to use the different memory resources in GPU and the warp scheduler technique, so as to accelerate the access to the memory, with better exploitation of resources proved by GPU. The results display that the suggested method is very important and consumes less execution time compared with the standard implementation and sequential implementation.