Gui-Hong Shi, Jiezhong Huang, Junhua Zhang, Guoqin Tan, Gaoli Sang
{"title":"Combined Channel and Spatial Attention for YOLOv5 during Target Detection","authors":"Gui-Hong Shi, Jiezhong Huang, Junhua Zhang, Guoqin Tan, Gaoli Sang","doi":"10.1109/PRML52754.2021.9520728","DOIUrl":null,"url":null,"abstract":"Accuracy target detection can benefit many target detection applications. The latest YOLOv5 method has faster detection speed and better accuracy in target detection. However, there are still insufficient on bounding box positioning and it is difficult to distinguish overlapping objects. This paper proposes an improved Attention-YOLO v5, which adds channel attention and spatial attention mechanisms to the feature extraction. Furthermore, a squeeze and excitation(SE) module is applied to improve the resolution of the input image. Experiments on two public datasets show that our proposed method effectively reduces the positioning error of the bounding box and improves the detection accuracy. The accuracy on INRIA and PnPLO datasets are 97.9% and 96.2%.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRML52754.2021.9520728","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Accuracy target detection can benefit many target detection applications. The latest YOLOv5 method has faster detection speed and better accuracy in target detection. However, there are still insufficient on bounding box positioning and it is difficult to distinguish overlapping objects. This paper proposes an improved Attention-YOLO v5, which adds channel attention and spatial attention mechanisms to the feature extraction. Furthermore, a squeeze and excitation(SE) module is applied to improve the resolution of the input image. Experiments on two public datasets show that our proposed method effectively reduces the positioning error of the bounding box and improves the detection accuracy. The accuracy on INRIA and PnPLO datasets are 97.9% and 96.2%.