{"title":"基于可变形卷积核的YOLO目标检测算法","authors":"Hui Wang, Shuai Zhang, Lijun Yu, Ce Shi","doi":"10.1109/ICMA52036.2021.9512626","DOIUrl":null,"url":null,"abstract":"In order to solve the problem of inaccurate detection of targets with large scale and shape changes and small targets by the YOLO target detection algorithm, a YOLO target detection algorithm with deformable convolution kernel is proposed. The algorithm adds deformable convolution to the network, and designs three progressive embedding schemes, so that the network can adaptively change the receptive field of feature points according to the shape of the target, thereby extracting features more effectively and improving detection accuracy; by adjusting the backbone network the structure reduces the calculation amount of the algorithm. Tested on the VOC data set, the results show that the algorithm can effectively improve the accuracy of target detection, and the detection speed meets the real-time requirements.","PeriodicalId":339025,"journal":{"name":"2021 IEEE International Conference on Mechatronics and Automation (ICMA)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"YOLO Target Detection Algorithm with Deformable Convolution Kernel\",\"authors\":\"Hui Wang, Shuai Zhang, Lijun Yu, Ce Shi\",\"doi\":\"10.1109/ICMA52036.2021.9512626\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to solve the problem of inaccurate detection of targets with large scale and shape changes and small targets by the YOLO target detection algorithm, a YOLO target detection algorithm with deformable convolution kernel is proposed. The algorithm adds deformable convolution to the network, and designs three progressive embedding schemes, so that the network can adaptively change the receptive field of feature points according to the shape of the target, thereby extracting features more effectively and improving detection accuracy; by adjusting the backbone network the structure reduces the calculation amount of the algorithm. Tested on the VOC data set, the results show that the algorithm can effectively improve the accuracy of target detection, and the detection speed meets the real-time requirements.\",\"PeriodicalId\":339025,\"journal\":{\"name\":\"2021 IEEE International Conference on Mechatronics and Automation (ICMA)\",\"volume\":\"107 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Mechatronics and Automation (ICMA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMA52036.2021.9512626\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Mechatronics and Automation (ICMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMA52036.2021.9512626","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
YOLO Target Detection Algorithm with Deformable Convolution Kernel
In order to solve the problem of inaccurate detection of targets with large scale and shape changes and small targets by the YOLO target detection algorithm, a YOLO target detection algorithm with deformable convolution kernel is proposed. The algorithm adds deformable convolution to the network, and designs three progressive embedding schemes, so that the network can adaptively change the receptive field of feature points according to the shape of the target, thereby extracting features more effectively and improving detection accuracy; by adjusting the backbone network the structure reduces the calculation amount of the algorithm. Tested on the VOC data set, the results show that the algorithm can effectively improve the accuracy of target detection, and the detection speed meets the real-time requirements.