{"title":"基于余弦相似度IoU的目标检测算法","authors":"Sugang Ma, Ningbo Li, Peng Guansheng, Yanping Chen, Wang Ying, Zhiqiang Hou","doi":"10.1109/NaNA56854.2022.00077","DOIUrl":null,"url":null,"abstract":"Aiming at the problem that the traditional IoU-NMS algorithm has poor filtering of redundant boxes with high confidence scores in RetinaNet, an object detection algorithm based on cosine similarity IoU is proposed. Based on the original IoU calculation method, the cosine similarity between the detection boxes is calculated by the vector to better evaluate the similarity between them, which removes redundant boxes with high confidence scores and retain more accurate detection boxes. Meanwhile, CBAM is added to the ResNet-50 network to extract richer semantic information. the detection accuracy of the PASCAL VOC dataset reaches 80.6%, which is 2.1% higher than the benchmark algorithm.","PeriodicalId":113743,"journal":{"name":"2022 International Conference on Networking and Network Applications (NaNA)","volume":"19 10","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Object detection algorithm based on cosine similarity IoU\",\"authors\":\"Sugang Ma, Ningbo Li, Peng Guansheng, Yanping Chen, Wang Ying, Zhiqiang Hou\",\"doi\":\"10.1109/NaNA56854.2022.00077\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problem that the traditional IoU-NMS algorithm has poor filtering of redundant boxes with high confidence scores in RetinaNet, an object detection algorithm based on cosine similarity IoU is proposed. Based on the original IoU calculation method, the cosine similarity between the detection boxes is calculated by the vector to better evaluate the similarity between them, which removes redundant boxes with high confidence scores and retain more accurate detection boxes. Meanwhile, CBAM is added to the ResNet-50 network to extract richer semantic information. the detection accuracy of the PASCAL VOC dataset reaches 80.6%, which is 2.1% higher than the benchmark algorithm.\",\"PeriodicalId\":113743,\"journal\":{\"name\":\"2022 International Conference on Networking and Network Applications (NaNA)\",\"volume\":\"19 10\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Networking and Network Applications (NaNA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NaNA56854.2022.00077\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Networking and Network Applications (NaNA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NaNA56854.2022.00077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Object detection algorithm based on cosine similarity IoU
Aiming at the problem that the traditional IoU-NMS algorithm has poor filtering of redundant boxes with high confidence scores in RetinaNet, an object detection algorithm based on cosine similarity IoU is proposed. Based on the original IoU calculation method, the cosine similarity between the detection boxes is calculated by the vector to better evaluate the similarity between them, which removes redundant boxes with high confidence scores and retain more accurate detection boxes. Meanwhile, CBAM is added to the ResNet-50 network to extract richer semantic information. the detection accuracy of the PASCAL VOC dataset reaches 80.6%, which is 2.1% higher than the benchmark algorithm.