{"title":"一种基于增强R-FCN和支持向量机的目标检测算法","authors":"Cong Xu, Jiahao Fan, Lin Liu","doi":"10.1109/TOCS50858.2020.9339689","DOIUrl":null,"url":null,"abstract":"Object detection is an extremely important part of computer vision. However, the object detection result of R-FCN is not good enough in terms of speed and accuracy. In this paper, a novel architecture called Enhanced R-FCN (ER-FCN) is proposed for object detection. Two improvements are presented in ER-FCN. Firstly, novel anchor boxes, 3 scales with box areas of 5122, 2562 and 1282 pixels, and 3 aspect ratios of 0.618:1, 1:1 and 1:0.618, are designed to suit the different scales object detection in RPN. Hence, the performance of object localization and detection speed are increased. Secondly, since the softmax classifier is not optimal to deal with the binary classification problem, a Whale Optimization Algorithm based on support vector machine, termed WOA-SVM, is introduced to improve the accuracy of classification. Extensive experimental results on PASCAL VOC 2007 and PASCAL VOC 2012 datasets show that the mean average precision of ER-FCN is improved by 3.9% compared with that of R-FCN.","PeriodicalId":373862,"journal":{"name":"2020 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A novel object detection algorithm based on enhanced R-FCN and SVM\",\"authors\":\"Cong Xu, Jiahao Fan, Lin Liu\",\"doi\":\"10.1109/TOCS50858.2020.9339689\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Object detection is an extremely important part of computer vision. However, the object detection result of R-FCN is not good enough in terms of speed and accuracy. In this paper, a novel architecture called Enhanced R-FCN (ER-FCN) is proposed for object detection. Two improvements are presented in ER-FCN. Firstly, novel anchor boxes, 3 scales with box areas of 5122, 2562 and 1282 pixels, and 3 aspect ratios of 0.618:1, 1:1 and 1:0.618, are designed to suit the different scales object detection in RPN. Hence, the performance of object localization and detection speed are increased. Secondly, since the softmax classifier is not optimal to deal with the binary classification problem, a Whale Optimization Algorithm based on support vector machine, termed WOA-SVM, is introduced to improve the accuracy of classification. Extensive experimental results on PASCAL VOC 2007 and PASCAL VOC 2012 datasets show that the mean average precision of ER-FCN is improved by 3.9% compared with that of R-FCN.\",\"PeriodicalId\":373862,\"journal\":{\"name\":\"2020 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TOCS50858.2020.9339689\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TOCS50858.2020.9339689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A novel object detection algorithm based on enhanced R-FCN and SVM
Object detection is an extremely important part of computer vision. However, the object detection result of R-FCN is not good enough in terms of speed and accuracy. In this paper, a novel architecture called Enhanced R-FCN (ER-FCN) is proposed for object detection. Two improvements are presented in ER-FCN. Firstly, novel anchor boxes, 3 scales with box areas of 5122, 2562 and 1282 pixels, and 3 aspect ratios of 0.618:1, 1:1 and 1:0.618, are designed to suit the different scales object detection in RPN. Hence, the performance of object localization and detection speed are increased. Secondly, since the softmax classifier is not optimal to deal with the binary classification problem, a Whale Optimization Algorithm based on support vector machine, termed WOA-SVM, is introduced to improve the accuracy of classification. Extensive experimental results on PASCAL VOC 2007 and PASCAL VOC 2012 datasets show that the mean average precision of ER-FCN is improved by 3.9% compared with that of R-FCN.