{"title":"聚类锚更快R-CNN提高检测结果","authors":"Han Wei-yue, L. Xiaohong","doi":"10.1109/ICAICA50127.2020.9182521","DOIUrl":null,"url":null,"abstract":"Object detection has made impressive progress in recent years where Faster R-CNN is the mainstream framework for region-based object detection methods. However, a single Faster R-CNN framework no longer has advantages compared with the latest detection models. So based on Faster R-CNN, a model that focuses on features, normalization methods, and anchor sizes is proposed to improve detection results. The model integrates Feature Pyramid Networks (FPN), Group Normalization (GN) with k-means clustering. FPN is used to produce a multi-scale feature representation, which enables the model to detect objects across a wide range of scales. GN addresses the problem of the small training batch size effectively. K-means clustering algorithm is used finally to determine anchor sizes of the network on the purpose of making the network do bounding-box regression more easily. Without bells and whistles, the detection model achieves state-of-the-art object detection accuracy on the MSCOCO datasets.","PeriodicalId":113564,"journal":{"name":"2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Clustering Anchor for Faster R-CNN to Improve Detection Results\",\"authors\":\"Han Wei-yue, L. Xiaohong\",\"doi\":\"10.1109/ICAICA50127.2020.9182521\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Object detection has made impressive progress in recent years where Faster R-CNN is the mainstream framework for region-based object detection methods. However, a single Faster R-CNN framework no longer has advantages compared with the latest detection models. So based on Faster R-CNN, a model that focuses on features, normalization methods, and anchor sizes is proposed to improve detection results. The model integrates Feature Pyramid Networks (FPN), Group Normalization (GN) with k-means clustering. FPN is used to produce a multi-scale feature representation, which enables the model to detect objects across a wide range of scales. GN addresses the problem of the small training batch size effectively. K-means clustering algorithm is used finally to determine anchor sizes of the network on the purpose of making the network do bounding-box regression more easily. Without bells and whistles, the detection model achieves state-of-the-art object detection accuracy on the MSCOCO datasets.\",\"PeriodicalId\":113564,\"journal\":{\"name\":\"2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAICA50127.2020.9182521\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICA50127.2020.9182521","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Clustering Anchor for Faster R-CNN to Improve Detection Results
Object detection has made impressive progress in recent years where Faster R-CNN is the mainstream framework for region-based object detection methods. However, a single Faster R-CNN framework no longer has advantages compared with the latest detection models. So based on Faster R-CNN, a model that focuses on features, normalization methods, and anchor sizes is proposed to improve detection results. The model integrates Feature Pyramid Networks (FPN), Group Normalization (GN) with k-means clustering. FPN is used to produce a multi-scale feature representation, which enables the model to detect objects across a wide range of scales. GN addresses the problem of the small training batch size effectively. K-means clustering algorithm is used finally to determine anchor sizes of the network on the purpose of making the network do bounding-box regression more easily. Without bells and whistles, the detection model achieves state-of-the-art object detection accuracy on the MSCOCO datasets.