聚类锚更快R-CNN提高检测结果

Han Wei-yue, L. Xiaohong
{"title":"聚类锚更快R-CNN提高检测结果","authors":"Han Wei-yue, L. Xiaohong","doi":"10.1109/ICAICA50127.2020.9182521","DOIUrl":null,"url":null,"abstract":"Object detection has made impressive progress in recent years where Faster R-CNN is the mainstream framework for region-based object detection methods. However, a single Faster R-CNN framework no longer has advantages compared with the latest detection models. So based on Faster R-CNN, a model that focuses on features, normalization methods, and anchor sizes is proposed to improve detection results. The model integrates Feature Pyramid Networks (FPN), Group Normalization (GN) with k-means clustering. FPN is used to produce a multi-scale feature representation, which enables the model to detect objects across a wide range of scales. GN addresses the problem of the small training batch size effectively. K-means clustering algorithm is used finally to determine anchor sizes of the network on the purpose of making the network do bounding-box regression more easily. Without bells and whistles, the detection model achieves state-of-the-art object detection accuracy on the MSCOCO datasets.","PeriodicalId":113564,"journal":{"name":"2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Clustering Anchor for Faster R-CNN to Improve Detection Results\",\"authors\":\"Han Wei-yue, L. Xiaohong\",\"doi\":\"10.1109/ICAICA50127.2020.9182521\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Object detection has made impressive progress in recent years where Faster R-CNN is the mainstream framework for region-based object detection methods. However, a single Faster R-CNN framework no longer has advantages compared with the latest detection models. So based on Faster R-CNN, a model that focuses on features, normalization methods, and anchor sizes is proposed to improve detection results. The model integrates Feature Pyramid Networks (FPN), Group Normalization (GN) with k-means clustering. FPN is used to produce a multi-scale feature representation, which enables the model to detect objects across a wide range of scales. GN addresses the problem of the small training batch size effectively. K-means clustering algorithm is used finally to determine anchor sizes of the network on the purpose of making the network do bounding-box regression more easily. Without bells and whistles, the detection model achieves state-of-the-art object detection accuracy on the MSCOCO datasets.\",\"PeriodicalId\":113564,\"journal\":{\"name\":\"2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAICA50127.2020.9182521\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICA50127.2020.9182521","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

近年来,目标检测取得了令人印象深刻的进展,其中Faster R-CNN是基于区域的目标检测方法的主流框架。然而,与最新的检测模型相比,单一的Faster R-CNN框架不再具有优势。因此,基于Faster R-CNN,提出了一种以特征、归一化方法和锚点大小为重点的模型,以提高检测结果。该模型集成了特征金字塔网络(FPN)、群归一化(GN)和k-means聚类。FPN用于产生多尺度特征表示,这使得模型能够在大范围的尺度上检测物体。GN有效地解决了训练批大小小的问题。最后使用K-means聚类算法确定网络的锚大小,使网络更容易进行边界盒回归。该检测模型在MSCOCO数据集上实现了最先进的目标检测精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Clustering Anchor for Faster R-CNN to Improve Detection Results
Object detection has made impressive progress in recent years where Faster R-CNN is the mainstream framework for region-based object detection methods. However, a single Faster R-CNN framework no longer has advantages compared with the latest detection models. So based on Faster R-CNN, a model that focuses on features, normalization methods, and anchor sizes is proposed to improve detection results. The model integrates Feature Pyramid Networks (FPN), Group Normalization (GN) with k-means clustering. FPN is used to produce a multi-scale feature representation, which enables the model to detect objects across a wide range of scales. GN addresses the problem of the small training batch size effectively. K-means clustering algorithm is used finally to determine anchor sizes of the network on the purpose of making the network do bounding-box regression more easily. Without bells and whistles, the detection model achieves state-of-the-art object detection accuracy on the MSCOCO datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信