Hao Huijun, Ye Rong-hua, Chen Zhongyu, Zheng Zhong-long
{"title":"Multi-scale Pyramid Feature Maps for Object Detection","authors":"Hao Huijun, Ye Rong-hua, Chen Zhongyu, Zheng Zhong-long","doi":"10.1109/DCABES.2017.59","DOIUrl":null,"url":null,"abstract":"this paper presents how we can achieve the state-of-the-art accuracy in multi-scale objection detection task while adopting and combining the recent technical innovation in deep learning. Following the common pipeline of CNN feature extraction, we mainly design the architecture of feature extraction which exploits the idea of feature pyramid. We further add an extra 1*1 convolution layer to benefit feature extraction, via the batch normalization. In addition, the designed network architecture for feature extraction combines low-resolution and high-resolution feature layers to predict the category of the object in images. The new architecture is trained with the help of batch normalization, mean pooling based on plateau detection. The proposed architecture shows competitive results compared to some state-of-the-art algorithms both in accuracy and in speed on some datasets.","PeriodicalId":446641,"journal":{"name":"2017 16th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES)","volume":"307 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 16th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCABES.2017.59","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
this paper presents how we can achieve the state-of-the-art accuracy in multi-scale objection detection task while adopting and combining the recent technical innovation in deep learning. Following the common pipeline of CNN feature extraction, we mainly design the architecture of feature extraction which exploits the idea of feature pyramid. We further add an extra 1*1 convolution layer to benefit feature extraction, via the batch normalization. In addition, the designed network architecture for feature extraction combines low-resolution and high-resolution feature layers to predict the category of the object in images. The new architecture is trained with the help of batch normalization, mean pooling based on plateau detection. The proposed architecture shows competitive results compared to some state-of-the-art algorithms both in accuracy and in speed on some datasets.