{"title":"基于改进的Faster R-RDN算法的地标建筑检测与识别","authors":"Wu Jun, Kai Yan, ZiBo Huang, Haiyan Tan, Xiaofang Tu, Chengjun Zhu","doi":"10.1117/12.2636502","DOIUrl":null,"url":null,"abstract":"This paper proposes an improved Faster R-DRN (Dense Residual Network, DRN) algorithm, which is based on Faster R-CNN, using densely connected residual network DRNet to replace VGG network. This algorithm is suitable for special scenes of building recognition. It has a residual network and a deep convolution residual network structure, which can efficiently perform image detection, classification and recognition. This design optimizes the problem of algorithm overfitting due to the increase of network depth. In this paper, a comprehensive sample data set for various landmark buildings is established, and samples with different weather, different lighting, and different angles are taken to effectively improve the resistance of the training model. Combined with the optimization of the network structure and the training of targeted data sets, the final feature block diagram generated by DRNet not only does not lose the lowlevel edge texture information, but also reuses the low-level feature block diagrams in the deep convolutional network to make the fused feature block Richer feature information effectively improves the model's recognition rate for photos taken in complex environments. The experimental results show that the accuracy of this method for predicting landmark buildings can reach 82.0% of mAP, and the recognition performance of images taken in complex environments is excellent.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A landmark building detection and recognition based on improved Faster R-RDN algorithm\",\"authors\":\"Wu Jun, Kai Yan, ZiBo Huang, Haiyan Tan, Xiaofang Tu, Chengjun Zhu\",\"doi\":\"10.1117/12.2636502\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes an improved Faster R-DRN (Dense Residual Network, DRN) algorithm, which is based on Faster R-CNN, using densely connected residual network DRNet to replace VGG network. This algorithm is suitable for special scenes of building recognition. It has a residual network and a deep convolution residual network structure, which can efficiently perform image detection, classification and recognition. This design optimizes the problem of algorithm overfitting due to the increase of network depth. In this paper, a comprehensive sample data set for various landmark buildings is established, and samples with different weather, different lighting, and different angles are taken to effectively improve the resistance of the training model. Combined with the optimization of the network structure and the training of targeted data sets, the final feature block diagram generated by DRNet not only does not lose the lowlevel edge texture information, but also reuses the low-level feature block diagrams in the deep convolutional network to make the fused feature block Richer feature information effectively improves the model's recognition rate for photos taken in complex environments. The experimental results show that the accuracy of this method for predicting landmark buildings can reach 82.0% of mAP, and the recognition performance of images taken in complex environments is excellent.\",\"PeriodicalId\":342847,\"journal\":{\"name\":\"International Conference on Algorithms, Microchips and Network Applications\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Algorithms, Microchips and Network Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2636502\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Algorithms, Microchips and Network Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2636502","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A landmark building detection and recognition based on improved Faster R-RDN algorithm
This paper proposes an improved Faster R-DRN (Dense Residual Network, DRN) algorithm, which is based on Faster R-CNN, using densely connected residual network DRNet to replace VGG network. This algorithm is suitable for special scenes of building recognition. It has a residual network and a deep convolution residual network structure, which can efficiently perform image detection, classification and recognition. This design optimizes the problem of algorithm overfitting due to the increase of network depth. In this paper, a comprehensive sample data set for various landmark buildings is established, and samples with different weather, different lighting, and different angles are taken to effectively improve the resistance of the training model. Combined with the optimization of the network structure and the training of targeted data sets, the final feature block diagram generated by DRNet not only does not lose the lowlevel edge texture information, but also reuses the low-level feature block diagrams in the deep convolutional network to make the fused feature block Richer feature information effectively improves the model's recognition rate for photos taken in complex environments. The experimental results show that the accuracy of this method for predicting landmark buildings can reach 82.0% of mAP, and the recognition performance of images taken in complex environments is excellent.