基于改进的Faster R-RDN算法的地标建筑检测与识别

International Conference on Algorithms, Microchips and Network Applications Pub Date : 2022-05-06 DOI:10.1117/12.2636502

Wu Jun, Kai Yan, ZiBo Huang, Haiyan Tan, Xiaofang Tu, Chengjun Zhu

{"title":"基于改进的Faster R-RDN算法的地标建筑检测与识别","authors":"Wu Jun, Kai Yan, ZiBo Huang, Haiyan Tan, Xiaofang Tu, Chengjun Zhu","doi":"10.1117/12.2636502","DOIUrl":null,"url":null,"abstract":"This paper proposes an improved Faster R-DRN (Dense Residual Network, DRN) algorithm, which is based on Faster R-CNN, using densely connected residual network DRNet to replace VGG network. This algorithm is suitable for special scenes of building recognition. It has a residual network and a deep convolution residual network structure, which can efficiently perform image detection, classification and recognition. This design optimizes the problem of algorithm overfitting due to the increase of network depth. In this paper, a comprehensive sample data set for various landmark buildings is established, and samples with different weather, different lighting, and different angles are taken to effectively improve the resistance of the training model. Combined with the optimization of the network structure and the training of targeted data sets, the final feature block diagram generated by DRNet not only does not lose the lowlevel edge texture information, but also reuses the low-level feature block diagrams in the deep convolutional network to make the fused feature block Richer feature information effectively improves the model's recognition rate for photos taken in complex environments. The experimental results show that the accuracy of this method for predicting landmark buildings can reach 82.0% of mAP, and the recognition performance of images taken in complex environments is excellent.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A landmark building detection and recognition based on improved Faster R-RDN algorithm\",\"authors\":\"Wu Jun, Kai Yan, ZiBo Huang, Haiyan Tan, Xiaofang Tu, Chengjun Zhu\",\"doi\":\"10.1117/12.2636502\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes an improved Faster R-DRN (Dense Residual Network, DRN) algorithm, which is based on Faster R-CNN, using densely connected residual network DRNet to replace VGG network. This algorithm is suitable for special scenes of building recognition. It has a residual network and a deep convolution residual network structure, which can efficiently perform image detection, classification and recognition. This design optimizes the problem of algorithm overfitting due to the increase of network depth. In this paper, a comprehensive sample data set for various landmark buildings is established, and samples with different weather, different lighting, and different angles are taken to effectively improve the resistance of the training model. Combined with the optimization of the network structure and the training of targeted data sets, the final feature block diagram generated by DRNet not only does not lose the lowlevel edge texture information, but also reuses the low-level feature block diagrams in the deep convolutional network to make the fused feature block Richer feature information effectively improves the model's recognition rate for photos taken in complex environments. The experimental results show that the accuracy of this method for predicting landmark buildings can reach 82.0% of mAP, and the recognition performance of images taken in complex environments is excellent.\",\"PeriodicalId\":342847,\"journal\":{\"name\":\"International Conference on Algorithms, Microchips and Network Applications\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Algorithms, Microchips and Network Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2636502\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Algorithms, Microchips and Network Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2636502","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文在Faster R-CNN的基础上，提出了一种改进的Faster R-DRN (Dense Residual Network, DRN)算法，使用密集连接的残差网络DRNet代替VGG网络。该算法适用于特殊场景的建筑物识别。它具有残差网络和深度卷积残差网络结构，可以有效地进行图像检测、分类和识别。本设计针对网络深度增加导致的算法过拟合问题进行了优化。本文建立了各种地标性建筑的综合样本数据集，采用不同天气、不同光照、不同角度的样本，有效提高了训练模型的阻力。结合网络结构的优化和目标数据集的训练，DRNet生成的最终特征块图不仅没有丢失底层边缘纹理信息，而且在深度卷积网络中重用底层特征块图，使融合后的特征块特征信息更加丰富，有效提高了模型对复杂环境下拍摄的照片的识别率。实验结果表明，该方法对地标性建筑的预测准确率可达到mAP的82.0%，对复杂环境下拍摄的图像具有优异的识别性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A landmark building detection and recognition based on improved Faster R-RDN algorithm

This paper proposes an improved Faster R-DRN (Dense Residual Network, DRN) algorithm, which is based on Faster R-CNN, using densely connected residual network DRNet to replace VGG network. This algorithm is suitable for special scenes of building recognition. It has a residual network and a deep convolution residual network structure, which can efficiently perform image detection, classification and recognition. This design optimizes the problem of algorithm overfitting due to the increase of network depth. In this paper, a comprehensive sample data set for various landmark buildings is established, and samples with different weather, different lighting, and different angles are taken to effectively improve the resistance of the training model. Combined with the optimization of the network structure and the training of targeted data sets, the final feature block diagram generated by DRNet not only does not lose the lowlevel edge texture information, but also reuses the low-level feature block diagrams in the deep convolutional network to make the fused feature block Richer feature information effectively improves the model's recognition rate for photos taken in complex environments. The experimental results show that the accuracy of this method for predicting landmark buildings can reach 82.0% of mAP, and the recognition performance of images taken in complex environments is excellent.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Algorithms, Microchips and Network Applications

自引率

0.00%

发文量