Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net): An application to generating building change maps

IF 3.8 Q2 ENVIRONMENTAL SCIENCES
Ehsan Khankeshizadeh , Ali Mohammadzadeh , Amin Mohsenifar , Armin Moghimi , Saied Pirasteh , Sheng Feng , Keli Hu , Jonathan Li
{"title":"Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net): An application to generating building change maps","authors":"Ehsan Khankeshizadeh ,&nbsp;Ali Mohammadzadeh ,&nbsp;Amin Mohsenifar ,&nbsp;Armin Moghimi ,&nbsp;Saied Pirasteh ,&nbsp;Sheng Feng ,&nbsp;Keli Hu ,&nbsp;Jonathan Li","doi":"10.1016/j.rsase.2024.101336","DOIUrl":null,"url":null,"abstract":"<div><p>In today's era, increasing access to very high-resolution remote sensing images (VHR-RSIs) has enhanced building detection and change assessment capabilities. These applications provide accurate urban mapping, facilitate effective land management, and support disaster assessment by delivering detailed insights into building structures and their temporal changes. This study uses a two-stage process to present a pioneering approach for generating precise building maps (BMs) and subsequent building change maps (BCMs) from VHR-RSIs. The primary question addressed by the research is how to enhance the U-Net architecture to improve its sensitivity to both high-level semantic features (HLSF) and low-level spatial features (LLSF) in the building detection task. For this purpose, in the initial stage of the method, a novel deep learning model called dual attention residual-based U-Net (DAttResU-Net) is introduced. This model incorporates two significant modifications to the conventional U-Net, enhancing its capacity to yield bi-temporal BMs. Firstly, each standard convolutional block (CB) is replaced with an optimized CB incorporating a channel-spatial attention module attuned to the building objects' crucial HLSF. Secondly, an additional attention module is integrated into the encoder-decoder path of the model, heightening the sensitivity of U-Net to vital LLSF of buildings while disregarding extraneous background spatial information during the fusion of HLSF and LLSF. In the subsequent stage, the bi-temporal BMs generated by the DAttResU-Net are subjected to a box-based class-object change detection methodology to produce accurate BCMs. The effectiveness of the proposed architecture is rigorously evaluated against state-of-the-art models in both BM and BCM generation contexts, utilizing the well-established WHU dataset for experimentation. The experimental results indicated that the DAttResU-Net model, boasting an average of <span><math><mrow><msub><mi>P</mi><mtext>FN</mtext></msub></mrow></math></span>/ <span><math><mrow><msub><mi>P</mi><mtext>FP</mtext></msub></mrow></math></span> value of 2.33/1.34 (%) surpasses the performance of the state-of-the-art models in generating bi-temporal BMs. Furthermore, the building change detection outcomes demonstrated the proficient role of the bi-temporal BMs predicted by the proposed model in leading to the most optimal BCMs, exhibiting average <span><math><mrow><msub><mi>P</mi><mtext>FN</mtext></msub></mrow></math></span>/ <span><math><mrow><msub><mi>P</mi><mtext>FP</mtext></msub></mrow></math></span> value of 2.63/8.93 (%), outperforming comparative networks. Finally, we concluded that the proposed DAttResU-Net architecture is a highly promising and applicable model for producing reliable BMs and BCMs.</p></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"36 ","pages":"Article 101336"},"PeriodicalIF":3.8000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938524002003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

In today's era, increasing access to very high-resolution remote sensing images (VHR-RSIs) has enhanced building detection and change assessment capabilities. These applications provide accurate urban mapping, facilitate effective land management, and support disaster assessment by delivering detailed insights into building structures and their temporal changes. This study uses a two-stage process to present a pioneering approach for generating precise building maps (BMs) and subsequent building change maps (BCMs) from VHR-RSIs. The primary question addressed by the research is how to enhance the U-Net architecture to improve its sensitivity to both high-level semantic features (HLSF) and low-level spatial features (LLSF) in the building detection task. For this purpose, in the initial stage of the method, a novel deep learning model called dual attention residual-based U-Net (DAttResU-Net) is introduced. This model incorporates two significant modifications to the conventional U-Net, enhancing its capacity to yield bi-temporal BMs. Firstly, each standard convolutional block (CB) is replaced with an optimized CB incorporating a channel-spatial attention module attuned to the building objects' crucial HLSF. Secondly, an additional attention module is integrated into the encoder-decoder path of the model, heightening the sensitivity of U-Net to vital LLSF of buildings while disregarding extraneous background spatial information during the fusion of HLSF and LLSF. In the subsequent stage, the bi-temporal BMs generated by the DAttResU-Net are subjected to a box-based class-object change detection methodology to produce accurate BCMs. The effectiveness of the proposed architecture is rigorously evaluated against state-of-the-art models in both BM and BCM generation contexts, utilizing the well-established WHU dataset for experimentation. The experimental results indicated that the DAttResU-Net model, boasting an average of PFN/ PFP value of 2.33/1.34 (%) surpasses the performance of the state-of-the-art models in generating bi-temporal BMs. Furthermore, the building change detection outcomes demonstrated the proficient role of the bi-temporal BMs predicted by the proposed model in leading to the most optimal BCMs, exhibiting average PFN/ PFP value of 2.63/8.93 (%), outperforming comparative networks. Finally, we concluded that the proposed DAttResU-Net architecture is a highly promising and applicable model for producing reliable BMs and BCMs.

使用基于残差的新型双重注意 U-Net (DAttResU-Net) 在 VHR 遥感图像中检测建筑物:应用于生成建筑物变化图
在当今时代,越来越多的人能够获取甚高分辨率遥感图像(VHR-RSIs),从而提高了建筑物探测和变化评估能力。这些应用提供了准确的城市地图,促进了有效的土地管理,并通过详细了解建筑结构及其时间变化来支持灾害评估。本研究采用两阶段流程,提出了一种从 VHR-RSI 生成精确建筑物地图(BM)和随后的建筑物变化地图(BCM)的开创性方法。研究解决的主要问题是如何增强 U-Net 架构,以提高其在建筑物检测任务中对高层语义特征 (HLSF) 和低层空间特征 (LLSF) 的灵敏度。为此,在该方法的初始阶段,引入了一种名为基于双注意残差的 U-Net (DAttResU-Net)的新型深度学习模型。该模型对传统的 U-Net 进行了两项重大修改,增强了其生成双时态 BM 的能力。首先,每个标准卷积块(CB)都被一个优化的 CB 所取代,该 CB 包含一个通道空间注意模块,与建筑对象的关键 HLSF 相匹配。其次,在模型的编码器-解码器路径中集成了一个额外的关注模块,提高了 U-Net 对建筑物重要的 LLSF 的灵敏度,同时在融合 HLSF 和 LLSF 的过程中忽略了无关的背景空间信息。在随后的阶段,DAttResU-Net 生成的双时态 BM 将通过基于盒的类对象变化检测方法生成精确的 BCM。利用成熟的 WHU 数据集进行实验,在生成 BM 和 BCM 的情况下,对照最先进的模型对所提出的架构的有效性进行了严格评估。实验结果表明,DAttResU-Net 模型的 PFN/ PFP 平均值为 2.33/1.34(%),在生成双时态 BM 方面的性能超过了最先进的模型。此外,建筑物变化检测结果表明,拟议模型预测的双时态 BM 在生成最佳 BCM 方面发挥了重要作用,其平均 PFN/ PFP 值为 2.63/8.93(%),优于比较网络。最后,我们得出结论,所提出的 DAttResU-Net 架构是一种非常有前途且适用的模型,可用于生成可靠的 BM 和 BCM。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.00
自引率
8.50%
发文量
204
审稿时长
65 days
期刊介绍: The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信