Jiaqi Wang , Guanzhou Chen , Xiaodong Zhang , Tong Wang , Xiaoliang Tan , Qingyuan Yang , Wenlin Zhou , Kun Zhu
{"title":"RoofMapNet: Utilizing geometric primitives for depicting planar building roof structure from high-resolution remote sensing imagery","authors":"Jiaqi Wang , Guanzhou Chen , Xiaodong Zhang , Tong Wang , Xiaoliang Tan , Qingyuan Yang , Wenlin Zhou , Kun Zhu","doi":"10.1016/j.jag.2025.104630","DOIUrl":null,"url":null,"abstract":"<div><div>The accurate extraction of building roof structures from aerial imagery represents a fundamental task for urban digital twin systems, facilitating critical applications such as 3D city modeling and solar potential assessment. Despite recent advancements in geospatial artificial intelligence, existing methods frequently encounter challenges posed by real-world complexities. These include structural heterogeneity caused by diverse architectural styles, discontinuities in roof structures due to occlusions from vegetation and other obstacles, and the limited generalization ability of models stemming from the scarcity of specialized annotated datasets. In this paper, we introduce an end-to-end network called RoofMapNet, specifically designed for extracting roof structures. First, we propose a strategy for roof junction extraction that integrates dynamic Gaussian heatmaps with quadratic coordinate calibration. This strategy enhances the model’s robustness in junction prediction under heterogeneous sample distribution scenarios. To address the loss or blurring of roof lines caused by occlusion and shadow, we propose an adaptive occlusion-aware module. This module employs a bidirectional mapping between geometric and feature spaces to refine candidate lines accurately, thus improving the model’s generalization ability and robustness in roof line detection. Additionally, to comprehensively evaluate the performance of roof structure detection models, we meticulously annotated a diverse, large-scale remote sensing imagery dataset for roof structure extraction, named RoofMapSet. Comprehensive evaluations on the VWB and RoofMapSet datasets demonstrate state-of-the-art performance, with mean <span><math><mrow><mi>s</mi><mi>A</mi><mi>P</mi></mrow></math></span> improvements of 4.13% and 2.85% over competitors, respectively. Further analyses confirm the resilience to varying spatial resolutions and complex occlusion patterns. Our code and data are available at: <span><span>https://github.com/CVEO/RoofMapNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"141 ","pages":"Article 104630"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843225002778","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0
Abstract
The accurate extraction of building roof structures from aerial imagery represents a fundamental task for urban digital twin systems, facilitating critical applications such as 3D city modeling and solar potential assessment. Despite recent advancements in geospatial artificial intelligence, existing methods frequently encounter challenges posed by real-world complexities. These include structural heterogeneity caused by diverse architectural styles, discontinuities in roof structures due to occlusions from vegetation and other obstacles, and the limited generalization ability of models stemming from the scarcity of specialized annotated datasets. In this paper, we introduce an end-to-end network called RoofMapNet, specifically designed for extracting roof structures. First, we propose a strategy for roof junction extraction that integrates dynamic Gaussian heatmaps with quadratic coordinate calibration. This strategy enhances the model’s robustness in junction prediction under heterogeneous sample distribution scenarios. To address the loss or blurring of roof lines caused by occlusion and shadow, we propose an adaptive occlusion-aware module. This module employs a bidirectional mapping between geometric and feature spaces to refine candidate lines accurately, thus improving the model’s generalization ability and robustness in roof line detection. Additionally, to comprehensively evaluate the performance of roof structure detection models, we meticulously annotated a diverse, large-scale remote sensing imagery dataset for roof structure extraction, named RoofMapSet. Comprehensive evaluations on the VWB and RoofMapSet datasets demonstrate state-of-the-art performance, with mean improvements of 4.13% and 2.85% over competitors, respectively. Further analyses confirm the resilience to varying spatial resolutions and complex occlusion patterns. Our code and data are available at: https://github.com/CVEO/RoofMapNet.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.