Extraction of building footprint using MASK-RCNN for high resolution aerial imagery

IF 2.9 4区环境科学与生态学 Q3 ENVIRONMENTAL SCIENCES

Environmental Research Communications Pub Date : 2024-07-03 DOI:10.1088/2515-7620/ad5b3d

Jenila Vincent M and Varalakshmi P

{"title":"Extraction of building footprint using MASK-RCNN for high resolution aerial imagery","authors":"Jenila Vincent M and Varalakshmi P","doi":"10.1088/2515-7620/ad5b3d","DOIUrl":null,"url":null,"abstract":"Extracting individual buildings from satellite images is crucial for various urban applications, including population estimation, urban planning, and other related fields. However, Extracting building footprints from remote sensing data is a challenging task because of scale differences, complex structures and different types of building. Addressing these issues, an approach that can efficiently detect buildings in images by generating a segmentation mask for each instance is proposed in this paper. This approach incorporates the Regional Convolutional Neural Network (MASK-RCNN), which combines Faster R-CNN for object mask prediction and boundary box recognition and was evaluated against other models like YOLOv5, YOLOv7 and YOLOv8 in a comparative study to assess its effectiveness. The findings of this study reveals that our proposed method achieved the highest accuracy in building extraction. Furthermore, we performed experiments on well-established datasets like WHU and INRIA, and our method consistently outperformed other existing methods, producing reliable results.","PeriodicalId":48496,"journal":{"name":"Environmental Research Communications","volume":"67 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Research Communications","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1088/2515-7620/ad5b3d","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Extracting individual buildings from satellite images is crucial for various urban applications, including population estimation, urban planning, and other related fields. However, Extracting building footprints from remote sensing data is a challenging task because of scale differences, complex structures and different types of building. Addressing these issues, an approach that can efficiently detect buildings in images by generating a segmentation mask for each instance is proposed in this paper. This approach incorporates the Regional Convolutional Neural Network (MASK-RCNN), which combines Faster R-CNN for object mask prediction and boundary box recognition and was evaluated against other models like YOLOv5, YOLOv7 and YOLOv8 in a comparative study to assess its effectiveness. The findings of this study reveals that our proposed method achieved the highest accuracy in building extraction. Furthermore, we performed experiments on well-established datasets like WHU and INRIA, and our method consistently outperformed other existing methods, producing reliable results.

查看原文本刊更多论文

使用 MASK-RCNN 提取高分辨率航空图像中的建筑物足迹

从卫星图像中提取单个建筑物对于各种城市应用（包括人口估计、城市规划和其他相关领域）至关重要。然而，由于尺度差异、结构复杂和建筑物类型不同，从遥感数据中提取建筑物足迹是一项具有挑战性的任务。为了解决这些问题，本文提出了一种方法，通过为每个实例生成一个分割掩码来有效检测图像中的建筑物。这种方法结合了区域卷积神经网络（MASK-RCNN），将用于对象掩码预测和边界框识别的 Faster R-CNN 结合在一起，并与 YOLOv5、YOLOv7 和 YOLOv8 等其他模型进行了对比研究，以评估其有效性。研究结果表明，我们提出的方法在建筑物提取方面达到了最高的准确率。此外，我们还在 WHU 和 INRIA 等成熟的数据集上进行了实验，结果表明我们的方法始终优于其他现有方法，结果可靠。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Environmental Research Communications ENVIRONMENTAL SCIENCES-

CiteScore

3.50

自引率

0.00%

发文量

136