Comparative performance analysis of simple U-Net, residual attention U-Net, and VGG16-U-Net for inventory inland water bodies

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Ali Ghaznavi , Mohammadmehdi Saberioon , Jakub Brom , Sibylle Itzerott
{"title":"Comparative performance analysis of simple U-Net, residual attention U-Net, and VGG16-U-Net for inventory inland water bodies","authors":"Ali Ghaznavi ,&nbsp;Mohammadmehdi Saberioon ,&nbsp;Jakub Brom ,&nbsp;Sibylle Itzerott","doi":"10.1016/j.acags.2023.100150","DOIUrl":null,"url":null,"abstract":"<div><p>Inland water bodies play a vital role at all scales in the terrestrial water balance and Earth’s climate variability. Thus, an inventory of inland waters is crucially important for hydrologic and ecological studies and management. Therefore, the main aim of this study was to develop a deep learning-based method for inventorying and mapping inland water bodies using the RGB band of high-resolution satellite imagery automatically and accurately.</p><p>The Sentinel-2 Harmonized dataset, together with ZABAGED-validated ground truth, was used as the main dataset for the model training step. Three different deep learning algorithms based on U-Net architecture were employed to segment inland waters, including a simple U-Net, Residual Attention U-Net, and VGG16-U-Net. All three algorithms were trained using a combination of Sentinel-2 visible bands (Red [B04; 665nm], Green [B03; 560nm], and Blue [B02; 490 nm]) at a 10-meter spatial resolution.</p><p>The Residual Attention U-Net achieved the highest computational cost due to the increased number of trainable parameters. The VGG16-U-Net had the shortest run time and the lowest number of trainable parameters, attributed to its architecture compared to the simple and Residual Attention U-Net architectures, respectively. As a result, the VGG16-U-Net provided the best segmentation results with a mean-IoU score of 0.9850, a slight improvement compared to other proposed U-Net-based architectures.</p><p>Although the accuracy of the model based on VGG16-U-Net does not make a difference from Residual Attention U-Net, the computation costs for training VGG16-U-Net were dramatically lower than Residual Attention U-Net.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"21 ","pages":"Article 100150"},"PeriodicalIF":2.6000,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197423000393/pdfft?md5=e26e50e9fd7c6d7b45541d9f356c212b&pid=1-s2.0-S2590197423000393-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Computing and Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590197423000393","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Inland water bodies play a vital role at all scales in the terrestrial water balance and Earth’s climate variability. Thus, an inventory of inland waters is crucially important for hydrologic and ecological studies and management. Therefore, the main aim of this study was to develop a deep learning-based method for inventorying and mapping inland water bodies using the RGB band of high-resolution satellite imagery automatically and accurately.

The Sentinel-2 Harmonized dataset, together with ZABAGED-validated ground truth, was used as the main dataset for the model training step. Three different deep learning algorithms based on U-Net architecture were employed to segment inland waters, including a simple U-Net, Residual Attention U-Net, and VGG16-U-Net. All three algorithms were trained using a combination of Sentinel-2 visible bands (Red [B04; 665nm], Green [B03; 560nm], and Blue [B02; 490 nm]) at a 10-meter spatial resolution.

The Residual Attention U-Net achieved the highest computational cost due to the increased number of trainable parameters. The VGG16-U-Net had the shortest run time and the lowest number of trainable parameters, attributed to its architecture compared to the simple and Residual Attention U-Net architectures, respectively. As a result, the VGG16-U-Net provided the best segmentation results with a mean-IoU score of 0.9850, a slight improvement compared to other proposed U-Net-based architectures.

Although the accuracy of the model based on VGG16-U-Net does not make a difference from Residual Attention U-Net, the computation costs for training VGG16-U-Net were dramatically lower than Residual Attention U-Net.

用于清查内陆水体的简单 U-网、剩余注意力 U-网和 VGG16-U-Net 的性能比较分析
内陆水体在陆地水量平衡和地球气候多变性的各个尺度上都发挥着至关重要的作用。因此,内陆水域清单对于水文和生态研究及管理至关重要。因此,本研究的主要目的是开发一种基于深度学习的方法,利用高分辨率卫星图像的 RGB 波段自动准确地清查和绘制内陆水体。在对内陆水域进行分割时,采用了三种不同的基于 U-Net 架构的深度学习算法,包括简单 U-Net、Residual Attention U-Net 和 VGG16-U-Net。这三种算法都是使用哨兵-2 的可见光波段(红波段[B04; 665nm]、绿波段[B03; 560nm]和蓝波段[B02; 490nm])组合进行训练的,空间分辨率为 10 米。由于可训练参数的数量增加,残留注意力 U-Net 的计算成本最高。VGG16-U-Net 的运行时间最短,可训练参数数量最少,这分别归因于其架构与简单 U-Net 架构和剩余注意力 U-Net 架构相比。因此,VGG16-U-Net 提供了最好的分割结果,平均 IoU 得分为 0.9850,与其他基于 U-Net 的架构相比略有提高。虽然基于 VGG16-U-Net 的模型的准确性与残差注意 U-Net 没有区别,但训练 VGG16-U-Net 的计算成本却大大低于残差注意 U-Net。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Computing and Geosciences
Applied Computing and Geosciences Computer Science-General Computer Science
CiteScore
5.50
自引率
0.00%
发文量
23
审稿时长
5 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信