Road Image Segmentation using Unmanned Aerial Vehicle Images and DeepLab V3+ Semantic Segmentation Model

Mat Nizam Mahmud, M. K. Osman, A. P. Ismail, F. Ahmad, K. A. Ahmad, A. Ibrahim
{"title":"Road Image Segmentation using Unmanned Aerial Vehicle Images and DeepLab V3+ Semantic Segmentation Model","authors":"Mat Nizam Mahmud, M. K. Osman, A. P. Ismail, F. Ahmad, K. A. Ahmad, A. Ibrahim","doi":"10.1109/ICCSCE52189.2021.9530950","DOIUrl":null,"url":null,"abstract":"Road image segmentation is critical in a variety of applications, including road maintenance, intelligent transportation systems, and urban planning. Numerous image segmentation techniques, including popular neural network approaches, have been proposed for unmanned aerial vehicle (UAV) images recently. However, since these images include complex backgrounds, high-precision road segmentation from UAV images remains challenging. To address this issue, this study proposes a deep learning method called DeepLab V3+ semantic segmentation. Road images are captured and collected from several roads in Kedah and Selangor, Malaysia using a UAV. To segment the road from the background, the DeepLab V3+ with Resnet-50 backbone is utilised. Then, the performance is assessed by comparing segmented images by deep learning to manually segment images. Three metrics are used for the assessment; pixel accuracy (PA), mean area intersection by union (mIoU), and mean F1-score (MeanF1). The study also compares the segmentation performance with the DeepLab V3+ with mobile NetV2 for benchmarking purposes. Simulation results show that the DeepLab V3+ with Resnet-50 has performed better than the DeepLab V3+ with mobile NetV2 methods. The findings indicate that the DeepLab V3+ with Resnet-50 outperformed the DeepLab V3+ with mobile NetV2 for PA, mIoU, and MeanF1 by 1.39 %, 4.92 %, and 9.71 %, respectively.","PeriodicalId":285507,"journal":{"name":"2021 11th IEEE International Conference on Control System, Computing and Engineering (ICCSCE)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th IEEE International Conference on Control System, Computing and Engineering (ICCSCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSCE52189.2021.9530950","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Road image segmentation is critical in a variety of applications, including road maintenance, intelligent transportation systems, and urban planning. Numerous image segmentation techniques, including popular neural network approaches, have been proposed for unmanned aerial vehicle (UAV) images recently. However, since these images include complex backgrounds, high-precision road segmentation from UAV images remains challenging. To address this issue, this study proposes a deep learning method called DeepLab V3+ semantic segmentation. Road images are captured and collected from several roads in Kedah and Selangor, Malaysia using a UAV. To segment the road from the background, the DeepLab V3+ with Resnet-50 backbone is utilised. Then, the performance is assessed by comparing segmented images by deep learning to manually segment images. Three metrics are used for the assessment; pixel accuracy (PA), mean area intersection by union (mIoU), and mean F1-score (MeanF1). The study also compares the segmentation performance with the DeepLab V3+ with mobile NetV2 for benchmarking purposes. Simulation results show that the DeepLab V3+ with Resnet-50 has performed better than the DeepLab V3+ with mobile NetV2 methods. The findings indicate that the DeepLab V3+ with Resnet-50 outperformed the DeepLab V3+ with mobile NetV2 for PA, mIoU, and MeanF1 by 1.39 %, 4.92 %, and 9.71 %, respectively.
基于无人机图像和DeepLab V3+语义分割模型的道路图像分割
道路图像分割在各种应用中至关重要,包括道路维护、智能交通系统和城市规划。近年来,针对无人机图像的分割技术层出不穷,其中包括流行的神经网络分割方法。然而,由于这些图像包含复杂的背景,从无人机图像中进行高精度道路分割仍然具有挑战性。为了解决这个问题,本研究提出了一种深度学习方法DeepLab V3+语义分割。使用无人机从马来西亚吉打州和雪兰莪州的几条道路上捕获和收集道路图像。为了从背景中分割道路,使用了带有Resnet-50骨干网的DeepLab V3+。然后,通过将深度学习分割的图像与手动分割的图像进行比较,评估其性能。评估使用了三个指标;像素精度(PA)、平均面积交会并(mIoU)、平均F1-score (MeanF1)。该研究还比较了DeepLab V3+与移动NetV2的分割性能,以进行基准测试。仿真结果表明,采用Resnet-50的DeepLab V3+比采用移动NetV2方法的DeepLab V3+性能更好。结果表明,采用Resnet-50的DeepLab V3+在PA、mIoU和MeanF1方面分别优于采用移动NetV2的DeepLab V3+,分别提高1.39%、4.92%和9.71%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信