Performance Analysis of Machine Learning Algorithms in Generating Urban Land Cover Map of Quezon City, Philippines Using Sentinel-2 Satellite Imagery

Robert Martin C. Santiago, R. Gustilo, G. Arada, E. Magsino, E. Sybingco
{"title":"Performance Analysis of Machine Learning Algorithms in Generating Urban Land Cover Map of Quezon City, Philippines Using Sentinel-2 Satellite Imagery","authors":"Robert Martin C. Santiago, R. Gustilo, G. Arada, E. Magsino, E. Sybingco","doi":"10.1109/HNICEM54116.2021.9731856","DOIUrl":null,"url":null,"abstract":"As urban expansion is expected to persist and may even accelerate in the coming years, understanding and effectively managing urbanization become increasingly important in achieving long-term progress specifically in making cities and human settlements inclusive, safe, resilient, and sustainable. One way to accomplish these is to obtain reliable and updated information about the land cover characteristics of an area in the form of a map which can be done using remote sensing and machine learning. However, the practice of using these technologies for urban land cover mapping was observed to occur in the geographic locality level, and in the case of the Philippines, this is a domain that needs to be further explored to quantitatively comprehend urban extent. In this study, a map of man-made structures or built-up areas and natural structures or nonbuilt-up areas was generated over Quezon City and nearby surrounding areas where rapid rise in population occurs along with urban development. In addition, since related previous studies used various machine learning algorithms in doing the classification, this study compared the performances of three algorithms specifically random forest classifier, k-nearest neighbors, and Gaussian mixture model to identify which performed best in this particular application. The satellite imagery of the area of interest was collected from the Sentinel-2 mission satellites. All the three algorithms attained high accuracies across all measurements with small variations but greatly differed in the time consumed doing the classification. The highest over-all accuracy of 99.32% was obtained using random forest classifier despite taking the longest time to finish the classification, next is 98.95% using the k-nearest neighbors algorithm which also ranked second in terms of speed of classification, and last is 97.17% using the Gaussian mixture model despite being the fastest to complete the classification. Further studies may explore other machine learning algorithms as well as deep learning techniques to harness their capabilities in feature extraction for more complex applications. Aside from Sentinel-2, other satellite missions may also be utilized as sources of satellite imageries which can offer different spectral, spatial, and temporal resolutions that would fit a specific application.","PeriodicalId":129868,"journal":{"name":"2021 IEEE 13th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM)","volume":"174 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 13th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM54116.2021.9731856","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

As urban expansion is expected to persist and may even accelerate in the coming years, understanding and effectively managing urbanization become increasingly important in achieving long-term progress specifically in making cities and human settlements inclusive, safe, resilient, and sustainable. One way to accomplish these is to obtain reliable and updated information about the land cover characteristics of an area in the form of a map which can be done using remote sensing and machine learning. However, the practice of using these technologies for urban land cover mapping was observed to occur in the geographic locality level, and in the case of the Philippines, this is a domain that needs to be further explored to quantitatively comprehend urban extent. In this study, a map of man-made structures or built-up areas and natural structures or nonbuilt-up areas was generated over Quezon City and nearby surrounding areas where rapid rise in population occurs along with urban development. In addition, since related previous studies used various machine learning algorithms in doing the classification, this study compared the performances of three algorithms specifically random forest classifier, k-nearest neighbors, and Gaussian mixture model to identify which performed best in this particular application. The satellite imagery of the area of interest was collected from the Sentinel-2 mission satellites. All the three algorithms attained high accuracies across all measurements with small variations but greatly differed in the time consumed doing the classification. The highest over-all accuracy of 99.32% was obtained using random forest classifier despite taking the longest time to finish the classification, next is 98.95% using the k-nearest neighbors algorithm which also ranked second in terms of speed of classification, and last is 97.17% using the Gaussian mixture model despite being the fastest to complete the classification. Further studies may explore other machine learning algorithms as well as deep learning techniques to harness their capabilities in feature extraction for more complex applications. Aside from Sentinel-2, other satellite missions may also be utilized as sources of satellite imageries which can offer different spectral, spatial, and temporal resolutions that would fit a specific application.
利用Sentinel-2卫星图像生成菲律宾奎松市城市土地覆盖图的机器学习算法性能分析
由于预计未来几年城市扩张将持续,甚至可能加速,理解和有效管理城市化对于实现长期进展,特别是在建设包容、安全、有韧性和可持续的城市和人类住区方面,变得越来越重要。实现这些目标的一种方法是以地图的形式获取有关某一地区土地覆盖特征的可靠和最新信息,这可以使用遥感和机器学习来完成。然而,使用这些技术进行城市土地覆盖测绘的做法被观察到发生在地理位置一级,在菲律宾的情况下,这是一个需要进一步探索的领域,以定量地了解城市范围。在本研究中,我们绘制了奎松市及其周边地区的人造结构或建成区与自然结构或非建成区的地图。随着城市的发展,奎松市的人口迅速增长。此外,由于之前的相关研究使用了各种机器学习算法进行分类,因此本研究比较了三种算法的性能,即随机森林分类器、k近邻和高斯混合模型,以确定哪种算法在该特定应用中表现最佳。该地区的卫星图像是由哨兵2号任务卫星收集的。所有三种算法在所有测量值中都获得了较高的精度,变化很小,但在分类所花费的时间上存在很大差异。随机森林分类器完成分类时间最长,但总体准确率最高,达到99.32%;k近邻分类器分类速度第二,总体准确率为98.95%;高斯混合模型分类速度最快,总体准确率为97.17%。进一步的研究可能会探索其他机器学习算法以及深度学习技术,以利用它们在更复杂应用中的特征提取能力。除了哨兵2号之外,其他卫星任务也可以作为卫星图像的来源,这些卫星图像可以提供适合特定应用的不同光谱、空间和时间分辨率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信