Evaluation of machine learning algorithm capability for Bosten Lake Wetland classification based on multi-temporal Sentinel-2 data

IF 5.8 2区 环境科学与生态学 Q1 ECOLOGY
Feiying Xia , Guanghui Lv
{"title":"Evaluation of machine learning algorithm capability for Bosten Lake Wetland classification based on multi-temporal Sentinel-2 data","authors":"Feiying Xia ,&nbsp;Guanghui Lv","doi":"10.1016/j.ecoinf.2024.102839","DOIUrl":null,"url":null,"abstract":"<div><div>As crucial carbon sinks within terrestrial ecosystems, wetlands have been extensively studied in terms of spatio-temporal distributions. However, existing methods for classifying wetlands are of limited accuracy, and it is difficult to acquire consistent samples over time. Therefore, precise classification methods are required to facilitate wetland conservation and ecological restoration. In this study, multiple machine learning (ML) algorithms in combination with feature sets based on Sentinel-2 data were used to accurately classify the land-use types (LUTs) of the Bosten Lake Wetland (BLW) in Xinjiang, China. The enhanced water index (EWI), modified normalised difference water index (MNDWI), and normalised difference water index (NDWI) were selected to extract water information and distinguish water bodies from land surfaces in the BLW. Three classification plans based on vegetation indices, water indices, and textural features were developed using artificial neural network (ANN), support vector machine (SVM), random forest (RF) algorithms. Plan 9 combined vegetation water and texture with the highest overall accuracy (OA) 91.02 % and kappa coefficient (KC) 0.89. This plan obtained a producer accuracy of over 90 % for lake wetlands, river wetlands, grassland wetlands, mud flats, and farmland and &gt; 83 % for construction land and bareland. According to Plan 9, the wetland area during 2018–2023 showed noticeable seasonal fluctuations but stable interannual changes. Conversely, non-wetland areas demonstrated significant interannual fluctuations, particularly in bareland and farmland, which might have been influenced by urbanisation and ecological policies. This study provides insights into the data sources, feature selection, and methodological approaches for wetland information extraction in arid regions.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"84 ","pages":"Article 102839"},"PeriodicalIF":5.8000,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954124003819","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

As crucial carbon sinks within terrestrial ecosystems, wetlands have been extensively studied in terms of spatio-temporal distributions. However, existing methods for classifying wetlands are of limited accuracy, and it is difficult to acquire consistent samples over time. Therefore, precise classification methods are required to facilitate wetland conservation and ecological restoration. In this study, multiple machine learning (ML) algorithms in combination with feature sets based on Sentinel-2 data were used to accurately classify the land-use types (LUTs) of the Bosten Lake Wetland (BLW) in Xinjiang, China. The enhanced water index (EWI), modified normalised difference water index (MNDWI), and normalised difference water index (NDWI) were selected to extract water information and distinguish water bodies from land surfaces in the BLW. Three classification plans based on vegetation indices, water indices, and textural features were developed using artificial neural network (ANN), support vector machine (SVM), random forest (RF) algorithms. Plan 9 combined vegetation water and texture with the highest overall accuracy (OA) 91.02 % and kappa coefficient (KC) 0.89. This plan obtained a producer accuracy of over 90 % for lake wetlands, river wetlands, grassland wetlands, mud flats, and farmland and > 83 % for construction land and bareland. According to Plan 9, the wetland area during 2018–2023 showed noticeable seasonal fluctuations but stable interannual changes. Conversely, non-wetland areas demonstrated significant interannual fluctuations, particularly in bareland and farmland, which might have been influenced by urbanisation and ecological policies. This study provides insights into the data sources, feature selection, and methodological approaches for wetland information extraction in arid regions.

Abstract Image

基于多时 Sentinel-2 数据的博斯腾湖湿地分类机器学习算法能力评估
作为陆地生态系统中重要的碳汇,湿地在时空分布方面已得到广泛研究。然而,现有的湿地分类方法准确性有限,而且很难获得长期一致的样本。因此,需要精确的分类方法来促进湿地保护和生态恢复。本研究采用多种机器学习(ML)算法,结合基于哨兵-2 数据的特征集,对中国新疆博斯腾湖湿地(BLW)的土地利用类型(LUT)进行了精确分类。选择了增强水指数(EWI)、修正归一化差异水指数(MNDWI)和归一化差异水指数(NDWI)来提取水信息并区分博斯腾湖湿地的水体和地表。利用人工神经网络(ANN)、支持向量机(SVM)和随机森林(RF)算法,开发了基于植被指数、水指数和纹理特征的三种分类方案。方案 9 结合了植被水分和纹理特征,总体准确率 (OA) 为 91.02 %,卡帕系数 (KC) 为 0.89。该方案对湖泊湿地、河流湿地、草原湿地、滩涂和农田的生产者准确率超过 90%,对建筑用地和裸地的生产者准确率为 83%。根据 "规划 9",2018-2023 年湿地面积呈现明显的季节性波动,但年际变化稳定。相反,非湿地面积则表现出明显的年际波动,尤其是裸地和耕地,这可能受到城市化和生态政策的影响。这项研究为干旱地区湿地信息提取的数据来源、特征选择和方法提供了启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Ecological Informatics
Ecological Informatics 环境科学-生态学
CiteScore
8.30
自引率
11.80%
发文量
346
审稿时长
46 days
期刊介绍: The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change. The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信