Estimation of water quality variables based on machine learning model and cluster analysis-based empirical model using multi-source remote sensing data in inland reservoirs, South China
Di Tian, Xinfeng Zhao, Lei Gao, Zuobing Liang, Zaizhi Yang, Pengcheng Zhang, Qirui Wu, Kun Ren, Rui Li, Chenchen Yang, Shaoheng Li, Meng Wang, Zhidong He, Zebin Zhang, Jianyao Chen
{"title":"Estimation of water quality variables based on machine learning model and cluster analysis-based empirical model using multi-source remote sensing data in inland reservoirs, South China","authors":"Di Tian, Xinfeng Zhao, Lei Gao, Zuobing Liang, Zaizhi Yang, Pengcheng Zhang, Qirui Wu, Kun Ren, Rui Li, Chenchen Yang, Shaoheng Li, Meng Wang, Zhidong He, Zebin Zhang, Jianyao Chen","doi":"10.1016/j.envpol.2023.123104","DOIUrl":null,"url":null,"abstract":"<p>Reservoirs play important roles in the drinking water supply for urban residents, agricultural water provision, and the maintenance of ecosystem health. Satellite optical remote sensing of water quality variables in medium and micro-sized inland waters under oligotrophic and mesotrophic status is challenging in terms of the spatio-temporal resolution, weather conditions and frequent nutrient status changes in reservoirs, etc., especially when quantifying non-optically active components (non-OACs). This study was based on the surface reflectance products of unmanned aerial vehicle (UAV) multispectral images, Sentinel-2B Multispectral instrument (MSI) images and Landsat 7 Enhanced Thematic Mapper Plus (ETM+) by utilizing fuzzy C-means (FCM) clustering algorithm was combined with band combination (BC) model to construct the FCM-BC empirical model, and used mixed density network (MDN), extreme gradient boosting (XGBoost), deep neural network (DNN) and support vector regression (SVR) machine learning (ML) models to invert 12 kinds of optically active components (OACs) and non-OACs. Compared with the unclustered BC (UC) model, the mean coefficient of determination (MR) of the FCM-BC models was improved by at least 46.9%. MDN model showed best accuracy (R<sup>2</sup> in the range of 0.60–0.98) and stability (R<sup>2</sup> decreased by up to 13.2%). The accuracy of UAV was relatively higher in both empirical methods and machine learning methods. Additionally, the spatio-temporal distribution maps of four water quality variables were mapped based on the MDN model and UAV images, all platforms showed good consistency. An inversion strategy of water quality variables in various monitoring frequencies and weather conditions were proposed finally. The purpose of introducing the UAV platform was to cooperate with the satellite to improve the monitoring response ability of OACs and non-OACs in small and micro-sized oligotrophic and mesotrophic water bodies.</p>","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"70 1","pages":""},"PeriodicalIF":7.3000,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.envpol.2023.123104","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Reservoirs play important roles in the drinking water supply for urban residents, agricultural water provision, and the maintenance of ecosystem health. Satellite optical remote sensing of water quality variables in medium and micro-sized inland waters under oligotrophic and mesotrophic status is challenging in terms of the spatio-temporal resolution, weather conditions and frequent nutrient status changes in reservoirs, etc., especially when quantifying non-optically active components (non-OACs). This study was based on the surface reflectance products of unmanned aerial vehicle (UAV) multispectral images, Sentinel-2B Multispectral instrument (MSI) images and Landsat 7 Enhanced Thematic Mapper Plus (ETM+) by utilizing fuzzy C-means (FCM) clustering algorithm was combined with band combination (BC) model to construct the FCM-BC empirical model, and used mixed density network (MDN), extreme gradient boosting (XGBoost), deep neural network (DNN) and support vector regression (SVR) machine learning (ML) models to invert 12 kinds of optically active components (OACs) and non-OACs. Compared with the unclustered BC (UC) model, the mean coefficient of determination (MR) of the FCM-BC models was improved by at least 46.9%. MDN model showed best accuracy (R2 in the range of 0.60–0.98) and stability (R2 decreased by up to 13.2%). The accuracy of UAV was relatively higher in both empirical methods and machine learning methods. Additionally, the spatio-temporal distribution maps of four water quality variables were mapped based on the MDN model and UAV images, all platforms showed good consistency. An inversion strategy of water quality variables in various monitoring frequencies and weather conditions were proposed finally. The purpose of introducing the UAV platform was to cooperate with the satellite to improve the monitoring response ability of OACs and non-OACs in small and micro-sized oligotrophic and mesotrophic water bodies.
期刊介绍:
Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health.
Subject areas include, but are not limited to:
• Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies;
• Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change;
• Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects;
• Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects;
• Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest;
• New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.