Jiayu Cui , Shiqiang Wu , Jiangyu Dai , Wanyun Xue , Yue Zhang , Jiayi You , Xueyan Lv , Xuan Yang
{"title":"基于Sentinel-2图像和优化XGBoost模型的太湖总磷卫星反演","authors":"Jiayu Cui , Shiqiang Wu , Jiangyu Dai , Wanyun Xue , Yue Zhang , Jiayi You , Xueyan Lv , Xuan Yang","doi":"10.1016/j.ecolind.2025.113563","DOIUrl":null,"url":null,"abstract":"<div><div>Satellite remote sensing technology holds significant potential for water quality monitoring due to its high spatio-temporal resolution and long-term observational capabilities. However, developing satellite-retrieved models for non-optical active parameters such as total phosphorus remains challenging. In this study, we utilized measured total phosphorus data from 17 floating boat stations in Taihu Lake and Sentinel-2 satellite imagery to construct an XGBoost model based on Bayesian optimization and feature selection for the retrieval of total phosphorus levels in the lake. The model employed single bands and band ratios at various resolutions as input features, optimized model parameters through Bayesian optimization, and incrementally introduced variables with substantial contributions to evaluate and determine the optimal feature combination. Results indicated that band ratios at different resolutions constituted 79 % of the optimal feature combination, demonstrating significantly better performance than single bands. The coefficient of determination (R<sup>2</sup>) was 0.7151, and the root mean square error (RMSE) was 0.0179 mg/L. Compared to using all feature variables, the R<sup>2</sup> increased by 35 %, RMSE decreased by 11.4 %, and the accuracy rate of total phosphorus retrieval concentration reached 81.7 %. The spatial distribution of total phosphorus derived from the satellite-retrieved model aligned well with the actual conditions. The optimal band combination model exhibited high stability and generalization ability. This study provides a practical reference for constructing a satellite retrieval model to determine inactive water quality parameters, and also provides strong technical support for the construction of an air-space-ground integrated monitoring network, especially in the context of open sharing of satellite data, and offers a feasible path for realizing large-scale and low-cost water quality supervision.</div></div>","PeriodicalId":11459,"journal":{"name":"Ecological Indicators","volume":"175 ","pages":"Article 113563"},"PeriodicalIF":7.0000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Satellite retrievals of total phosphorus in Taihu Lake using Sentinel-2 images and an optimized XGBoost model\",\"authors\":\"Jiayu Cui , Shiqiang Wu , Jiangyu Dai , Wanyun Xue , Yue Zhang , Jiayi You , Xueyan Lv , Xuan Yang\",\"doi\":\"10.1016/j.ecolind.2025.113563\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Satellite remote sensing technology holds significant potential for water quality monitoring due to its high spatio-temporal resolution and long-term observational capabilities. However, developing satellite-retrieved models for non-optical active parameters such as total phosphorus remains challenging. In this study, we utilized measured total phosphorus data from 17 floating boat stations in Taihu Lake and Sentinel-2 satellite imagery to construct an XGBoost model based on Bayesian optimization and feature selection for the retrieval of total phosphorus levels in the lake. The model employed single bands and band ratios at various resolutions as input features, optimized model parameters through Bayesian optimization, and incrementally introduced variables with substantial contributions to evaluate and determine the optimal feature combination. Results indicated that band ratios at different resolutions constituted 79 % of the optimal feature combination, demonstrating significantly better performance than single bands. The coefficient of determination (R<sup>2</sup>) was 0.7151, and the root mean square error (RMSE) was 0.0179 mg/L. Compared to using all feature variables, the R<sup>2</sup> increased by 35 %, RMSE decreased by 11.4 %, and the accuracy rate of total phosphorus retrieval concentration reached 81.7 %. The spatial distribution of total phosphorus derived from the satellite-retrieved model aligned well with the actual conditions. The optimal band combination model exhibited high stability and generalization ability. This study provides a practical reference for constructing a satellite retrieval model to determine inactive water quality parameters, and also provides strong technical support for the construction of an air-space-ground integrated monitoring network, especially in the context of open sharing of satellite data, and offers a feasible path for realizing large-scale and low-cost water quality supervision.</div></div>\",\"PeriodicalId\":11459,\"journal\":{\"name\":\"Ecological Indicators\",\"volume\":\"175 \",\"pages\":\"Article 113563\"},\"PeriodicalIF\":7.0000,\"publicationDate\":\"2025-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Indicators\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1470160X25004935\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Indicators","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1470160X25004935","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Satellite retrievals of total phosphorus in Taihu Lake using Sentinel-2 images and an optimized XGBoost model
Satellite remote sensing technology holds significant potential for water quality monitoring due to its high spatio-temporal resolution and long-term observational capabilities. However, developing satellite-retrieved models for non-optical active parameters such as total phosphorus remains challenging. In this study, we utilized measured total phosphorus data from 17 floating boat stations in Taihu Lake and Sentinel-2 satellite imagery to construct an XGBoost model based on Bayesian optimization and feature selection for the retrieval of total phosphorus levels in the lake. The model employed single bands and band ratios at various resolutions as input features, optimized model parameters through Bayesian optimization, and incrementally introduced variables with substantial contributions to evaluate and determine the optimal feature combination. Results indicated that band ratios at different resolutions constituted 79 % of the optimal feature combination, demonstrating significantly better performance than single bands. The coefficient of determination (R2) was 0.7151, and the root mean square error (RMSE) was 0.0179 mg/L. Compared to using all feature variables, the R2 increased by 35 %, RMSE decreased by 11.4 %, and the accuracy rate of total phosphorus retrieval concentration reached 81.7 %. The spatial distribution of total phosphorus derived from the satellite-retrieved model aligned well with the actual conditions. The optimal band combination model exhibited high stability and generalization ability. This study provides a practical reference for constructing a satellite retrieval model to determine inactive water quality parameters, and also provides strong technical support for the construction of an air-space-ground integrated monitoring network, especially in the context of open sharing of satellite data, and offers a feasible path for realizing large-scale and low-cost water quality supervision.
期刊介绍:
The ultimate aim of Ecological Indicators is to integrate the monitoring and assessment of ecological and environmental indicators with management practices. The journal provides a forum for the discussion of the applied scientific development and review of traditional indicator approaches as well as for theoretical, modelling and quantitative applications such as index development. Research into the following areas will be published.
• All aspects of ecological and environmental indicators and indices.
• New indicators, and new approaches and methods for indicator development, testing and use.
• Development and modelling of indices, e.g. application of indicator suites across multiple scales and resources.
• Analysis and research of resource, system- and scale-specific indicators.
• Methods for integration of social and other valuation metrics for the production of scientifically rigorous and politically-relevant assessments using indicator-based monitoring and assessment programs.
• How research indicators can be transformed into direct application for management purposes.
• Broader assessment objectives and methods, e.g. biodiversity, biological integrity, and sustainability, through the use of indicators.
• Resource-specific indicators such as landscape, agroecosystems, forests, wetlands, etc.