Ze-Hua Xu, Bai-Yin Liu, Bin Li, Wen-Ting Qiu, Zhi-Miao Zhang, Wei Wang
{"title":"[基于可解释集成机器学习的巢湖盆地溶解氧驱动因素研究]。","authors":"Ze-Hua Xu, Bai-Yin Liu, Bin Li, Wen-Ting Qiu, Zhi-Miao Zhang, Wei Wang","doi":"10.13227/j.hjkx.202407234","DOIUrl":null,"url":null,"abstract":"<p><p>Dissolved oxygen (DO) is a crucial indicator of the health of aquatic ecosystems. Low concentrations of DO can threaten the survival of aquatic life and disrupt the balance of ecosystems. Therefore, accurately identifying and quantifying the factors influencing DO is essential for developing effective water resource management strategies. This study selected the Chaohu Lake Basin as a case study area and utilized integrated machine learning methods and SHAP analysis to identify and explain the key factors affecting DO variations in the region systematically. By integrating models such as Random Forest, LightGBM, and XGBoost, the study demonstrated that a highly accurate predictive model can be constructed (<i>R</i><sup>2</sup>=0.94, RMSE=0.62 mg·L<sup>-1</sup>, MAE=0.41 mg·L<sup>-1</sup>). Water temperature, ammonia nitrogen (NH<sub>3</sub>-N), and pH were identified as the primary factors affecting DO concentrations, contributing 53.5%, 17.6%, and 9.1% of the effect, respectively. During hypoxic phases, the dominant factors shift, with the importance of water temperature and NH<sub>3</sub>-N decreasing by 20.5% and 7.9%, while the significance of pH, relative humidity, and conductivity increases by 7.1%, 3.7%, and 4.8%, respectively. Partial dependence analysis revealed that increasing water temperature and NH<sub>3</sub>-N concentration significantly decrease DO levels; a moderate pH facilitates the dissolution of oxygen, whereas extremely acidic or alkaline conditions may negatively impact DO. Under interactive effects, high temperatures, increased total phosphorus, and low atmospheric pressure amplify the negative impact of temperature on DO concentrations. Through these analyses, this study enhances understanding of the dynamics of DO, providing data support and scientific basis for DO monitoring, hypoxia warning, and management in the Chaohu Lake Basin and thereby aiding in the sustainable use and ecological protection of its water resources.</p>","PeriodicalId":35937,"journal":{"name":"环境科学","volume":"46 9","pages":"5641-5649"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"[Investigation of Dissolved Oxygen Drivers in the Chaohu Basin Using Explainable Integrated Machine Learning].\",\"authors\":\"Ze-Hua Xu, Bai-Yin Liu, Bin Li, Wen-Ting Qiu, Zhi-Miao Zhang, Wei Wang\",\"doi\":\"10.13227/j.hjkx.202407234\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Dissolved oxygen (DO) is a crucial indicator of the health of aquatic ecosystems. Low concentrations of DO can threaten the survival of aquatic life and disrupt the balance of ecosystems. Therefore, accurately identifying and quantifying the factors influencing DO is essential for developing effective water resource management strategies. This study selected the Chaohu Lake Basin as a case study area and utilized integrated machine learning methods and SHAP analysis to identify and explain the key factors affecting DO variations in the region systematically. By integrating models such as Random Forest, LightGBM, and XGBoost, the study demonstrated that a highly accurate predictive model can be constructed (<i>R</i><sup>2</sup>=0.94, RMSE=0.62 mg·L<sup>-1</sup>, MAE=0.41 mg·L<sup>-1</sup>). Water temperature, ammonia nitrogen (NH<sub>3</sub>-N), and pH were identified as the primary factors affecting DO concentrations, contributing 53.5%, 17.6%, and 9.1% of the effect, respectively. During hypoxic phases, the dominant factors shift, with the importance of water temperature and NH<sub>3</sub>-N decreasing by 20.5% and 7.9%, while the significance of pH, relative humidity, and conductivity increases by 7.1%, 3.7%, and 4.8%, respectively. Partial dependence analysis revealed that increasing water temperature and NH<sub>3</sub>-N concentration significantly decrease DO levels; a moderate pH facilitates the dissolution of oxygen, whereas extremely acidic or alkaline conditions may negatively impact DO. Under interactive effects, high temperatures, increased total phosphorus, and low atmospheric pressure amplify the negative impact of temperature on DO concentrations. Through these analyses, this study enhances understanding of the dynamics of DO, providing data support and scientific basis for DO monitoring, hypoxia warning, and management in the Chaohu Lake Basin and thereby aiding in the sustainable use and ecological protection of its water resources.</p>\",\"PeriodicalId\":35937,\"journal\":{\"name\":\"环境科学\",\"volume\":\"46 9\",\"pages\":\"5641-5649\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"环境科学\",\"FirstCategoryId\":\"1087\",\"ListUrlMain\":\"https://doi.org/10.13227/j.hjkx.202407234\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Environmental Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"环境科学","FirstCategoryId":"1087","ListUrlMain":"https://doi.org/10.13227/j.hjkx.202407234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Environmental Science","Score":null,"Total":0}
[Investigation of Dissolved Oxygen Drivers in the Chaohu Basin Using Explainable Integrated Machine Learning].
Dissolved oxygen (DO) is a crucial indicator of the health of aquatic ecosystems. Low concentrations of DO can threaten the survival of aquatic life and disrupt the balance of ecosystems. Therefore, accurately identifying and quantifying the factors influencing DO is essential for developing effective water resource management strategies. This study selected the Chaohu Lake Basin as a case study area and utilized integrated machine learning methods and SHAP analysis to identify and explain the key factors affecting DO variations in the region systematically. By integrating models such as Random Forest, LightGBM, and XGBoost, the study demonstrated that a highly accurate predictive model can be constructed (R2=0.94, RMSE=0.62 mg·L-1, MAE=0.41 mg·L-1). Water temperature, ammonia nitrogen (NH3-N), and pH were identified as the primary factors affecting DO concentrations, contributing 53.5%, 17.6%, and 9.1% of the effect, respectively. During hypoxic phases, the dominant factors shift, with the importance of water temperature and NH3-N decreasing by 20.5% and 7.9%, while the significance of pH, relative humidity, and conductivity increases by 7.1%, 3.7%, and 4.8%, respectively. Partial dependence analysis revealed that increasing water temperature and NH3-N concentration significantly decrease DO levels; a moderate pH facilitates the dissolution of oxygen, whereas extremely acidic or alkaline conditions may negatively impact DO. Under interactive effects, high temperatures, increased total phosphorus, and low atmospheric pressure amplify the negative impact of temperature on DO concentrations. Through these analyses, this study enhances understanding of the dynamics of DO, providing data support and scientific basis for DO monitoring, hypoxia warning, and management in the Chaohu Lake Basin and thereby aiding in the sustainable use and ecological protection of its water resources.