{"title":"评估使用机器学习算法根据一组简化的预测因子确定水库水质的可行性","authors":"Natalia Walczak , Zbigniew Walczak","doi":"10.1016/j.ecolind.2025.113556","DOIUrl":null,"url":null,"abstract":"<div><div>The present study analyses the possibility of assessing water quality using the water quality index (WQI) through the application of four different machine learning algorithms (ML): neural network models (NNM), random forest (RF), k-nearest neighbor (KNN), and linear regression (LR). Water quality was determined based on 5 indicators: P, COD, BOD<sub>5</sub>, N total, and total suspended solids TS. The possibility of predicting water quality (WQI index) based on the reduced number of predictors was then analyzed. It was estimated which indicators have the most significant impact on WQI values. The performance of models using different algorithms, as well as those trained on full and reduced data sets, was compared. The models demonstrate high performance in WQI prediction, achieving an R<sup>2</sup> of 0.999 (for NNM and LR) for the entire dataset, 0.988 (KNN) for the dataset using only three types of predictors, and 0.941 for the dataset using only two predictors (RF). The construction and training of ML models for reduced sets and types of predictors will enable early water quality estimation based on only a few selected parameters. The implementation of ML algorithms will enable more effective water quality management and significantly improve the precision of predictions for critical water parameters.</div></div>","PeriodicalId":11459,"journal":{"name":"Ecological Indicators","volume":"175 ","pages":"Article 113556"},"PeriodicalIF":7.0000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing the feasibility of using Machine learning algorithms to determine reservoir water quality based on a reduced set of predictors\",\"authors\":\"Natalia Walczak , Zbigniew Walczak\",\"doi\":\"10.1016/j.ecolind.2025.113556\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The present study analyses the possibility of assessing water quality using the water quality index (WQI) through the application of four different machine learning algorithms (ML): neural network models (NNM), random forest (RF), k-nearest neighbor (KNN), and linear regression (LR). Water quality was determined based on 5 indicators: P, COD, BOD<sub>5</sub>, N total, and total suspended solids TS. The possibility of predicting water quality (WQI index) based on the reduced number of predictors was then analyzed. It was estimated which indicators have the most significant impact on WQI values. The performance of models using different algorithms, as well as those trained on full and reduced data sets, was compared. The models demonstrate high performance in WQI prediction, achieving an R<sup>2</sup> of 0.999 (for NNM and LR) for the entire dataset, 0.988 (KNN) for the dataset using only three types of predictors, and 0.941 for the dataset using only two predictors (RF). The construction and training of ML models for reduced sets and types of predictors will enable early water quality estimation based on only a few selected parameters. The implementation of ML algorithms will enable more effective water quality management and significantly improve the precision of predictions for critical water parameters.</div></div>\",\"PeriodicalId\":11459,\"journal\":{\"name\":\"Ecological Indicators\",\"volume\":\"175 \",\"pages\":\"Article 113556\"},\"PeriodicalIF\":7.0000,\"publicationDate\":\"2025-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Indicators\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1470160X25004868\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Indicators","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1470160X25004868","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Assessing the feasibility of using Machine learning algorithms to determine reservoir water quality based on a reduced set of predictors
The present study analyses the possibility of assessing water quality using the water quality index (WQI) through the application of four different machine learning algorithms (ML): neural network models (NNM), random forest (RF), k-nearest neighbor (KNN), and linear regression (LR). Water quality was determined based on 5 indicators: P, COD, BOD5, N total, and total suspended solids TS. The possibility of predicting water quality (WQI index) based on the reduced number of predictors was then analyzed. It was estimated which indicators have the most significant impact on WQI values. The performance of models using different algorithms, as well as those trained on full and reduced data sets, was compared. The models demonstrate high performance in WQI prediction, achieving an R2 of 0.999 (for NNM and LR) for the entire dataset, 0.988 (KNN) for the dataset using only three types of predictors, and 0.941 for the dataset using only two predictors (RF). The construction and training of ML models for reduced sets and types of predictors will enable early water quality estimation based on only a few selected parameters. The implementation of ML algorithms will enable more effective water quality management and significantly improve the precision of predictions for critical water parameters.
期刊介绍:
The ultimate aim of Ecological Indicators is to integrate the monitoring and assessment of ecological and environmental indicators with management practices. The journal provides a forum for the discussion of the applied scientific development and review of traditional indicator approaches as well as for theoretical, modelling and quantitative applications such as index development. Research into the following areas will be published.
• All aspects of ecological and environmental indicators and indices.
• New indicators, and new approaches and methods for indicator development, testing and use.
• Development and modelling of indices, e.g. application of indicator suites across multiple scales and resources.
• Analysis and research of resource, system- and scale-specific indicators.
• Methods for integration of social and other valuation metrics for the production of scientifically rigorous and politically-relevant assessments using indicator-based monitoring and assessment programs.
• How research indicators can be transformed into direct application for management purposes.
• Broader assessment objectives and methods, e.g. biodiversity, biological integrity, and sustainability, through the use of indicators.
• Resource-specific indicators such as landscape, agroecosystems, forests, wetlands, etc.