Mosaraf Hosan Nishat, Md. Habibur Rahman Bejoy Khan, Tahmeed Ahmed, Syed Nahin Hossain, Amimul Ahsan, M. M. El-Sergany, Md. Shafiquzzaman, Monzur Alam Imteaz, Mohammad T. Alresheedi
{"title":"Comparative analysis of machine learning models for predicting water quality index in Dhaka’s rivers of Bangladesh","authors":"Mosaraf Hosan Nishat, Md. Habibur Rahman Bejoy Khan, Tahmeed Ahmed, Syed Nahin Hossain, Amimul Ahsan, M. M. El-Sergany, Md. Shafiquzzaman, Monzur Alam Imteaz, Mohammad T. Alresheedi","doi":"10.1186/s12302-025-01078-w","DOIUrl":null,"url":null,"abstract":"<div><p>The pollution in Dhaka's navigable waterways, including the Buriganga, Balu, Tongi Khal, and Turag rivers, is a significant concern due to rapid industrial and urban expansion. Industrial discharges, domestic sewage and inadequate waste management are the primary sources of this pollution, degrading water quality and threatening aquatic ecosystems. This study aimed to predict the Water Quality Index (WQI) of these rivers using fourteen machine learning (ML) models: Decision Tree Regression, Linear Regression, Ridge Regression, Stochastic Gradient Descent (SGD) Regressor, Extreme Gradient Boosting (XGB) Regressor, Light Gradient Boosting Machine (GBM) Regressor, Elastic Net Regressor, Support Vector Regression (SVM), Random Forest Regression, Bayesian Ridge Regressor, Artificial Neural Network (ANN), AdaBoost Regressor, CatBoost Regressor and Extra Trees Regressor. The objective was to evaluate and compare these models to identify the most effective predictive method for WQI, enabling efficient environmental monitoring and management of urban waterways. Among the evaluated ML models, ANN and Random Forest Regressor performed the best. The ANN model demonstrated superior predictive capability, achieving a Root Mean Squared Error (RMSE) of 2.34, a Mean Absolute Error (MAE) of 1.24, a Nash–Sutcliffe Efficiency (NSE) of 0.97, and a Coefficient of Determination (R<sup>2</sup>) of 0.97. Furthermore, an Adjusted <i>R</i><sup><i>2</i></sup> value of 0.965 further confirmed its ability to capture complex patterns in water quality data with remarkable accuracy. These findings emphasize the importance of using AI modeling techniques, specifically ANN and Random Forest Regression, to improve the accuracy of WQI forecasts for the waterways. This study contributes to the field of environmental science by offering a novel integration of feature selection techniques with ML models to enhance efficiency and cost-effectiveness of water quality monitoring. Unlike previous studies, this research specifically addresses the challenges of urban waterways in Dhaka, Bangladesh, a region significantly impacted by industrial and urban pollution. To our knowledge, this is the first study to apply such a comprehensive range of ML models to predict the WQI of Dhaka’s four major rivers. By providing a reliable methodology for WQI estimation, this study supports informed decision-making and proactive measures to protect vital water resources.</p></div>","PeriodicalId":546,"journal":{"name":"Environmental Sciences Europe","volume":"37 1","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1186/s12302-025-01078-w.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Sciences Europe","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1186/s12302-025-01078-w","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The pollution in Dhaka's navigable waterways, including the Buriganga, Balu, Tongi Khal, and Turag rivers, is a significant concern due to rapid industrial and urban expansion. Industrial discharges, domestic sewage and inadequate waste management are the primary sources of this pollution, degrading water quality and threatening aquatic ecosystems. This study aimed to predict the Water Quality Index (WQI) of these rivers using fourteen machine learning (ML) models: Decision Tree Regression, Linear Regression, Ridge Regression, Stochastic Gradient Descent (SGD) Regressor, Extreme Gradient Boosting (XGB) Regressor, Light Gradient Boosting Machine (GBM) Regressor, Elastic Net Regressor, Support Vector Regression (SVM), Random Forest Regression, Bayesian Ridge Regressor, Artificial Neural Network (ANN), AdaBoost Regressor, CatBoost Regressor and Extra Trees Regressor. The objective was to evaluate and compare these models to identify the most effective predictive method for WQI, enabling efficient environmental monitoring and management of urban waterways. Among the evaluated ML models, ANN and Random Forest Regressor performed the best. The ANN model demonstrated superior predictive capability, achieving a Root Mean Squared Error (RMSE) of 2.34, a Mean Absolute Error (MAE) of 1.24, a Nash–Sutcliffe Efficiency (NSE) of 0.97, and a Coefficient of Determination (R2) of 0.97. Furthermore, an Adjusted R2 value of 0.965 further confirmed its ability to capture complex patterns in water quality data with remarkable accuracy. These findings emphasize the importance of using AI modeling techniques, specifically ANN and Random Forest Regression, to improve the accuracy of WQI forecasts for the waterways. This study contributes to the field of environmental science by offering a novel integration of feature selection techniques with ML models to enhance efficiency and cost-effectiveness of water quality monitoring. Unlike previous studies, this research specifically addresses the challenges of urban waterways in Dhaka, Bangladesh, a region significantly impacted by industrial and urban pollution. To our knowledge, this is the first study to apply such a comprehensive range of ML models to predict the WQI of Dhaka’s four major rivers. By providing a reliable methodology for WQI estimation, this study supports informed decision-making and proactive measures to protect vital water resources.
期刊介绍:
ESEU is an international journal, focusing primarily on Europe, with a broad scope covering all aspects of environmental sciences, including the main topic regulation.
ESEU will discuss the entanglement between environmental sciences and regulation because, in recent years, there have been misunderstandings and even disagreement between stakeholders in these two areas. ESEU will help to improve the comprehension of issues between environmental sciences and regulation.
ESEU will be an outlet from the German-speaking (DACH) countries to Europe and an inlet from Europe to the DACH countries regarding environmental sciences and regulation.
Moreover, ESEU will facilitate the exchange of ideas and interaction between Europe and the DACH countries regarding environmental regulatory issues.
Although Europe is at the center of ESEU, the journal will not exclude the rest of the world, because regulatory issues pertaining to environmental sciences can be fully seen only from a global perspective.