{"title":"A novel predictive framework for water quality assessment based on socio-economic indicators and water leaving reflectance","authors":"Hao Chen , Ali P. Yunus","doi":"10.1016/j.gsd.2025.101405","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate prediction of water quality at both spatial and temporal scales for large water bodies remains a daunting task with significant implications for human well-being and sustainable development (aligned with SDG 6 - clean water and sanitation). Traditional data-driven models on water quality prediction relied on some degree of field subsistence, which are neither cost-effective nor time-efficient. Socio-economic indicators have been concurrently used as predictor variable for water quality; however, such datasets typically available at coarse temporal resolutions, limiting their applicability for time-sensitive analyses. In this study, we integrated machine learning (ML) models with socio-economic indicators and remote sensing reflectance (R<sub>RS</sub>) to address the challenge of temporality in predicting Biochemical Oxygen Demand (BOD) and Total Coliform Bacteria (TCB) levels across 228 lake systems in the Indian subcontinent. Pearson correlation analysis revealed limited direct correlations (<0.5) between BOD, TCB, and the input variables. However, a stepwise omission and commission analysis demonstrated that incorporating R<sub>RS</sub> into the socio-economic model significantly enhanced predictive performance of the ML models. This approach achieved high classification accuracy for BOD and TCB, with Area Under the Curve (AUC) scores of 0.84 and 0.96, respectively, highlighting strong potential for temporal water quality assessment. Among the supervised learning methods tested, the random forest model outperformed all others in terms of accuracy and robustness. This study presents an integrated framework capable of predicting BOD and TCB with both high temporal and spatial resolution, and offers valuable insights for the effective and efficient management of aquatic ecosystems, enabling timely interventions and informed decision-making.</div></div>","PeriodicalId":37879,"journal":{"name":"Groundwater for Sustainable Development","volume":"28 ","pages":"Article 101405"},"PeriodicalIF":4.9000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Groundwater for Sustainable Development","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352801X25000025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate prediction of water quality at both spatial and temporal scales for large water bodies remains a daunting task with significant implications for human well-being and sustainable development (aligned with SDG 6 - clean water and sanitation). Traditional data-driven models on water quality prediction relied on some degree of field subsistence, which are neither cost-effective nor time-efficient. Socio-economic indicators have been concurrently used as predictor variable for water quality; however, such datasets typically available at coarse temporal resolutions, limiting their applicability for time-sensitive analyses. In this study, we integrated machine learning (ML) models with socio-economic indicators and remote sensing reflectance (RRS) to address the challenge of temporality in predicting Biochemical Oxygen Demand (BOD) and Total Coliform Bacteria (TCB) levels across 228 lake systems in the Indian subcontinent. Pearson correlation analysis revealed limited direct correlations (<0.5) between BOD, TCB, and the input variables. However, a stepwise omission and commission analysis demonstrated that incorporating RRS into the socio-economic model significantly enhanced predictive performance of the ML models. This approach achieved high classification accuracy for BOD and TCB, with Area Under the Curve (AUC) scores of 0.84 and 0.96, respectively, highlighting strong potential for temporal water quality assessment. Among the supervised learning methods tested, the random forest model outperformed all others in terms of accuracy and robustness. This study presents an integrated framework capable of predicting BOD and TCB with both high temporal and spatial resolution, and offers valuable insights for the effective and efficient management of aquatic ecosystems, enabling timely interventions and informed decision-making.
期刊介绍:
Groundwater for Sustainable Development is directed to different stakeholders and professionals, including government and non-governmental organizations, international funding agencies, universities, public water institutions, public health and other public/private sector professionals, and other relevant institutions. It is aimed at professionals, academics and students in the fields of disciplines such as: groundwater and its connection to surface hydrology and environment, soil sciences, engineering, ecology, microbiology, atmospheric sciences, analytical chemistry, hydro-engineering, water technology, environmental ethics, economics, public health, policy, as well as social sciences, legal disciplines, or any other area connected with water issues. The objectives of this journal are to facilitate: • The improvement of effective and sustainable management of water resources across the globe. • The improvement of human access to groundwater resources in adequate quantity and good quality. • The meeting of the increasing demand for drinking and irrigation water needed for food security to contribute to a social and economically sound human development. • The creation of a global inter- and multidisciplinary platform and forum to improve our understanding of groundwater resources and to advocate their effective and sustainable management and protection against contamination. • Interdisciplinary information exchange and to stimulate scientific research in the fields of groundwater related sciences and social and health sciences required to achieve the United Nations Millennium Development Goals for sustainable development.