Antonio J. Aragón-Barroso , David Ribes , Francisco Osorio
{"title":"加强饮用水安全:利用机器学习和多感官技术实时预测配水系统中的三卤甲烷","authors":"Antonio J. Aragón-Barroso , David Ribes , Francisco Osorio","doi":"10.1016/j.ecoenv.2025.118243","DOIUrl":null,"url":null,"abstract":"<div><div>Prolonged exposure to high concentrations of trihalomethanes (THMs) may generate human health risks due to their carcinogenic and mutagenic properties. Therefore, monitoring THMs in drinking water distribution systems (DWDS) is essential. This study focused on the statistical modelling of THMs formation through multiple linear regression (MLR) method to develop simple predictive models that serve as preventive tools capable of alerting about potential increases in THMs within the water network. To achieve this, a dataset comprising 1192 observations of water quality measurements in the study area over five years was created. The independent variables selected to explain the formation of THMs were free residual chlorine (FRC), total organic carbon (TOC), conductivity, pH and turbidity. Then, following an exploratory analysis of the dataset using Pearson’s correlation matrix and an ANOVA test, multiple regression models were developed. In total, a total of two predictive models were built, based on data filtered by conductivity levels, with coefficients of determination (R<sup>2</sup>) of 0.64 and 0.47. The algorithms of these predictive models were integrated into the control center of the water company in the study area. On the other hand, a multisensory device was installed in a strategically located drinking water tank to measure the values of the independent variables used in the models. These measurements were transmitted online to the control center to continuously update the predictive models and provide real-time forecasts of THMs. Finally, model validation was performed by comparing the real-time predictions of the models with actual THMs levels obtained from laboratory analyses, achieving an average accuracy of 90 %.</div></div>","PeriodicalId":303,"journal":{"name":"Ecotoxicology and Environmental Safety","volume":"297 ","pages":"Article 118243"},"PeriodicalIF":6.2000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing drinking water safety: Real-time prediction of trihalomethanes in a water distribution system using machine learning and multisensory technology\",\"authors\":\"Antonio J. Aragón-Barroso , David Ribes , Francisco Osorio\",\"doi\":\"10.1016/j.ecoenv.2025.118243\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Prolonged exposure to high concentrations of trihalomethanes (THMs) may generate human health risks due to their carcinogenic and mutagenic properties. Therefore, monitoring THMs in drinking water distribution systems (DWDS) is essential. This study focused on the statistical modelling of THMs formation through multiple linear regression (MLR) method to develop simple predictive models that serve as preventive tools capable of alerting about potential increases in THMs within the water network. To achieve this, a dataset comprising 1192 observations of water quality measurements in the study area over five years was created. The independent variables selected to explain the formation of THMs were free residual chlorine (FRC), total organic carbon (TOC), conductivity, pH and turbidity. Then, following an exploratory analysis of the dataset using Pearson’s correlation matrix and an ANOVA test, multiple regression models were developed. In total, a total of two predictive models were built, based on data filtered by conductivity levels, with coefficients of determination (R<sup>2</sup>) of 0.64 and 0.47. The algorithms of these predictive models were integrated into the control center of the water company in the study area. On the other hand, a multisensory device was installed in a strategically located drinking water tank to measure the values of the independent variables used in the models. These measurements were transmitted online to the control center to continuously update the predictive models and provide real-time forecasts of THMs. Finally, model validation was performed by comparing the real-time predictions of the models with actual THMs levels obtained from laboratory analyses, achieving an average accuracy of 90 %.</div></div>\",\"PeriodicalId\":303,\"journal\":{\"name\":\"Ecotoxicology and Environmental Safety\",\"volume\":\"297 \",\"pages\":\"Article 118243\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecotoxicology and Environmental Safety\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0147651325005792\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecotoxicology and Environmental Safety","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0147651325005792","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Enhancing drinking water safety: Real-time prediction of trihalomethanes in a water distribution system using machine learning and multisensory technology
Prolonged exposure to high concentrations of trihalomethanes (THMs) may generate human health risks due to their carcinogenic and mutagenic properties. Therefore, monitoring THMs in drinking water distribution systems (DWDS) is essential. This study focused on the statistical modelling of THMs formation through multiple linear regression (MLR) method to develop simple predictive models that serve as preventive tools capable of alerting about potential increases in THMs within the water network. To achieve this, a dataset comprising 1192 observations of water quality measurements in the study area over five years was created. The independent variables selected to explain the formation of THMs were free residual chlorine (FRC), total organic carbon (TOC), conductivity, pH and turbidity. Then, following an exploratory analysis of the dataset using Pearson’s correlation matrix and an ANOVA test, multiple regression models were developed. In total, a total of two predictive models were built, based on data filtered by conductivity levels, with coefficients of determination (R2) of 0.64 and 0.47. The algorithms of these predictive models were integrated into the control center of the water company in the study area. On the other hand, a multisensory device was installed in a strategically located drinking water tank to measure the values of the independent variables used in the models. These measurements were transmitted online to the control center to continuously update the predictive models and provide real-time forecasts of THMs. Finally, model validation was performed by comparing the real-time predictions of the models with actual THMs levels obtained from laboratory analyses, achieving an average accuracy of 90 %.
期刊介绍:
Ecotoxicology and Environmental Safety is a multi-disciplinary journal that focuses on understanding the exposure and effects of environmental contamination on organisms including human health. The scope of the journal covers three main themes. The topics within these themes, indicated below, include (but are not limited to) the following: Ecotoxicology、Environmental Chemistry、Environmental Safety etc.