Comparison and prediction of shallow groundwater nitrate in Shaying River basin based on urban distribution using multiple machine learning approaches.
{"title":"Comparison and prediction of shallow groundwater nitrate in Shaying River basin based on urban distribution using multiple machine learning approaches.","authors":"Zipeng Huang, Baonan He, Yanjia Chu, Yuanbo Song, Zheng Shen","doi":"10.1002/wer.70033","DOIUrl":null,"url":null,"abstract":"<p><p>Groundwater, a pivotal water resource in numerous regions worldwide, confronts formidable challenges posed by severe nitrate pollution. Traditional research methodologies aimed at addressing groundwater nitrate contamination frequently struggle to accurately depict the intricate conditions of the groundwater environment, particularly when dealing with high variability and nonlinear data. However, the advent of machine learning (ML) has heralded an innovative approach to simulating groundwater dynamics. In this study, six ML algorithms were deployed to model the concentrations of shallow groundwater nitrates in the Shaying River Basin. The efficacy of each model was assessed through comprehensive metrics including the coefficient of determination (R<sup>2</sup>), mean absolute error (MAE), and root mean square error (RMSE), gauging the alignment between observed and predicted groundwater nitrate levels. Subsequently, to discern the principal environmental factors influencing NO<sub>3</sub>-N concentrations, the most proficient model was selected. Among the array of models, the XGB algorithm, renowned for its capacity to handle extreme values, demonstrated superior performance (R<sup>2</sup> = 0.773, MAE = 7.625, RMSE = 11.92). Through an in-depth analysis of groundwater NO<sub>3</sub>-N across major urban centers, Fuyang city was identified as the most heavily contaminated locale, attributing the phenomenon to potential sources such as domestic sewage and agricultural activities (feature importance of Cl<sup>-</sup> = 78.64%). Conversely, Zhengzhou city emerged as the least polluted city, with notable influences from K<sup>+</sup> and NO<sub>2</sub> <sup>-</sup> (feature importance = 52.06% and 18.41%), indicative of a prevailing reducing environment compared to other cities. In summation, this study explores a methodology for amalgamating diverse environmental variables in the investigation of groundwater contamination. Such insights hold profound implications for the effective management and mitigation of nitrate contamination in the Shaying River Basin, offering a demonstration for similar endeavors in analogous regions. PRACTITIONER POINTS: Six machine learning models were utilized to simulate the nitrate contamination. XGB model for groundwater nitrate pollution prediction outperformed other models. Relative importance of environmental variables was identified using the XGB model. Impact of main environmental variables on groundwater nitrate was discussed.</p>","PeriodicalId":23621,"journal":{"name":"Water Environment Research","volume":"97 2","pages":"e70033"},"PeriodicalIF":2.5000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Environment Research","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1002/wer.70033","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Groundwater, a pivotal water resource in numerous regions worldwide, confronts formidable challenges posed by severe nitrate pollution. Traditional research methodologies aimed at addressing groundwater nitrate contamination frequently struggle to accurately depict the intricate conditions of the groundwater environment, particularly when dealing with high variability and nonlinear data. However, the advent of machine learning (ML) has heralded an innovative approach to simulating groundwater dynamics. In this study, six ML algorithms were deployed to model the concentrations of shallow groundwater nitrates in the Shaying River Basin. The efficacy of each model was assessed through comprehensive metrics including the coefficient of determination (R2), mean absolute error (MAE), and root mean square error (RMSE), gauging the alignment between observed and predicted groundwater nitrate levels. Subsequently, to discern the principal environmental factors influencing NO3-N concentrations, the most proficient model was selected. Among the array of models, the XGB algorithm, renowned for its capacity to handle extreme values, demonstrated superior performance (R2 = 0.773, MAE = 7.625, RMSE = 11.92). Through an in-depth analysis of groundwater NO3-N across major urban centers, Fuyang city was identified as the most heavily contaminated locale, attributing the phenomenon to potential sources such as domestic sewage and agricultural activities (feature importance of Cl- = 78.64%). Conversely, Zhengzhou city emerged as the least polluted city, with notable influences from K+ and NO2- (feature importance = 52.06% and 18.41%), indicative of a prevailing reducing environment compared to other cities. In summation, this study explores a methodology for amalgamating diverse environmental variables in the investigation of groundwater contamination. Such insights hold profound implications for the effective management and mitigation of nitrate contamination in the Shaying River Basin, offering a demonstration for similar endeavors in analogous regions. PRACTITIONER POINTS: Six machine learning models were utilized to simulate the nitrate contamination. XGB model for groundwater nitrate pollution prediction outperformed other models. Relative importance of environmental variables was identified using the XGB model. Impact of main environmental variables on groundwater nitrate was discussed.
期刊介绍:
Published since 1928, Water Environment Research (WER) is an international multidisciplinary water resource management journal for the dissemination of fundamental and applied research in all scientific and technical areas related to water quality and resource recovery. WER''s goal is to foster communication and interdisciplinary research between water sciences and related fields such as environmental toxicology, agriculture, public and occupational health, microbiology, and ecology. In addition to original research articles, short communications, case studies, reviews, and perspectives are encouraged.