{"title":"Long-term prediction for karst spring discharge and petroleum substances concentration based on the combination of LSTM and Transformer models","authors":"Feng Jiang, Qiang Li, Guotao Sun, Qixin Wu, Shuang Liu, Kebuzi Jiqin, Ruofan Wang, Hanwu Liu, Wei Hu","doi":"10.1016/j.watres.2025.123148","DOIUrl":null,"url":null,"abstract":"Monitoring the quantity and quality of karst springs is essential for groundwater resource management. However, it is challenging to robustly forecast the karst spring discharge and pollutant concentration due to the high complexity and heterogeneity of karst aquifers. Few researchers have addressed the long-term prediction of hourly spring quantity and quality, which is crucial for emergency management. Here, we develop an ensemble model based on the long short-term memory (LSTM) and iTransformer models, with a random forest model as a meta-model to combine the base models. Experiments were conducted on hourly spring discharge and pollutant concentration predictions at the Xianrendong Spring, Guizhou, China, using a dataset comprising 2106 hours of precipitation from four stations, spring discharge, and petroleum substances concentrations. The results indicate that the LSTM model can capture short-term dependencies but struggles with long-term variations, while the iTransformer can quickly apprehend complex patterns but tends to result in overfitting. By combining the strengths of LSTM and iTransformer, the ensemble model balances stability and sensitivity, reducing the bias and variance of individual models, and enhancing overall prediction accuracy. The ensemble model consistently outperforms both LSTM and iTransformer across all time steps (24, 36, 48, and 60 hours) and longer lead times (6-10 hours). The robust prediction with long lead times enables the ensemble model to effectively mitigate the hazard caused by petroleum substances leakage.","PeriodicalId":443,"journal":{"name":"Water Research","volume":"1 1","pages":""},"PeriodicalIF":11.4000,"publicationDate":"2025-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Research","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.watres.2025.123148","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Monitoring the quantity and quality of karst springs is essential for groundwater resource management. However, it is challenging to robustly forecast the karst spring discharge and pollutant concentration due to the high complexity and heterogeneity of karst aquifers. Few researchers have addressed the long-term prediction of hourly spring quantity and quality, which is crucial for emergency management. Here, we develop an ensemble model based on the long short-term memory (LSTM) and iTransformer models, with a random forest model as a meta-model to combine the base models. Experiments were conducted on hourly spring discharge and pollutant concentration predictions at the Xianrendong Spring, Guizhou, China, using a dataset comprising 2106 hours of precipitation from four stations, spring discharge, and petroleum substances concentrations. The results indicate that the LSTM model can capture short-term dependencies but struggles with long-term variations, while the iTransformer can quickly apprehend complex patterns but tends to result in overfitting. By combining the strengths of LSTM and iTransformer, the ensemble model balances stability and sensitivity, reducing the bias and variance of individual models, and enhancing overall prediction accuracy. The ensemble model consistently outperforms both LSTM and iTransformer across all time steps (24, 36, 48, and 60 hours) and longer lead times (6-10 hours). The robust prediction with long lead times enables the ensemble model to effectively mitigate the hazard caused by petroleum substances leakage.
期刊介绍:
Water Research, along with its open access companion journal Water Research X, serves as a platform for publishing original research papers covering various aspects of the science and technology related to the anthropogenic water cycle, water quality, and its management worldwide. The audience targeted by the journal comprises biologists, chemical engineers, chemists, civil engineers, environmental engineers, limnologists, and microbiologists. The scope of the journal include:
•Treatment processes for water and wastewaters (municipal, agricultural, industrial, and on-site treatment), including resource recovery and residuals management;
•Urban hydrology including sewer systems, stormwater management, and green infrastructure;
•Drinking water treatment and distribution;
•Potable and non-potable water reuse;
•Sanitation, public health, and risk assessment;
•Anaerobic digestion, solid and hazardous waste management, including source characterization and the effects and control of leachates and gaseous emissions;
•Contaminants (chemical, microbial, anthropogenic particles such as nanoparticles or microplastics) and related water quality sensing, monitoring, fate, and assessment;
•Anthropogenic impacts on inland, tidal, coastal and urban waters, focusing on surface and ground waters, and point and non-point sources of pollution;
•Environmental restoration, linked to surface water, groundwater and groundwater remediation;
•Analysis of the interfaces between sediments and water, and between water and atmosphere, focusing specifically on anthropogenic impacts;
•Mathematical modelling, systems analysis, machine learning, and beneficial use of big data related to the anthropogenic water cycle;
•Socio-economic, policy, and regulations studies.