{"title":"Pulse transfer learning: Multi-area river ammonia nitrogen prediction with limited data","authors":"Zichen Song , Boying Nie , Sitan Huang","doi":"10.1016/j.eswa.2024.125730","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately predicting ammonia nitrogen content in irrigation water is essential for evaluating water quality, especially in ensuring the safety of agricultural water. However, most existing studies rely on traditional models such as Back Propagation (BP) neural networks, Support Vector Regression (SVR), Recurrent Neural Networks (RNN), and Long Short-Term Memory (LSTM) models within Artificial Neural Networks (ANN), along with newly proposed deep learning models like DeepTCN-GRU and GRU-N-Beats. These models, however, often focus on single monitoring points, limiting their practical applicability.</div><div>To better accommodate temporal dynamic data, we combine the existing Long Short-Term Memory (LSTM) model, which has demonstrated strong performance in handling time series datasets, with the third-generation neural network model, Spiking Neural Network (SNN). This integration leads to the development of a more temporally-driven model: SNN-LSTM (Spiking Neural Network-Long Short-Term Memory). Additionally, the multi-head attention mechanism enhances the model’s ability to process multivariate time series data. Models like BP, SVR, RNN, LSTM, CNN-LSTM, DeepTCN-GRU, and GRU-N-Beats were trained and tested on water quality data from Huangshui River Bridge, Xigu District, Lanzhou. Comparing the MAE, R-Squared, and RMSE of predictions shows that the SNN-LSTM with multi-head attention significantly outperforms other models.</div><div>The Elastic Weight Consolidation (EWC) method was used to integrate spatial and temporal features, enhancing model stability across regions. Using data from eight monitoring sites in Gansu Province, the EWC-enhanced SNN-LSTM-MT model was evaluated. Results based on MAE and RMSE showed improved generalization and reliability in predictions across different regions.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"263 ","pages":"Article 125730"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424025971","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately predicting ammonia nitrogen content in irrigation water is essential for evaluating water quality, especially in ensuring the safety of agricultural water. However, most existing studies rely on traditional models such as Back Propagation (BP) neural networks, Support Vector Regression (SVR), Recurrent Neural Networks (RNN), and Long Short-Term Memory (LSTM) models within Artificial Neural Networks (ANN), along with newly proposed deep learning models like DeepTCN-GRU and GRU-N-Beats. These models, however, often focus on single monitoring points, limiting their practical applicability.
To better accommodate temporal dynamic data, we combine the existing Long Short-Term Memory (LSTM) model, which has demonstrated strong performance in handling time series datasets, with the third-generation neural network model, Spiking Neural Network (SNN). This integration leads to the development of a more temporally-driven model: SNN-LSTM (Spiking Neural Network-Long Short-Term Memory). Additionally, the multi-head attention mechanism enhances the model’s ability to process multivariate time series data. Models like BP, SVR, RNN, LSTM, CNN-LSTM, DeepTCN-GRU, and GRU-N-Beats were trained and tested on water quality data from Huangshui River Bridge, Xigu District, Lanzhou. Comparing the MAE, R-Squared, and RMSE of predictions shows that the SNN-LSTM with multi-head attention significantly outperforms other models.
The Elastic Weight Consolidation (EWC) method was used to integrate spatial and temporal features, enhancing model stability across regions. Using data from eight monitoring sites in Gansu Province, the EWC-enhanced SNN-LSTM-MT model was evaluated. Results based on MAE and RMSE showed improved generalization and reliability in predictions across different regions.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.