MamGA：基于mamba和深度关注层的双通道并行月径流预测的深度神经网络架构

IF 6.3 1区地球科学 Q1 ENGINEERING, CIVIL

Journal of Hydrology Pub Date : 2025-09-22 DOI:10.1016/j.jhydrol.2025.134304

Wen-chuan Wang , Wei-can Tian , Ming-lei Ren , Dong-mei Xu

{"title":"MamGA：基于mamba和深度关注层的双通道并行月径流预测的深度神经网络架构","authors":"Wen-chuan Wang , Wei-can Tian , Ming-lei Ren , Dong-mei Xu","doi":"10.1016/j.jhydrol.2025.134304","DOIUrl":null,"url":null,"abstract":"<div><div>Monthly runoff prediction is crucial in water resource management, involving both short-term hydrological dynamics and long-term planning. It has a decisive impact on flood prevention, resource allocation, and ecological protection. In the context of increasing uncertainties in runoff due to climate change and human activities, accurate monthly runoff forecasting becomes even more essential. Therefore, this paper proposes a novel dual-channel parallel monthly runoff prediction deep neural network architecture—MamGA—built on the significant application value of deep neural networks in runoff prediction. The architecture first introduces the Mamba model, which employs a selection mechanism to achieve selective information propagation and suppression, effectively enhancing the processing capability of global feature information while reducing the computational complexity of modelling long sequences. Furthermore, this paper incorporates a Depth-gated Attention Layer that combines bidirectional depth-gated modules and linear attention mechanisms to address the shortcomings of the Mamba network in unidirectional information processing. Integrating an Embedded Coding layer and a Sequential Decoding layer constructs an efficient coding and decoding system, further strengthening the model’s ability to capture global features and temporal information. To validate the effectiveness and advancement of the MamGA model, this study selected the Manwan Station (MW), Xiaowan Station (XW) in China, and the Thunder Creek Station (TC) in the United States as experimental subjects. Five evaluation metrics were employed for comparative analysis against nine benchmark models. The experimental results indicate that the MamGA model exhibits significant superiority across all cases. For instance, at the MW station, compared to the Long Short-Term Memory (LSTM) model, the MamGA model reduced the Mean Absolute Error (MAE) and Normalized Root Mean Square Error (NRMSE) by 33.08% and 23.93%, respectively. Meanwhile, the Nash Efficiency Coefficient (NSE), correlation coefficient (R), and Kling-Gupta Efficiency (KGE) improved by 8.41%, 3.93%, and 8.36%, respectively, with both R and NSE exceeding 0.9. The MamGA model also demonstrated significant performance improvements at other stations compared to the competing models. The study suggests that the MamGA model, as an advanced tool for monthly runoff prediction, can significantly enhance the accuracy of runoff forecasting, providing robust support for the optimal allocation and management of water resources.</div></div>","PeriodicalId":362,"journal":{"name":"Journal of Hydrology","volume":"663 ","pages":"Article 134304"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MamGA: a deep neural network architecture for dual-channel parallel monthly runoff prediction based on mamba and depth-gated attention layer\",\"authors\":\"Wen-chuan Wang , Wei-can Tian , Ming-lei Ren , Dong-mei Xu\",\"doi\":\"10.1016/j.jhydrol.2025.134304\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Monthly runoff prediction is crucial in water resource management, involving both short-term hydrological dynamics and long-term planning. It has a decisive impact on flood prevention, resource allocation, and ecological protection. In the context of increasing uncertainties in runoff due to climate change and human activities, accurate monthly runoff forecasting becomes even more essential. Therefore, this paper proposes a novel dual-channel parallel monthly runoff prediction deep neural network architecture—MamGA—built on the significant application value of deep neural networks in runoff prediction. The architecture first introduces the Mamba model, which employs a selection mechanism to achieve selective information propagation and suppression, effectively enhancing the processing capability of global feature information while reducing the computational complexity of modelling long sequences. Furthermore, this paper incorporates a Depth-gated Attention Layer that combines bidirectional depth-gated modules and linear attention mechanisms to address the shortcomings of the Mamba network in unidirectional information processing. Integrating an Embedded Coding layer and a Sequential Decoding layer constructs an efficient coding and decoding system, further strengthening the model’s ability to capture global features and temporal information. To validate the effectiveness and advancement of the MamGA model, this study selected the Manwan Station (MW), Xiaowan Station (XW) in China, and the Thunder Creek Station (TC) in the United States as experimental subjects. Five evaluation metrics were employed for comparative analysis against nine benchmark models. The experimental results indicate that the MamGA model exhibits significant superiority across all cases. For instance, at the MW station, compared to the Long Short-Term Memory (LSTM) model, the MamGA model reduced the Mean Absolute Error (MAE) and Normalized Root Mean Square Error (NRMSE) by 33.08% and 23.93%, respectively. Meanwhile, the Nash Efficiency Coefficient (NSE), correlation coefficient (R), and Kling-Gupta Efficiency (KGE) improved by 8.41%, 3.93%, and 8.36%, respectively, with both R and NSE exceeding 0.9. The MamGA model also demonstrated significant performance improvements at other stations compared to the competing models. The study suggests that the MamGA model, as an advanced tool for monthly runoff prediction, can significantly enhance the accuracy of runoff forecasting, providing robust support for the optimal allocation and management of water resources.</div></div>\",\"PeriodicalId\":362,\"journal\":{\"name\":\"Journal of Hydrology\",\"volume\":\"663 \",\"pages\":\"Article 134304\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hydrology\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0022169425016440\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022169425016440","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

摘要

每月径流预测在水资源管理中至关重要，涉及短期水文动态和长期规划。它对防洪、资源配置和生态保护具有决定性影响。在气候变化和人类活动导致径流不确定性增加的背景下，准确的月度径流预报变得更加重要。因此，本文基于深度神经网络在径流预测中的重要应用价值，提出了一种新的双通道并行月度径流预测深度神经网络架构——mamga。该体系结构首先引入了Mamba模型，采用选择机制实现信息的选择性传播和抑制，有效增强了对全局特征信息的处理能力，同时降低了长序列建模的计算复杂度。此外，结合双向深度门控模块和线性注意机制的深度门控注意层，解决了曼巴网络在单向信息处理方面的不足。嵌入编码层和顺序解码层的集成构建了高效的编解码系统，进一步增强了模型捕获全局特征和时序信息的能力。为了验证MamGA模型的有效性和先进性，本研究选择了中国的漫湾站（MW）、小湾站（XW）和美国的雷溪站（TC）作为实验对象。采用5个评价指标对9个基准模型进行比较分析。实验结果表明，MamGA模型在所有情况下都具有显著的优越性。以MW台站为例，与LSTM模型相比，MamGA模型的平均绝对误差（MAE）和标准化均方根误差（NRMSE）分别降低了33.08%和23.93%。纳什效率系数（NSE）、相关系数(R)和克林-古普塔效率（KGE）分别提高了8.41%、3.93%和8.36%，R和NSE均超过0.9。与竞争模型相比，MamGA模型在其他站点也显示出显着的性能改进。研究表明，MamGA模型作为一种先进的月度径流预测工具，可以显著提高径流预测的准确性，为水资源的优化配置和管理提供有力支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

MamGA: a deep neural network architecture for dual-channel parallel monthly runoff prediction based on mamba and depth-gated attention layer

Monthly runoff prediction is crucial in water resource management, involving both short-term hydrological dynamics and long-term planning. It has a decisive impact on flood prevention, resource allocation, and ecological protection. In the context of increasing uncertainties in runoff due to climate change and human activities, accurate monthly runoff forecasting becomes even more essential. Therefore, this paper proposes a novel dual-channel parallel monthly runoff prediction deep neural network architecture—MamGA—built on the significant application value of deep neural networks in runoff prediction. The architecture first introduces the Mamba model, which employs a selection mechanism to achieve selective information propagation and suppression, effectively enhancing the processing capability of global feature information while reducing the computational complexity of modelling long sequences. Furthermore, this paper incorporates a Depth-gated Attention Layer that combines bidirectional depth-gated modules and linear attention mechanisms to address the shortcomings of the Mamba network in unidirectional information processing. Integrating an Embedded Coding layer and a Sequential Decoding layer constructs an efficient coding and decoding system, further strengthening the model’s ability to capture global features and temporal information. To validate the effectiveness and advancement of the MamGA model, this study selected the Manwan Station (MW), Xiaowan Station (XW) in China, and the Thunder Creek Station (TC) in the United States as experimental subjects. Five evaluation metrics were employed for comparative analysis against nine benchmark models. The experimental results indicate that the MamGA model exhibits significant superiority across all cases. For instance, at the MW station, compared to the Long Short-Term Memory (LSTM) model, the MamGA model reduced the Mean Absolute Error (MAE) and Normalized Root Mean Square Error (NRMSE) by 33.08% and 23.93%, respectively. Meanwhile, the Nash Efficiency Coefficient (NSE), correlation coefficient (R), and Kling-Gupta Efficiency (KGE) improved by 8.41%, 3.93%, and 8.36%, respectively, with both R and NSE exceeding 0.9. The MamGA model also demonstrated significant performance improvements at other stations compared to the competing models. The study suggests that the MamGA model, as an advanced tool for monthly runoff prediction, can significantly enhance the accuracy of runoff forecasting, providing robust support for the optimal allocation and management of water resources.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Hydrology 地学-地球科学综合

CiteScore

11.00

自引率

12.50%

发文量

1309

审稿时长

7.5 months

期刊介绍： The Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology and hydrogeology. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, hydraulics, agrohydrology, geomorphology, soil science, instrumentation and remote sensing, civil and environmental engineering are included. Social science perspectives on hydrological problems such as resource and ecological economics, environmental sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site.