A hybrid signal decomposition-machine learning benchmarking framework for multi-station precipitation prediction in the Kébir Rhumel basin (Algeria)

IF 2.1 4区 地球科学
Aykut Erol, Issam Rehamnia, Hatice Citakoglu
{"title":"A hybrid signal decomposition-machine learning benchmarking framework for multi-station precipitation prediction in the Kébir Rhumel basin (Algeria)","authors":"Aykut Erol,&nbsp;Issam Rehamnia,&nbsp;Hatice Citakoglu","doi":"10.1007/s11600-026-01888-3","DOIUrl":null,"url":null,"abstract":"<div><p>Reliable multi-station precipitation forecasting is challenging due to nonstationarity, noise, and spatial heterogeneity. This paper introduces a hybrid signal decomposition-machine learning benchmarking framework that integrates four decomposition methods (TQWT, MODWT, EWT, VMD) with three learners (Bagging, LSBoost, KNN), yielding twelve hybrid models. These models were rigorously tested across twelve stations in the Kébir Rhumel Basin using eight statistical metrics and distributional diagnostics to assess accuracy, stability, and generalization. Two dominant families emerged: TQWT-based hybrids achieved localized accuracy at four stations, while MODWT-Bagging led at eight stations and delivered the most consistent cross-station performance. MODWT-Bagging achieved <i>R</i><sup>2</sup> = 0.984–0.993 and NSE = 0.981–0.993, with RMSE ranging from 2.64 to 6.34, demonstrating strong predictive skill under varying hydro-climatic conditions. In noise-rich environments, it substantially reduced errors; for example, at El Milia, RMSE dropped from 12.57 (VMD-LSBoost) to 6.03, a ≈ 52% reduction, and improvements of up to 63% were observed at other stations. Its superiority stems from MODWT’s shift-invariance and noise robustness combined with Bagging’s variance reduction. Taylor diagrams and violin plots confirmed centered, compact error structures, while scatter plots verified accurate phase and magnitude tracking. By clarifying how decomposition structure and learner characteristics interact across heterogeneous regimes, this framework fills a key gap in signal decomposition-machine learning model selection. The findings support adaptive hybrid design for early warning, water resource management, and precipitation-driven forecasting systems. Overall, MODWT-Bagging is established as a robust default for complex precipitation modeling, and the proposed framework provides a scalable foundation for next-generation hybrid predictive tools.</p></div>","PeriodicalId":6988,"journal":{"name":"Acta Geophysica","volume":"74 3","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2026-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11600-026-01888-3.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Geophysica","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s11600-026-01888-3","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Reliable multi-station precipitation forecasting is challenging due to nonstationarity, noise, and spatial heterogeneity. This paper introduces a hybrid signal decomposition-machine learning benchmarking framework that integrates four decomposition methods (TQWT, MODWT, EWT, VMD) with three learners (Bagging, LSBoost, KNN), yielding twelve hybrid models. These models were rigorously tested across twelve stations in the Kébir Rhumel Basin using eight statistical metrics and distributional diagnostics to assess accuracy, stability, and generalization. Two dominant families emerged: TQWT-based hybrids achieved localized accuracy at four stations, while MODWT-Bagging led at eight stations and delivered the most consistent cross-station performance. MODWT-Bagging achieved R2 = 0.984–0.993 and NSE = 0.981–0.993, with RMSE ranging from 2.64 to 6.34, demonstrating strong predictive skill under varying hydro-climatic conditions. In noise-rich environments, it substantially reduced errors; for example, at El Milia, RMSE dropped from 12.57 (VMD-LSBoost) to 6.03, a ≈ 52% reduction, and improvements of up to 63% were observed at other stations. Its superiority stems from MODWT’s shift-invariance and noise robustness combined with Bagging’s variance reduction. Taylor diagrams and violin plots confirmed centered, compact error structures, while scatter plots verified accurate phase and magnitude tracking. By clarifying how decomposition structure and learner characteristics interact across heterogeneous regimes, this framework fills a key gap in signal decomposition-machine learning model selection. The findings support adaptive hybrid design for early warning, water resource management, and precipitation-driven forecasting systems. Overall, MODWT-Bagging is established as a robust default for complex precipitation modeling, and the proposed framework provides a scalable foundation for next-generation hybrid predictive tools.

阿尔及利亚k bir Rhumel盆地多站降水预报的混合信号分解-机器学习基准框架
由于非平稳性、噪声和空间异质性,可靠的多站降水预报具有挑战性。本文介绍了一种混合信号分解-机器学习基准测试框架,该框架将四种分解方法(TQWT, MODWT, EWT, VMD)与三个学习器(Bagging, LSBoost, KNN)集成在一起,产生十二个混合模型。这些模型在ksambir Rhumel盆地的12个站点进行了严格的测试,使用8种统计指标和分布诊断来评估准确性、稳定性和泛化。出现了两个优势家族:基于tqwt的混合动力系统在4个站点实现了定位精度,而基于MODWT-Bagging的混合动力系统在8个站点实现了定位精度,并提供了最一致的跨站点性能。MODWT-Bagging的R2 = 0.984 ~ 0.993, NSE = 0.981 ~ 0.993, RMSE为2.64 ~ 6.34,在不同水文气候条件下具有较强的预测能力。在噪声丰富的环境中,它大大减少了误差;例如,在El Milia, RMSE从12.57 (VMD-LSBoost)下降到6.03,降低了约52%,其他站点的RMSE提高了63%。其优势在于MODWT的平移不变性和噪声鲁棒性,并结合Bagging的方差约简。泰勒图和小提琴图证实了中心,紧凑的误差结构,而散点图证实了准确的相位和幅度跟踪。通过澄清分解结构和学习者特征如何在异构机制中相互作用,该框架填补了信号分解-机器学习模型选择的关键空白。研究结果支持早期预警、水资源管理和降水驱动预报系统的自适应混合设计。总的来说,MODWT-Bagging已被确立为复杂降水建模的鲁棒默认值,所提出的框架为下一代混合预测工具提供了可扩展的基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Acta Geophysica
Acta Geophysica GEOCHEMISTRY & GEOPHYSICS-
CiteScore
3.80
自引率
13.00%
发文量
251
期刊介绍: Acta Geophysica is open to all kinds of manuscripts including research and review articles, short communications, comments to published papers, letters to the Editor as well as book reviews. Some of the issues are fully devoted to particular topics; we do encourage proposals for such topical issues. We accept submissions from scientists world-wide, offering high scientific and editorial standard and comprehensive treatment of the discussed topics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书