利用信号处理预测藻华:来自集成学习的新视角

IF 11.4 1区 环境科学与生态学 Q1 ENGINEERING, ENVIRONMENTAL
Caicai Xu , Yuzhou Huang , Ruoxue Xin , Na Wu , Muyuan Liu
{"title":"利用信号处理预测藻华:来自集成学习的新视角","authors":"Caicai Xu ,&nbsp;Yuzhou Huang ,&nbsp;Ruoxue Xin ,&nbsp;Na Wu ,&nbsp;Muyuan Liu","doi":"10.1016/j.watres.2025.123800","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate forecasting of algal blooms is essential for implementing timely control measures. However, given their inherent complex time-frequency characteristics, capturing the dynamics of algal blooms remains an ongoing challenge in standalone models. Targeting this challenge, this study demonstrates an ensemble framework that combines signal processing with machine learning (ML) techniques to collectively forecast algal dynamics. This method utilizes an efficient signal processing algorithm, namely the compete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), to decompose the highly non-stationary patterns of algal dynamics, while leveraging the complementary strengths of four distinct ML models to optimize the learning of the decomposed components. Our results demonstrated that CEEMDAN can largely improve the forecasting performance of standalone ML models (e.g., long short-term memory), achieving an average increase in validation <em>R</em><sup>2</sup> by 63 %. Moreover, by incorporating the ensemble effects that leverage model-specific strengths, this performance gain was further amplified, resulting in an average increase of 75 % in validation <em>R</em><sup>2</sup> compared to standalone ML models. The developed method, termed CEEMDAN-Hybrid-Ensemble (CHES) model, consistently delivered accurate forecasting of algal dynamics across multiple time resolutions (hourly, daily, and biweekly) in both rivers (River Enborne and The Cut) and lakes (Blelham Tarn and Lake Lillinonah), as suggested by high validation <em>R</em><sup>2</sup> values of 0.955, 0.878, 0.824, and 0.957, respectively. In addition, the CHES model achieved stable multi-step forecasting of algal dynamics with gaps ranging from 1 to 7 steps, as indicated by an average validation <em>R</em><sup>2</sup> of 0.72 <span><math><mo>±</mo></math></span> 0.17 (S.D.) and an average validation root-mean-square-error (RMSE) of 0.32 <span><math><mo>±</mo></math></span> 0.11 RFU. This study highlighted the ensemble effect achieved by integrating signal processing and ML techniques, presenting a novel perspective that enhances forecasting robustness to support the early warning of algal blooms.</div></div>","PeriodicalId":443,"journal":{"name":"Water Research","volume":"283 ","pages":"Article 123800"},"PeriodicalIF":11.4000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Algal bloom forecasting leveraging signal processing: A novel perspective from ensemble learning\",\"authors\":\"Caicai Xu ,&nbsp;Yuzhou Huang ,&nbsp;Ruoxue Xin ,&nbsp;Na Wu ,&nbsp;Muyuan Liu\",\"doi\":\"10.1016/j.watres.2025.123800\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate forecasting of algal blooms is essential for implementing timely control measures. However, given their inherent complex time-frequency characteristics, capturing the dynamics of algal blooms remains an ongoing challenge in standalone models. Targeting this challenge, this study demonstrates an ensemble framework that combines signal processing with machine learning (ML) techniques to collectively forecast algal dynamics. This method utilizes an efficient signal processing algorithm, namely the compete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), to decompose the highly non-stationary patterns of algal dynamics, while leveraging the complementary strengths of four distinct ML models to optimize the learning of the decomposed components. Our results demonstrated that CEEMDAN can largely improve the forecasting performance of standalone ML models (e.g., long short-term memory), achieving an average increase in validation <em>R</em><sup>2</sup> by 63 %. Moreover, by incorporating the ensemble effects that leverage model-specific strengths, this performance gain was further amplified, resulting in an average increase of 75 % in validation <em>R</em><sup>2</sup> compared to standalone ML models. The developed method, termed CEEMDAN-Hybrid-Ensemble (CHES) model, consistently delivered accurate forecasting of algal dynamics across multiple time resolutions (hourly, daily, and biweekly) in both rivers (River Enborne and The Cut) and lakes (Blelham Tarn and Lake Lillinonah), as suggested by high validation <em>R</em><sup>2</sup> values of 0.955, 0.878, 0.824, and 0.957, respectively. In addition, the CHES model achieved stable multi-step forecasting of algal dynamics with gaps ranging from 1 to 7 steps, as indicated by an average validation <em>R</em><sup>2</sup> of 0.72 <span><math><mo>±</mo></math></span> 0.17 (S.D.) and an average validation root-mean-square-error (RMSE) of 0.32 <span><math><mo>±</mo></math></span> 0.11 RFU. This study highlighted the ensemble effect achieved by integrating signal processing and ML techniques, presenting a novel perspective that enhances forecasting robustness to support the early warning of algal blooms.</div></div>\",\"PeriodicalId\":443,\"journal\":{\"name\":\"Water Research\",\"volume\":\"283 \",\"pages\":\"Article 123800\"},\"PeriodicalIF\":11.4000,\"publicationDate\":\"2025-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Water Research\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0043135425007092\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0043135425007092","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

准确预报藻华对及时实施控制措施至关重要。然而,考虑到藻华固有的复杂时频特性,在独立模型中捕捉藻华的动态仍然是一个持续的挑战。针对这一挑战,本研究展示了一个集成框架,将信号处理与机器学习(ML)技术相结合,共同预测藻类动态。该方法利用一种高效的信号处理算法,即竞争集成经验模态分解与自适应噪声(CEEMDAN),来分解藻类动力学的高度非平稳模式,同时利用四种不同ML模型的互补优势来优化分解组件的学习。我们的研究结果表明,CEEMDAN可以在很大程度上提高独立ML模型(例如,长短期记忆或LSTM)的预测性能,实现验证R2平均提高63%。此外,通过整合利用模型特定优势的集成效应,这种性能增益被进一步放大,与独立的ML模型相比,验证R2平均增加了75%。所开发的方法,称为CEEMDAN-Hybrid-Ensemble (CHES)模型,在河流(Enborne河和The Cut河)和湖泊(Blelham Tarn湖和Lillinonah湖)的多个时间分辨率(每小时、每天和每两周)中始终提供准确的藻类动态预测,验证R2值分别为0.955、0.878、0.824和0.957。此外,该模型在1 ~ 7步范围内实现了稳定的藻类动态多步预测,平均验证R2为0.72±±0.17 (sd),平均验证均方根误差(RMSE)为0.32±0.11 RFU。本研究强调了通过整合信号处理和ML技术实现的集成效应,提出了一个新的视角,增强了预测的鲁棒性,以支持藻华的早期预警。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Algal bloom forecasting leveraging signal processing: A novel perspective from ensemble learning

Algal bloom forecasting leveraging signal processing: A novel perspective from ensemble learning
Accurate forecasting of algal blooms is essential for implementing timely control measures. However, given their inherent complex time-frequency characteristics, capturing the dynamics of algal blooms remains an ongoing challenge in standalone models. Targeting this challenge, this study demonstrates an ensemble framework that combines signal processing with machine learning (ML) techniques to collectively forecast algal dynamics. This method utilizes an efficient signal processing algorithm, namely the compete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), to decompose the highly non-stationary patterns of algal dynamics, while leveraging the complementary strengths of four distinct ML models to optimize the learning of the decomposed components. Our results demonstrated that CEEMDAN can largely improve the forecasting performance of standalone ML models (e.g., long short-term memory), achieving an average increase in validation R2 by 63 %. Moreover, by incorporating the ensemble effects that leverage model-specific strengths, this performance gain was further amplified, resulting in an average increase of 75 % in validation R2 compared to standalone ML models. The developed method, termed CEEMDAN-Hybrid-Ensemble (CHES) model, consistently delivered accurate forecasting of algal dynamics across multiple time resolutions (hourly, daily, and biweekly) in both rivers (River Enborne and The Cut) and lakes (Blelham Tarn and Lake Lillinonah), as suggested by high validation R2 values of 0.955, 0.878, 0.824, and 0.957, respectively. In addition, the CHES model achieved stable multi-step forecasting of algal dynamics with gaps ranging from 1 to 7 steps, as indicated by an average validation R2 of 0.72 ± 0.17 (S.D.) and an average validation root-mean-square-error (RMSE) of 0.32 ± 0.11 RFU. This study highlighted the ensemble effect achieved by integrating signal processing and ML techniques, presenting a novel perspective that enhances forecasting robustness to support the early warning of algal blooms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Water Research
Water Research 环境科学-工程:环境
CiteScore
20.80
自引率
9.40%
发文量
1307
审稿时长
38 days
期刊介绍: Water Research, along with its open access companion journal Water Research X, serves as a platform for publishing original research papers covering various aspects of the science and technology related to the anthropogenic water cycle, water quality, and its management worldwide. The audience targeted by the journal comprises biologists, chemical engineers, chemists, civil engineers, environmental engineers, limnologists, and microbiologists. The scope of the journal include: •Treatment processes for water and wastewaters (municipal, agricultural, industrial, and on-site treatment), including resource recovery and residuals management; •Urban hydrology including sewer systems, stormwater management, and green infrastructure; •Drinking water treatment and distribution; •Potable and non-potable water reuse; •Sanitation, public health, and risk assessment; •Anaerobic digestion, solid and hazardous waste management, including source characterization and the effects and control of leachates and gaseous emissions; •Contaminants (chemical, microbial, anthropogenic particles such as nanoparticles or microplastics) and related water quality sensing, monitoring, fate, and assessment; •Anthropogenic impacts on inland, tidal, coastal and urban waters, focusing on surface and ground waters, and point and non-point sources of pollution; •Environmental restoration, linked to surface water, groundwater and groundwater remediation; •Analysis of the interfaces between sediments and water, and between water and atmosphere, focusing specifically on anthropogenic impacts; •Mathematical modelling, systems analysis, machine learning, and beneficial use of big data related to the anthropogenic water cycle; •Socio-economic, policy, and regulations studies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信