An ODE based neural network approach for PM2.5 forecasting.

IF 3.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Md Khalid Hossen, Yan-Tsung Peng, Asher Shao, Meng Chang Chen
{"title":"An ODE based neural network approach for PM2.5 forecasting.","authors":"Md Khalid Hossen, Yan-Tsung Peng, Asher Shao, Meng Chang Chen","doi":"10.1038/s41598-025-05958-2","DOIUrl":null,"url":null,"abstract":"<p><p>Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate predictions both costly and intricate. A key challenge in predicting PM2.5 concentrations lies in its variability, as the data distribution fluctuates significantly over time. Meanwhile, neural networks provide a cost-effective and highly accurate solution in managing such complexities. Deep learning models like Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) have been widely applied to PM2.5 prediction tasks. However, prediction errors increase as the forecasting window expands from 1 to 72 hours, underscoring the rising uncertainty in longer-term predictions. Recurrent Neural Networks (RNNs) with continuous-time hidden states are well-suited for modeling irregularly sampled time series but struggle with long-term dependencies due to gradient vanishing or exploding, as revealed by the ordinary differential equation (ODE) based hidden state dynamics-regardless of the ODE solver used. Continuous-time neural processes, defined by differential equations, are limited by numerical solvers, restricting scalability and hindering the modeling of complex phenomena like neural dynamics-ideally addressed via closed-form solutions. In contrast to ODE-based continuous models, closed-form networks demonstrate superior scalability over traditional deep-learning approaches. As continuous-time neural networks, Neural ODEs excel in modeling the intricate dynamics of time-series data, presenting a robust alternative to traditional LSTM models. We propose two ODE-based models: a transformer-based ODE model and a closed-form ODE model. Empirical evaluations show these models significantly enhance prediction accuracy, with improvements ranging from 2.91 to 14.15% for 1-hour to 8-hour predictions when compared to LSTM-based models. Moreover, after conducting the paired t-test, the RMSE values of the proposed model (CCCFC) were found to be significantly different from those of BILSTM, LSTM, GRU, ODE-LSTM, and PCNN,CNN-LSSTM. This implies that CCCFC demonstrates a distinct performance advantage, reinforcing its effectiveness in hourly PM2.5 forecasting.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"24830"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12246124/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-05958-2","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate predictions both costly and intricate. A key challenge in predicting PM2.5 concentrations lies in its variability, as the data distribution fluctuates significantly over time. Meanwhile, neural networks provide a cost-effective and highly accurate solution in managing such complexities. Deep learning models like Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) have been widely applied to PM2.5 prediction tasks. However, prediction errors increase as the forecasting window expands from 1 to 72 hours, underscoring the rising uncertainty in longer-term predictions. Recurrent Neural Networks (RNNs) with continuous-time hidden states are well-suited for modeling irregularly sampled time series but struggle with long-term dependencies due to gradient vanishing or exploding, as revealed by the ordinary differential equation (ODE) based hidden state dynamics-regardless of the ODE solver used. Continuous-time neural processes, defined by differential equations, are limited by numerical solvers, restricting scalability and hindering the modeling of complex phenomena like neural dynamics-ideally addressed via closed-form solutions. In contrast to ODE-based continuous models, closed-form networks demonstrate superior scalability over traditional deep-learning approaches. As continuous-time neural networks, Neural ODEs excel in modeling the intricate dynamics of time-series data, presenting a robust alternative to traditional LSTM models. We propose two ODE-based models: a transformer-based ODE model and a closed-form ODE model. Empirical evaluations show these models significantly enhance prediction accuracy, with improvements ranging from 2.91 to 14.15% for 1-hour to 8-hour predictions when compared to LSTM-based models. Moreover, after conducting the paired t-test, the RMSE values of the proposed model (CCCFC) were found to be significantly different from those of BILSTM, LSTM, GRU, ODE-LSTM, and PCNN,CNN-LSSTM. This implies that CCCFC demonstrates a distinct performance advantage, reinforcing its effectiveness in hourly PM2.5 forecasting.

Abstract Image

Abstract Image

Abstract Image

基于ODE的PM2.5预测神经网络方法。
预测时间序列数据本质上是复杂的,这刺激了先进神经网络方法的发展。监测和预测PM2.5水平尤其具有挑战性,因为影响其分散的各种自然和人为因素相互作用,要做出准确的预测既昂贵又复杂。预测PM2.5浓度的一个关键挑战在于其可变性,因为数据分布会随着时间的推移而大幅波动。同时,神经网络为管理这种复杂性提供了一种经济有效且高度精确的解决方案。长短期记忆(LSTM)和双向LSTM (BiLSTM)等深度学习模型已广泛应用于PM2.5预测任务。然而,随着预报窗口从1小时扩大到72小时,预测误差也会增加,这表明长期预报的不确定性在增加。具有连续时间隐藏状态的递归神经网络(rnn)非常适合于不规则采样时间序列的建模,但由于梯度消失或爆炸而与长期依赖关系作斗争,正如基于常微分方程(ODE)的隐藏状态动力学所揭示的那样-无论使用何种ODE求解器。由微分方程定义的连续时间神经过程受到数值求解器的限制,限制了可扩展性,阻碍了神经动力学等复杂现象的建模——理想情况下,通过封闭形式的解决方案来解决。与基于ode的连续模型相比,封闭形式的网络比传统的深度学习方法具有更好的可扩展性。作为连续时间神经网络,神经ode在对时间序列数据的复杂动态建模方面表现出色,为传统LSTM模型提供了一种鲁棒的替代方案。我们提出了两种基于ODE的模型:基于变压器的ODE模型和封闭形式的ODE模型。经验评估表明,这些模型显著提高了预测精度,与基于lstm的模型相比,1小时到8小时的预测精度提高了2.91 ~ 14.15%。此外,经过配对t检验,发现所提出模型(CCCFC)的RMSE值与BILSTM、LSTM、GRU、ODE-LSTM以及PCNN、CNN-LSSTM的RMSE值存在显著差异。这意味着CCCFC具有明显的性能优势,增强了其逐时PM2.5预报的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Scientific Reports
Scientific Reports Natural Science Disciplines-
CiteScore
7.50
自引率
4.30%
发文量
19567
审稿时长
3.9 months
期刊介绍: We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信