评估机器学习和概率模型对复杂气候特征的普遍预测能力

IF 3.9 3区 环境科学与生态学 Q1 ENGINEERING, CIVIL
Aamina Batool, Zulfiqar Ali, Muhammad Mohsin, Atef Masmoudi, Veysi Kartal, Samina Satti
{"title":"评估机器学习和概率模型对复杂气候特征的普遍预测能力","authors":"Aamina Batool, Zulfiqar Ali, Muhammad Mohsin, Atef Masmoudi, Veysi Kartal, Samina Satti","doi":"10.1007/s00477-024-02721-3","DOIUrl":null,"url":null,"abstract":"<p>Climate changes and global warming increase risk of recurrent extreme and complex climatic features. It necessitates accurate modeling and forecasting of climate phenomena for sustainable development goals. However, machine learning algorithms and advanced statistical models are extensively employed to analyze complex data and make predictions related to climate phenomena. It is important to have comprehensive knowledge to use these models and consider their potential implications. This study aims to evaluate and compare some popular machine learning and probabilistic methods by analyzing various time series indices associated with precipitation and temperature. For application, time series data of Standardized Precipitation Temperature Index (SPTI), Standardized Temperature Index (STI), Standardized Compound Drought and Heat Index (SCDHI), and Biased Diminished Weighted Regional Drought Index (BDWRDI) are used from various meteorological regions of Pakistan. The performance of each algorithm is compared using Residual Mean Square Error (RMSE) and Mean Average Error (MAE). The outcomes associated with this research indicate a higher preference of neural networks over machine learning methods in the training sets. However, the efficiency varies from model to model, indicator to indicator, time scale to time scale, and location to location during the testing phase. The most appropriate models are found by considering a list of candidates forecasting models and investigating the performance of each model.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"30 1","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing the generalization of forecasting ability of machine learning and probabilistic models for complex climate characteristics\",\"authors\":\"Aamina Batool, Zulfiqar Ali, Muhammad Mohsin, Atef Masmoudi, Veysi Kartal, Samina Satti\",\"doi\":\"10.1007/s00477-024-02721-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Climate changes and global warming increase risk of recurrent extreme and complex climatic features. It necessitates accurate modeling and forecasting of climate phenomena for sustainable development goals. However, machine learning algorithms and advanced statistical models are extensively employed to analyze complex data and make predictions related to climate phenomena. It is important to have comprehensive knowledge to use these models and consider their potential implications. This study aims to evaluate and compare some popular machine learning and probabilistic methods by analyzing various time series indices associated with precipitation and temperature. For application, time series data of Standardized Precipitation Temperature Index (SPTI), Standardized Temperature Index (STI), Standardized Compound Drought and Heat Index (SCDHI), and Biased Diminished Weighted Regional Drought Index (BDWRDI) are used from various meteorological regions of Pakistan. The performance of each algorithm is compared using Residual Mean Square Error (RMSE) and Mean Average Error (MAE). The outcomes associated with this research indicate a higher preference of neural networks over machine learning methods in the training sets. However, the efficiency varies from model to model, indicator to indicator, time scale to time scale, and location to location during the testing phase. The most appropriate models are found by considering a list of candidates forecasting models and investigating the performance of each model.</p>\",\"PeriodicalId\":21987,\"journal\":{\"name\":\"Stochastic Environmental Research and Risk Assessment\",\"volume\":\"30 1\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stochastic Environmental Research and Risk Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s00477-024-02721-3\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Environmental Research and Risk Assessment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s00477-024-02721-3","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

气候变化和全球变暖增加了反复出现极端和复杂气候特征的风险。为实现可持续发展目标,有必要对气候现象进行精确建模和预测。然而,机器学习算法和先进的统计模型被广泛用于分析复杂的数据并做出与气候现象相关的预测。要使用这些模型并考虑其潜在影响,必须掌握全面的知识。本研究旨在通过分析与降水和温度相关的各种时间序列指数,评估和比较一些流行的机器学习和概率方法。在应用时,使用了巴基斯坦不同气象区域的标准化降水温度指数 (SPTI)、标准化温度指数 (STI)、标准化复合干旱和炎热指数 (SCDHI) 以及偏减权区域干旱指数 (BDWRDI) 的时间序列数据。使用残差均方误差 (RMSE) 和平均误差 (MAE) 比较了每种算法的性能。研究结果表明,在训练集中,神经网络比机器学习方法更受青睐。然而,在测试阶段,不同模型、不同指标、不同时间尺度和不同地点的效率各不相同。通过考虑候选预测模型列表并调查每个模型的性能,可以找到最合适的模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Assessing the generalization of forecasting ability of machine learning and probabilistic models for complex climate characteristics

Assessing the generalization of forecasting ability of machine learning and probabilistic models for complex climate characteristics

Climate changes and global warming increase risk of recurrent extreme and complex climatic features. It necessitates accurate modeling and forecasting of climate phenomena for sustainable development goals. However, machine learning algorithms and advanced statistical models are extensively employed to analyze complex data and make predictions related to climate phenomena. It is important to have comprehensive knowledge to use these models and consider their potential implications. This study aims to evaluate and compare some popular machine learning and probabilistic methods by analyzing various time series indices associated with precipitation and temperature. For application, time series data of Standardized Precipitation Temperature Index (SPTI), Standardized Temperature Index (STI), Standardized Compound Drought and Heat Index (SCDHI), and Biased Diminished Weighted Regional Drought Index (BDWRDI) are used from various meteorological regions of Pakistan. The performance of each algorithm is compared using Residual Mean Square Error (RMSE) and Mean Average Error (MAE). The outcomes associated with this research indicate a higher preference of neural networks over machine learning methods in the training sets. However, the efficiency varies from model to model, indicator to indicator, time scale to time scale, and location to location during the testing phase. The most appropriate models are found by considering a list of candidates forecasting models and investigating the performance of each model.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.10
自引率
9.50%
发文量
189
审稿时长
3.8 months
期刊介绍: Stochastic Environmental Research and Risk Assessment (SERRA) will publish research papers, reviews and technical notes on stochastic and probabilistic approaches to environmental sciences and engineering, including interactions of earth and atmospheric environments with people and ecosystems. The basic idea is to bring together research papers on stochastic modelling in various fields of environmental sciences and to provide an interdisciplinary forum for the exchange of ideas, for communicating on issues that cut across disciplinary barriers, and for the dissemination of stochastic techniques used in different fields to the community of interested researchers. Original contributions will be considered dealing with modelling (theoretical and computational), measurements and instrumentation in one or more of the following topical areas: - Spatiotemporal analysis and mapping of natural processes. - Enviroinformatics. - Environmental risk assessment, reliability analysis and decision making. - Surface and subsurface hydrology and hydraulics. - Multiphase porous media domains and contaminant transport modelling. - Hazardous waste site characterization. - Stochastic turbulence and random hydrodynamic fields. - Chaotic and fractal systems. - Random waves and seafloor morphology. - Stochastic atmospheric and climate processes. - Air pollution and quality assessment research. - Modern geostatistics. - Mechanisms of pollutant formation, emission, exposure and absorption. - Physical, chemical and biological analysis of human exposure from single and multiple media and routes; control and protection. - Bioinformatics. - Probabilistic methods in ecology and population biology. - Epidemiological investigations. - Models using stochastic differential equations stochastic or partial differential equations. - Hazardous waste site characterization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信