利用机器学习改进喜马拉雅地区季节尺度的北美多模式集合(nmme)降水预报

S. Shrivastava, R. Avtar, P. K. Bal
{"title":"利用机器学习改进喜马拉雅地区季节尺度的北美多模式集合(nmme)降水预报","authors":"S. Shrivastava, R. Avtar, P. K. Bal","doi":"10.1142/s263053482150008x","DOIUrl":null,"url":null,"abstract":"The coarse horizontal resolution global climate models (GCMs) have limitations in producing large biases over the mountainous region. Also, single model output or simple multi-model ensemble (SMME) outputs are associated with large biases. While predicting the rainfall extreme events, this study attempts to use an alternative modeling approach by using five different machine learning (ML) algorithms to improve the skill of North American Multi-Model Ensemble (NMME) GCMs during Indian summer monsoon rainfall from 1982 to 2009 by reducing the model biases. Random forest (RF), AdaBoost (Ada), gradient (Grad) boosting, bagging (Bag) and extra (Extra) trees regression models are used and the results from each models are compared against the observations. In simple MME (SMME), a wet bias of 20[Formula: see text]mm/day and an RMSE up to 15[Formula: see text]mm/day are found over the Himalayan region. However, all the ML models can bring down the mean bias up to [Formula: see text][Formula: see text]mm/day and RMSE up to 2[Formula: see text]mm/day. The interannual variability in ML outputs is closer to observation than the SMME. Also, a high correlation from 0.5 to 0.8 is found between in all ML models and then in SMME. Moreover, representation of RF and Grad is found to be best out of all five ML models that represent a high correlation over the Himalayan region. In conclusion, by taking full advantage of different models, the proposed ML-based multi-model ensemble method is shown to be accurate and effective.","PeriodicalId":262307,"journal":{"name":"International Journal of Big Data Mining for Global Warming","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"IMPROVING THE NORTH AMERICAN MULTI-MODEL ENSEMBLE (NMME) PRECIPITATION FORECASTS AT SEASONAL SCALE OVER THE HIMALAYAN REGION USING MACHINE LEARNING\",\"authors\":\"S. Shrivastava, R. Avtar, P. K. Bal\",\"doi\":\"10.1142/s263053482150008x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The coarse horizontal resolution global climate models (GCMs) have limitations in producing large biases over the mountainous region. Also, single model output or simple multi-model ensemble (SMME) outputs are associated with large biases. While predicting the rainfall extreme events, this study attempts to use an alternative modeling approach by using five different machine learning (ML) algorithms to improve the skill of North American Multi-Model Ensemble (NMME) GCMs during Indian summer monsoon rainfall from 1982 to 2009 by reducing the model biases. Random forest (RF), AdaBoost (Ada), gradient (Grad) boosting, bagging (Bag) and extra (Extra) trees regression models are used and the results from each models are compared against the observations. In simple MME (SMME), a wet bias of 20[Formula: see text]mm/day and an RMSE up to 15[Formula: see text]mm/day are found over the Himalayan region. However, all the ML models can bring down the mean bias up to [Formula: see text][Formula: see text]mm/day and RMSE up to 2[Formula: see text]mm/day. The interannual variability in ML outputs is closer to observation than the SMME. Also, a high correlation from 0.5 to 0.8 is found between in all ML models and then in SMME. Moreover, representation of RF and Grad is found to be best out of all five ML models that represent a high correlation over the Himalayan region. In conclusion, by taking full advantage of different models, the proposed ML-based multi-model ensemble method is shown to be accurate and effective.\",\"PeriodicalId\":262307,\"journal\":{\"name\":\"International Journal of Big Data Mining for Global Warming\",\"volume\":\"83 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Big Data Mining for Global Warming\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s263053482150008x\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Big Data Mining for Global Warming","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s263053482150008x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

粗水平分辨率全球气候模式(GCMs)在山区产生较大偏差方面存在局限性。此外,单模型输出或简单的多模型集成(SMME)输出与大偏差相关。在预测降雨极端事件的同时,本研究试图使用一种替代建模方法,通过使用五种不同的机器学习(ML)算法,通过减少模型偏差,提高1982 - 2009年印度夏季风降雨期间北美多模式集成(NMME) GCMs的技能。使用随机森林(RF)、AdaBoost (Ada)、梯度(Grad)增强、bagging (Bag)和extra (extra)树回归模型,并将每个模型的结果与观测结果进行比较。在简单MME (SMME)中,喜马拉雅地区的湿偏为20[公式:见文本]mm/day, RMSE高达15[公式:见文本]mm/day。然而,所有ML模型都可以将平均偏差降低到[公式:见文本]mm/day, RMSE降低到2[公式:见文本]mm/day。ML输出的年际变化比SMME更接近观测值。此外,在所有ML模型和SMME之间发现了从0.5到0.8的高相关性。此外,在喜马拉雅地区具有高相关性的所有五个ML模型中,RF和Grad的表示被发现是最好的。综上所述,通过充分利用不同的模型,本文提出的基于ml的多模型集成方法是准确有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
IMPROVING THE NORTH AMERICAN MULTI-MODEL ENSEMBLE (NMME) PRECIPITATION FORECASTS AT SEASONAL SCALE OVER THE HIMALAYAN REGION USING MACHINE LEARNING
The coarse horizontal resolution global climate models (GCMs) have limitations in producing large biases over the mountainous region. Also, single model output or simple multi-model ensemble (SMME) outputs are associated with large biases. While predicting the rainfall extreme events, this study attempts to use an alternative modeling approach by using five different machine learning (ML) algorithms to improve the skill of North American Multi-Model Ensemble (NMME) GCMs during Indian summer monsoon rainfall from 1982 to 2009 by reducing the model biases. Random forest (RF), AdaBoost (Ada), gradient (Grad) boosting, bagging (Bag) and extra (Extra) trees regression models are used and the results from each models are compared against the observations. In simple MME (SMME), a wet bias of 20[Formula: see text]mm/day and an RMSE up to 15[Formula: see text]mm/day are found over the Himalayan region. However, all the ML models can bring down the mean bias up to [Formula: see text][Formula: see text]mm/day and RMSE up to 2[Formula: see text]mm/day. The interannual variability in ML outputs is closer to observation than the SMME. Also, a high correlation from 0.5 to 0.8 is found between in all ML models and then in SMME. Moreover, representation of RF and Grad is found to be best out of all five ML models that represent a high correlation over the Himalayan region. In conclusion, by taking full advantage of different models, the proposed ML-based multi-model ensemble method is shown to be accurate and effective.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信