利用机器学习改进喜马拉雅地区季节尺度的北美多模式集合(nmme)降水预报

International Journal of Big Data Mining for Global Warming Pub Date : 2021-10-12 DOI:10.1142/s263053482150008x

S. Shrivastava, R. Avtar, P. K. Bal

{"title":"利用机器学习改进喜马拉雅地区季节尺度的北美多模式集合(nmme)降水预报","authors":"S. Shrivastava, R. Avtar, P. K. Bal","doi":"10.1142/s263053482150008x","DOIUrl":null,"url":null,"abstract":"The coarse horizontal resolution global climate models (GCMs) have limitations in producing large biases over the mountainous region. Also, single model output or simple multi-model ensemble (SMME) outputs are associated with large biases. While predicting the rainfall extreme events, this study attempts to use an alternative modeling approach by using five different machine learning (ML) algorithms to improve the skill of North American Multi-Model Ensemble (NMME) GCMs during Indian summer monsoon rainfall from 1982 to 2009 by reducing the model biases. Random forest (RF), AdaBoost (Ada), gradient (Grad) boosting, bagging (Bag) and extra (Extra) trees regression models are used and the results from each models are compared against the observations. In simple MME (SMME), a wet bias of 20[Formula: see text]mm/day and an RMSE up to 15[Formula: see text]mm/day are found over the Himalayan region. However, all the ML models can bring down the mean bias up to [Formula: see text][Formula: see text]mm/day and RMSE up to 2[Formula: see text]mm/day. The interannual variability in ML outputs is closer to observation than the SMME. Also, a high correlation from 0.5 to 0.8 is found between in all ML models and then in SMME. Moreover, representation of RF and Grad is found to be best out of all five ML models that represent a high correlation over the Himalayan region. In conclusion, by taking full advantage of different models, the proposed ML-based multi-model ensemble method is shown to be accurate and effective.","PeriodicalId":262307,"journal":{"name":"International Journal of Big Data Mining for Global Warming","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"IMPROVING THE NORTH AMERICAN MULTI-MODEL ENSEMBLE (NMME) PRECIPITATION FORECASTS AT SEASONAL SCALE OVER THE HIMALAYAN REGION USING MACHINE LEARNING\",\"authors\":\"S. Shrivastava, R. Avtar, P. K. Bal\",\"doi\":\"10.1142/s263053482150008x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The coarse horizontal resolution global climate models (GCMs) have limitations in producing large biases over the mountainous region. Also, single model output or simple multi-model ensemble (SMME) outputs are associated with large biases. While predicting the rainfall extreme events, this study attempts to use an alternative modeling approach by using five different machine learning (ML) algorithms to improve the skill of North American Multi-Model Ensemble (NMME) GCMs during Indian summer monsoon rainfall from 1982 to 2009 by reducing the model biases. Random forest (RF), AdaBoost (Ada), gradient (Grad) boosting, bagging (Bag) and extra (Extra) trees regression models are used and the results from each models are compared against the observations. In simple MME (SMME), a wet bias of 20[Formula: see text]mm/day and an RMSE up to 15[Formula: see text]mm/day are found over the Himalayan region. However, all the ML models can bring down the mean bias up to [Formula: see text][Formula: see text]mm/day and RMSE up to 2[Formula: see text]mm/day. The interannual variability in ML outputs is closer to observation than the SMME. Also, a high correlation from 0.5 to 0.8 is found between in all ML models and then in SMME. Moreover, representation of RF and Grad is found to be best out of all five ML models that represent a high correlation over the Himalayan region. In conclusion, by taking full advantage of different models, the proposed ML-based multi-model ensemble method is shown to be accurate and effective.\",\"PeriodicalId\":262307,\"journal\":{\"name\":\"International Journal of Big Data Mining for Global Warming\",\"volume\":\"83 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Big Data Mining for Global Warming\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s263053482150008x\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Big Data Mining for Global Warming","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s263053482150008x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

粗水平分辨率全球气候模式(GCMs)在山区产生较大偏差方面存在局限性。此外，单模型输出或简单的多模型集成(SMME)输出与大偏差相关。在预测降雨极端事件的同时，本研究试图使用一种替代建模方法，通过使用五种不同的机器学习(ML)算法，通过减少模型偏差，提高1982 - 2009年印度夏季风降雨期间北美多模式集成(NMME) GCMs的技能。使用随机森林(RF)、AdaBoost (Ada)、梯度(Grad)增强、bagging (Bag)和extra (extra)树回归模型，并将每个模型的结果与观测结果进行比较。在简单MME (SMME)中，喜马拉雅地区的湿偏为20[公式:见文本]mm/day, RMSE高达15[公式:见文本]mm/day。然而，所有ML模型都可以将平均偏差降低到[公式:见文本]mm/day, RMSE降低到2[公式:见文本]mm/day。ML输出的年际变化比SMME更接近观测值。此外，在所有ML模型和SMME之间发现了从0.5到0.8的高相关性。此外，在喜马拉雅地区具有高相关性的所有五个ML模型中，RF和Grad的表示被发现是最好的。综上所述，通过充分利用不同的模型，本文提出的基于ml的多模型集成方法是准确有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

IMPROVING THE NORTH AMERICAN MULTI-MODEL ENSEMBLE (NMME) PRECIPITATION FORECASTS AT SEASONAL SCALE OVER THE HIMALAYAN REGION USING MACHINE LEARNING

The coarse horizontal resolution global climate models (GCMs) have limitations in producing large biases over the mountainous region. Also, single model output or simple multi-model ensemble (SMME) outputs are associated with large biases. While predicting the rainfall extreme events, this study attempts to use an alternative modeling approach by using five different machine learning (ML) algorithms to improve the skill of North American Multi-Model Ensemble (NMME) GCMs during Indian summer monsoon rainfall from 1982 to 2009 by reducing the model biases. Random forest (RF), AdaBoost (Ada), gradient (Grad) boosting, bagging (Bag) and extra (Extra) trees regression models are used and the results from each models are compared against the observations. In simple MME (SMME), a wet bias of 20[Formula: see text]mm/day and an RMSE up to 15[Formula: see text]mm/day are found over the Himalayan region. However, all the ML models can bring down the mean bias up to [Formula: see text][Formula: see text]mm/day and RMSE up to 2[Formula: see text]mm/day. The interannual variability in ML outputs is closer to observation than the SMME. Also, a high correlation from 0.5 to 0.8 is found between in all ML models and then in SMME. Moreover, representation of RF and Grad is found to be best out of all five ML models that represent a high correlation over the Himalayan region. In conclusion, by taking full advantage of different models, the proposed ML-based multi-model ensemble method is shown to be accurate and effective.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Big Data Mining for Global Warming

自引率

0.00%

发文量