Impact of NWP Model Configuration and Training Sample Characteristics on Random Forest-Based Day-1 Convective Outlook Guidance

IF 2.9 3区 地球科学 Q2 ASTRONOMY & ASTROPHYSICS
Aaron Johnson, Xuguang Wang
{"title":"Impact of NWP Model Configuration and Training Sample Characteristics on Random Forest-Based Day-1 Convective Outlook Guidance","authors":"Aaron Johnson,&nbsp;Xuguang Wang","doi":"10.1029/2024EA003822","DOIUrl":null,"url":null,"abstract":"<p>This study aims to quantify and better understand the impact of an upgrade to the configuration of an FV3 (Finite Volume cubed-sphere) LAM (Limited Area Model) convection-allowing ensemble on the skill of the RF models trained on cases before the upgrade and forecast on cases after the upgrade. Specifically, Random Forest (RF) models were used to produce probabilistic forecasts of severe weather, significant severe weather, and individual hazards of wind, hail, and tornado for the purpose of day-1 convective outlook guidance. The RF models are trained and forecast on different subsets of the available data set of forecast cases from the spring seasons of 2019 and 2021 (before the FV3 LAM upgrade) and 2022 (after the upgrade) and evaluated both quantitatively and qualitatively. It is found for most predictands that the RF models forecasting 2022 (2019/2021) cases are statistically significantly more skillful when trained on other cases from the 2022 (2019/2021) data set using a leave-one-out approach. However, within the 2019/2021 data set, training on cases from a different year than the year being forecast also leads to statistically significant degradations of skill, apparently at least in part due to the different sample climate between 2019 and 2021. For this particular NWP (Numerical Weather Prediction) model configuration change, the consistency in sample climate between training and forecast cases is at least as important as consistency in model configuration. Finally, increases in skill resulting from increasing the number of forecast cases used to train the RF levels off around 30 forecast cases.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"12 2","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024EA003822","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth and Space Science","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2024EA003822","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

This study aims to quantify and better understand the impact of an upgrade to the configuration of an FV3 (Finite Volume cubed-sphere) LAM (Limited Area Model) convection-allowing ensemble on the skill of the RF models trained on cases before the upgrade and forecast on cases after the upgrade. Specifically, Random Forest (RF) models were used to produce probabilistic forecasts of severe weather, significant severe weather, and individual hazards of wind, hail, and tornado for the purpose of day-1 convective outlook guidance. The RF models are trained and forecast on different subsets of the available data set of forecast cases from the spring seasons of 2019 and 2021 (before the FV3 LAM upgrade) and 2022 (after the upgrade) and evaluated both quantitatively and qualitatively. It is found for most predictands that the RF models forecasting 2022 (2019/2021) cases are statistically significantly more skillful when trained on other cases from the 2022 (2019/2021) data set using a leave-one-out approach. However, within the 2019/2021 data set, training on cases from a different year than the year being forecast also leads to statistically significant degradations of skill, apparently at least in part due to the different sample climate between 2019 and 2021. For this particular NWP (Numerical Weather Prediction) model configuration change, the consistency in sample climate between training and forecast cases is at least as important as consistency in model configuration. Finally, increases in skill resulting from increasing the number of forecast cases used to train the RF levels off around 30 forecast cases.

Abstract Image

NWP模型配置和训练样本特征对基于随机森林的第1天对流展望指导的影响
本研究旨在量化和更好地理解升级到FV3(有限体积立方球)LAM(有限区域模型)对流集成配置对升级前训练的RF模型技能的影响,并对升级后的案例进行预测。具体而言,随机森林(RF)模型用于产生恶劣天气、显著恶劣天气以及风、冰雹和龙卷风的个别危害的概率预报,目的是为第1天的对流展望提供指导。RF模型在2019年和2021年春季(FV3 LAM升级之前)和2022年春季(升级之后)的可用预测案例数据集的不同子集上进行训练和预测,并进行定量和定性评估。研究发现,对于大多数预测,预测2022年(2019/2021)病例的RF模型在使用留一方法对2022年(2019/2021)数据集的其他病例进行训练时,在统计上明显更加熟练。然而,在2019/2021年的数据集中,对不同年份的案例进行训练也会导致统计上显著的技能下降,这显然至少部分是由于2019年至2021年之间的样本气候不同。对于这种特殊的NWP(数值天气预报)模型配置变化,训练和预测案例之间样本气候的一致性至少与模型配置的一致性同样重要。最后,由于增加用于训练RF的预测案例数量而导致的技能增加,大约有30个预测案例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Earth and Space Science
Earth and Space Science Earth and Planetary Sciences-General Earth and Planetary Sciences
CiteScore
5.50
自引率
3.20%
发文量
285
审稿时长
19 weeks
期刊介绍: Marking AGU’s second new open access journal in the last 12 months, Earth and Space Science is the only journal that reflects the expansive range of science represented by AGU’s 62,000 members, including all of the Earth, planetary, and space sciences, and related fields in environmental science, geoengineering, space engineering, and biogeochemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信