Using machine learning to predict convection-allowing ensemble forecast skill: Evaluation with the NSSL Warn-on-Forecast System

Corey K. Potvin, Montgomery Flora, P. Skinner, Anthony E. Reinhart, B. Matilla
{"title":"Using machine learning to predict convection-allowing ensemble forecast skill: Evaluation with the NSSL Warn-on-Forecast System","authors":"Corey K. Potvin, Montgomery Flora, P. Skinner, Anthony E. Reinhart, B. Matilla","doi":"10.1175/aies-d-23-0106.1","DOIUrl":null,"url":null,"abstract":"\nForecasters routinely calibrate their confidence in model forecasts. Ensembles inherently estimate forecast confidence, but are often underdispersive, and ensemble spread does not strongly correlate with ensemble-mean error. The misalignment between ensemble spread and skill motivates new methods for “forecasting forecast skill” so that forecasters can better utilize ensemble guidance. We have trained logistic regression and random forest models to predict the skill of composite reflectivity forecasts from the NSSL Warn-on-Forecast System (WoFS), a 3-km ensemble that generates rapidly updating forecast guidance for 0-6-h lead times. The forecast skill predictions are valid at 1-h, 2-h, or 3-h lead times within localized regions determined by the observed storm locations at analysis time. We use WoFS analysis and forecast output and NSSL Multi-Radar / Multi-Sensor composite reflectivity for 106 cases from the 2017-2021 NOAA Hazardous Weather Testbed Spring Forecasting Experiments. We frame the prediction task as a multi-classification problem, where the forecast skill labels are determined by averaging the extended Fractions Skill Scores (eFSS) for several reflectivity thresholds and verification neighborhoods, then converting to one of three classes based on where the average eFSS ranks within the entire dataset: POOR (bottom 20%), FAIR (middle 60%), or GOOD (top 20%). Initial machine learning (ML) models are trained on 323 predictors; reducing to 10 or 15 predictors in the final models only modestly reduces skill. The final models substantially outperform carefully developed persistence- and spread-based models, and are reasonably explainable. The results suggest that ML can be a valuable tool for guiding user confidence in convection-allowing (and larger-scale) ensemble forecasts.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"45 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence for the earth systems","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.1175/aies-d-23-0106.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Forecasters routinely calibrate their confidence in model forecasts. Ensembles inherently estimate forecast confidence, but are often underdispersive, and ensemble spread does not strongly correlate with ensemble-mean error. The misalignment between ensemble spread and skill motivates new methods for “forecasting forecast skill” so that forecasters can better utilize ensemble guidance. We have trained logistic regression and random forest models to predict the skill of composite reflectivity forecasts from the NSSL Warn-on-Forecast System (WoFS), a 3-km ensemble that generates rapidly updating forecast guidance for 0-6-h lead times. The forecast skill predictions are valid at 1-h, 2-h, or 3-h lead times within localized regions determined by the observed storm locations at analysis time. We use WoFS analysis and forecast output and NSSL Multi-Radar / Multi-Sensor composite reflectivity for 106 cases from the 2017-2021 NOAA Hazardous Weather Testbed Spring Forecasting Experiments. We frame the prediction task as a multi-classification problem, where the forecast skill labels are determined by averaging the extended Fractions Skill Scores (eFSS) for several reflectivity thresholds and verification neighborhoods, then converting to one of three classes based on where the average eFSS ranks within the entire dataset: POOR (bottom 20%), FAIR (middle 60%), or GOOD (top 20%). Initial machine learning (ML) models are trained on 323 predictors; reducing to 10 or 15 predictors in the final models only modestly reduces skill. The final models substantially outperform carefully developed persistence- and spread-based models, and are reasonably explainable. The results suggest that ML can be a valuable tool for guiding user confidence in convection-allowing (and larger-scale) ensemble forecasts.
利用机器学习预测对流--允许集合预报技能:利用 NSSL 预报预警系统进行评估
预报员经常校准他们对模式预报的信心。集合本身可以估计预报的可信度,但往往分散性不足,而且集合扩散与集合平均误差的相关性不强。集合散布与预测技能之间的不一致促使我们采用新方法来 "预测预测技能",以便预测人员更好地利用集合指导。我们对逻辑回归和随机森林模型进行了训练,以预测来自国家空间实验室预报预警系统(WoFS)的复合反射率预报技能。预报技能预测在 1 小时、2 小时或 3 小时前沿时间内的局部区域有效,这些区域由分析时观测到的风暴位置决定。我们使用了 WoFS 分析和预报输出以及 NSSL 多雷达/多传感器综合反射率,这些数据来自 2017-2021 年 NOAA 危险天气试验台春季预报实验的 106 个案例。我们将预测任务视为一个多分类问题,通过对若干反射率阈值和验证邻域的扩展分数技能得分(eFSS)求平均值来确定预报技能标签,然后根据 eFSS 平均值在整个数据集中的排名将其转换为三个类别之一:差(最低 20%)、一般(中间 60%)或好(最高 20%)。初始机器学习(ML)模型在 323 个预测因子上进行训练;在最终模型中,将预测因子减少到 10 或 15 个只会适度降低技能。最终模型的表现大大优于精心开发的基于持久性和传播的模型,并且可以合理解释。结果表明,ML 可以成为指导用户对允许对流的(和更大规模的)集合预报的信心的重要工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信