NWP-based lightning prediction using flexible count data regression

Q1 Mathematics
T. Simon, G. Mayr, Nikolaus Umlauf, A. Zeileis
{"title":"NWP-based lightning prediction using flexible count data regression","authors":"T. Simon, G. Mayr, Nikolaus Umlauf, A. Zeileis","doi":"10.5194/ASCMO-5-1-2019","DOIUrl":null,"url":null,"abstract":"Abstract. A method to predict lightning by postprocessing numerical weather prediction\n(NWP) output is developed for the region of the European Eastern Alps.\nCloud-to-ground (CG) flashes – detected by the ground-based Austrian\nLightning Detection & Information System (ALDIS) network – are counted on\nthe 18×18 km2 grid of the 51-member NWP ensemble of the European\nCentre for Medium-Range Weather Forecasts (ECMWF). These counts serve as the\ntarget quantity in count data regression models for the occurrence of\nlightning events and flash counts of CG. The probability of lightning\noccurrence is modelled by a Bernoulli distribution. The flash counts are\nmodelled with a hurdle approach where the Bernoulli distribution is combined\nwith a zero-truncated negative binomial. In the statistical models the\nparameters of the distributions are described by additive predictors, which\nare assembled using potentially nonlinear functions of NWP covariates.\nMeasures of location and spread of 100 direct and derived NWP covariates\nprovide a pool of candidates for the nonlinear terms. A combination of\nstability selection and gradient boosting identifies the nine (three) most\ninfluential terms for the parameters of the Bernoulli (zero-truncated\nnegative binomial) distribution, most of which turn out to be associated with\neither convective available potential energy (CAPE) or convective\nprecipitation. Markov chain Monte Carlo (MCMC) sampling estimates the final\nmodel to provide credible inference of effects, scores, and\npredictions. The selection of terms and MCMC sampling are applied for data of\nthe year 2016, and out-of-sample performance is evaluated for 2017. The\noccurrence model outperforms a reference climatology – based on 7 years of\ndata – up to a forecast horizon of 5 days. The flash count model is\ncalibrated and also outperforms climatology for exceedance probabilities,\nquantiles, and full predictive distributions.\n","PeriodicalId":36792,"journal":{"name":"Advances in Statistical Climatology, Meteorology and Oceanography","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Statistical Climatology, Meteorology and Oceanography","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/ASCMO-5-1-2019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 14

Abstract

Abstract. A method to predict lightning by postprocessing numerical weather prediction (NWP) output is developed for the region of the European Eastern Alps. Cloud-to-ground (CG) flashes – detected by the ground-based Austrian Lightning Detection & Information System (ALDIS) network – are counted on the 18×18 km2 grid of the 51-member NWP ensemble of the European Centre for Medium-Range Weather Forecasts (ECMWF). These counts serve as the target quantity in count data regression models for the occurrence of lightning events and flash counts of CG. The probability of lightning occurrence is modelled by a Bernoulli distribution. The flash counts are modelled with a hurdle approach where the Bernoulli distribution is combined with a zero-truncated negative binomial. In the statistical models the parameters of the distributions are described by additive predictors, which are assembled using potentially nonlinear functions of NWP covariates. Measures of location and spread of 100 direct and derived NWP covariates provide a pool of candidates for the nonlinear terms. A combination of stability selection and gradient boosting identifies the nine (three) most influential terms for the parameters of the Bernoulli (zero-truncated negative binomial) distribution, most of which turn out to be associated with either convective available potential energy (CAPE) or convective precipitation. Markov chain Monte Carlo (MCMC) sampling estimates the final model to provide credible inference of effects, scores, and predictions. The selection of terms and MCMC sampling are applied for data of the year 2016, and out-of-sample performance is evaluated for 2017. The occurrence model outperforms a reference climatology – based on 7 years of data – up to a forecast horizon of 5 days. The flash count model is calibrated and also outperforms climatology for exceedance probabilities, quantiles, and full predictive distributions.
基于NWP的灵活计数数据回归闪电预测
摘要为欧洲东阿尔卑斯地区开发了一种通过后处理数值天气预报(NWP)输出来预测闪电的方法。地面奥地利闪电探测与信息系统(ALDIS)网络探测到的云对地(CG)闪光按18×18计算 欧洲中期天气预报中心(ECMWF)由51名成员组成的NWP集合的平方公里网格。这些计数作为计数数据回归模型中的目标量,用于CG的发光事件和闪光计数的发生。闪电发生的概率是由伯努利分布模拟的。闪光次数采用栅栏法建模,其中伯努利分布与零截断负二项式相结合。在统计模型中,分布的参数由加性预测器描述,该预测器是使用NWP协变量的潜在非线性函数组装的。100个直接和导出的NWP协变量的位置和扩展的度量为非线性项提供了候选项库。稳定性选择和梯度增强的结合确定了伯努利(零截断负二项式)分布参数的九(三)个最具影响力的项,其中大多数与对流可用势能(CAPE)或对流降水有关。马尔可夫链蒙特卡罗(MCMC)抽样估计最终模型,以提供对效果、分数和预测的可信推断。术语选择和MCMC抽样适用于2016年的数据,并对2017年的样本外性能进行了评估。该发生率模型在5天的预测期内优于基于7年数据的参考气候学。闪光计数模型进行了校准,在超越概率、分位数和完全预测分布方面也优于气候学。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advances in Statistical Climatology, Meteorology and Oceanography
Advances in Statistical Climatology, Meteorology and Oceanography Earth and Planetary Sciences-Atmospheric Science
CiteScore
4.80
自引率
0.00%
发文量
9
审稿时长
26 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信