Mark DeMaria, James L. Franklin, Galina Chirokova, Jacob Radford, Robert DeMaria, Kate D. Musgrave, Imme Ebert-Uphoff
{"title":"Evaluation of Tropical Cyclone Track and Intensity Forecasts from Artificial Intelligence Weather Prediction (AIWP) Models","authors":"Mark DeMaria, James L. Franklin, Galina Chirokova, Jacob Radford, Robert DeMaria, Kate D. Musgrave, Imme Ebert-Uphoff","doi":"arxiv-2409.06735","DOIUrl":null,"url":null,"abstract":"In just the past few years multiple data-driven Artificial Intelligence\nWeather Prediction (AIWP) models have been developed, with new versions\nappearing almost monthly. Given this rapid development, the applicability of\nthese models to operational forecasting has yet to be adequately explored and\ndocumented. To assess their utility for operational tropical cyclone (TC)\nforecasting, the NHC verification procedure is used to evaluate seven-day track\nand intensity predictions for northern hemisphere TCs from May-November 2023.\nFour open-source AIWP models are considered (FourCastNetv1,\nFourCastNetv2-small, GraphCast-operational and Pangu-Weather). The AIWP track forecast errors and detection rates are comparable to those\nfrom the best-performing operational forecast models. However, the AIWP\nintensity forecast errors are larger than those of even the simplest intensity\nforecasts based on climatology and persistence. The AIWP models almost always\nreduce the TC intensity, especially within the first 24 h of the forecast,\nresulting in a substantial low bias. The contribution of the AIWP models to the NHC model consensus was also\nevaluated. The consensus track errors are reduced by up to 11% at the longer\ntime periods. The five-day NHC official track forecasts have improved by about\n2% per year since 2001, so this represents more than a five-year gain in\naccuracy. Despite substantial negative intensity biases, the AIWP models have a\nneutral impact on the intensity consensus. These results show that the current\nformulation of the AIWP models have promise for operational TC track forecasts,\nbut improved bias corrections or model reformulations will be needed for\naccurate intensity forecasts.","PeriodicalId":501166,"journal":{"name":"arXiv - PHYS - Atmospheric and Oceanic Physics","volume":"21 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Atmospheric and Oceanic Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06735","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In just the past few years multiple data-driven Artificial Intelligence
Weather Prediction (AIWP) models have been developed, with new versions
appearing almost monthly. Given this rapid development, the applicability of
these models to operational forecasting has yet to be adequately explored and
documented. To assess their utility for operational tropical cyclone (TC)
forecasting, the NHC verification procedure is used to evaluate seven-day track
and intensity predictions for northern hemisphere TCs from May-November 2023.
Four open-source AIWP models are considered (FourCastNetv1,
FourCastNetv2-small, GraphCast-operational and Pangu-Weather). The AIWP track forecast errors and detection rates are comparable to those
from the best-performing operational forecast models. However, the AIWP
intensity forecast errors are larger than those of even the simplest intensity
forecasts based on climatology and persistence. The AIWP models almost always
reduce the TC intensity, especially within the first 24 h of the forecast,
resulting in a substantial low bias. The contribution of the AIWP models to the NHC model consensus was also
evaluated. The consensus track errors are reduced by up to 11% at the longer
time periods. The five-day NHC official track forecasts have improved by about
2% per year since 2001, so this represents more than a five-year gain in
accuracy. Despite substantial negative intensity biases, the AIWP models have a
neutral impact on the intensity consensus. These results show that the current
formulation of the AIWP models have promise for operational TC track forecasts,
but improved bias corrections or model reformulations will be needed for
accurate intensity forecasts.