Sida Song , Xiao Zhou , Shangbo Yuan , Pengle Cheng , Xiaodong Liu
{"title":"Interpretable artificial intelligence models for predicting lightning prone to inducing forest fires","authors":"Sida Song , Xiao Zhou , Shangbo Yuan , Pengle Cheng , Xiaodong Liu","doi":"10.1016/j.jastp.2024.106408","DOIUrl":null,"url":null,"abstract":"<div><div>Specific types and intensities of lightning are significant causes of forest lightning fires. Analyzing the relationship between these lightning events and the climatic conditions that favor their occurrence is crucial for predicting and preventing forest lightning fires. However, there is a lack of research in this area for the Greater Khingan Range in Northeast China. This study utilized data from three lightning location networks and the ERA5 meteorological dataset to analyze the historical climate and lightning data in the Greater Khingan Mountains region of China from 2021 to 2023, focusing on the impact of various climatic factors on the density of target lightning—lightning that is prone to cause forest lightning fires. Four machine learning models—SVM, RF, XGBoost, and LightGBM—were evaluated, with RF demonstrating the best predictive performance, achieving R<sup>2</sup> of 0.83, MAE of 1.91, and MSE of 14.90. Additionally, the prediction results of the RF model were evaluated using the Kruskal-Wallis test to determine if the results are statistically significant. Using SHAP values to interpret the model, it was found that the K-index (kx) and Convective Available Potential Energy (CAPE) are the most significant predictors of target lightning density, followed by the leaf area index for high vegetation (lai_kv), surface pressure (sp), cloud base height (cbh), temperature at 2 m (t2m), and coverage of high vegetation (cvh). Approximately 70% of the total average absolute SHAP values are attributed to kx and CAPE, highlighting their crucial role in the prediction process. This study provides insights into the environmental factors influencing lightning frequency and emphasizes the importance of interpretable machine learning models in predicting future lightning occurrences and forest lightning fires. Visualization tools, including SHAP summary plots and force plots, were used to provide a detailed illustration of each feature's contribution to the model predictions.</div></div>","PeriodicalId":15096,"journal":{"name":"Journal of Atmospheric and Solar-Terrestrial Physics","volume":"267 ","pages":"Article 106408"},"PeriodicalIF":1.8000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Atmospheric and Solar-Terrestrial Physics","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1364682624002360","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Specific types and intensities of lightning are significant causes of forest lightning fires. Analyzing the relationship between these lightning events and the climatic conditions that favor their occurrence is crucial for predicting and preventing forest lightning fires. However, there is a lack of research in this area for the Greater Khingan Range in Northeast China. This study utilized data from three lightning location networks and the ERA5 meteorological dataset to analyze the historical climate and lightning data in the Greater Khingan Mountains region of China from 2021 to 2023, focusing on the impact of various climatic factors on the density of target lightning—lightning that is prone to cause forest lightning fires. Four machine learning models—SVM, RF, XGBoost, and LightGBM—were evaluated, with RF demonstrating the best predictive performance, achieving R2 of 0.83, MAE of 1.91, and MSE of 14.90. Additionally, the prediction results of the RF model were evaluated using the Kruskal-Wallis test to determine if the results are statistically significant. Using SHAP values to interpret the model, it was found that the K-index (kx) and Convective Available Potential Energy (CAPE) are the most significant predictors of target lightning density, followed by the leaf area index for high vegetation (lai_kv), surface pressure (sp), cloud base height (cbh), temperature at 2 m (t2m), and coverage of high vegetation (cvh). Approximately 70% of the total average absolute SHAP values are attributed to kx and CAPE, highlighting their crucial role in the prediction process. This study provides insights into the environmental factors influencing lightning frequency and emphasizes the importance of interpretable machine learning models in predicting future lightning occurrences and forest lightning fires. Visualization tools, including SHAP summary plots and force plots, were used to provide a detailed illustration of each feature's contribution to the model predictions.
期刊介绍:
The Journal of Atmospheric and Solar-Terrestrial Physics (JASTP) is an international journal concerned with the inter-disciplinary science of the Earth''s atmospheric and space environment, especially the highly varied and highly variable physical phenomena that occur in this natural laboratory and the processes that couple them.
The journal covers the physical processes operating in the troposphere, stratosphere, mesosphere, thermosphere, ionosphere, magnetosphere, the Sun, interplanetary medium, and heliosphere. Phenomena occurring in other "spheres", solar influences on climate, and supporting laboratory measurements are also considered. The journal deals especially with the coupling between the different regions.
Solar flares, coronal mass ejections, and other energetic events on the Sun create interesting and important perturbations in the near-Earth space environment. The physics of such "space weather" is central to the Journal of Atmospheric and Solar-Terrestrial Physics and the journal welcomes papers that lead in the direction of a predictive understanding of the coupled system. Regarding the upper atmosphere, the subjects of aeronomy, geomagnetism and geoelectricity, auroral phenomena, radio wave propagation, and plasma instabilities, are examples within the broad field of solar-terrestrial physics which emphasise the energy exchange between the solar wind, the magnetospheric and ionospheric plasmas, and the neutral gas. In the lower atmosphere, topics covered range from mesoscale to global scale dynamics, to atmospheric electricity, lightning and its effects, and to anthropogenic changes.