Daniela de Oliveira Maionchi, Adriano Carvalho Nunes e Araújo, Walter Aguiar Martins Junior, Junior Gonçalves da Silva, Danilo Ferreira de Souza
{"title":"A Machine Learning Model for Lightning-Related Deaths in Brazil","authors":"Daniela de Oliveira Maionchi, Adriano Carvalho Nunes e Araújo, Walter Aguiar Martins Junior, Junior Gonçalves da Silva, Danilo Ferreira de Souza","doi":"10.1175/wcas-d-23-0084.1","DOIUrl":null,"url":null,"abstract":"Brazil presents the highest number of lightning-related deaths in the world. This study aimed to identify the key victims’ characteristics associated with such fatalities in Brazil and to develop a model that predicts the number of deaths as function of the victims’ data. The dataset provided by the Department of Informatics of the Unified Health System in Brazil- DATASUS was analyzed and machine learning regression techniques were applied. The Gradient Boosting Regressor (GBR) model was found to be the most effective, achieving a prediction accuracy of 97%. Through the analysis of 34 initial variables, 10 variables were identified as having the greatest influence on the model’s outcomes. These variables included race, gender, age group, occupational accidents, education, and location of death. Understanding these characteristics is crucial for implementing targeted prevention and safety strategies in various regions, helping to mitigate the risk of lightning-related deaths worldwide. Additionally, the methodology used in this study can serve as a framework for similar research in different locations, allowing for the identification of important factors specific to each region. By adapting the machine learning regression techniques and incorporating local datasets, researchers can gain valuable insights into the determinants of lightning-related fatalities, enabling the development of effective prevention and safety measures tailored to specific geographical areas.","PeriodicalId":507492,"journal":{"name":"Weather, Climate, and Society","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Weather, Climate, and Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1175/wcas-d-23-0084.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Brazil presents the highest number of lightning-related deaths in the world. This study aimed to identify the key victims’ characteristics associated with such fatalities in Brazil and to develop a model that predicts the number of deaths as function of the victims’ data. The dataset provided by the Department of Informatics of the Unified Health System in Brazil- DATASUS was analyzed and machine learning regression techniques were applied. The Gradient Boosting Regressor (GBR) model was found to be the most effective, achieving a prediction accuracy of 97%. Through the analysis of 34 initial variables, 10 variables were identified as having the greatest influence on the model’s outcomes. These variables included race, gender, age group, occupational accidents, education, and location of death. Understanding these characteristics is crucial for implementing targeted prevention and safety strategies in various regions, helping to mitigate the risk of lightning-related deaths worldwide. Additionally, the methodology used in this study can serve as a framework for similar research in different locations, allowing for the identification of important factors specific to each region. By adapting the machine learning regression techniques and incorporating local datasets, researchers can gain valuable insights into the determinants of lightning-related fatalities, enabling the development of effective prevention and safety measures tailored to specific geographical areas.