Machine Learning for Big Data Analytics in Development of Wildfire Prediction Models
Chan-Ho Lee, Mooyoung Lim, Yohan Lee
Journal of the Korean Society of Hazard Mitigation, published 2023-04-30
https://doi.org/10.9798/kosham.2023.23.2.29
This study develops a machine learning model that predicts domestic forest fire occurrences during forest fire periods. The modeling methods were logistic regression and two ensemble techniques, gradient boosting and random forest; oversampling was applied to address the class imbalance in the forest fire data. The resulting model predicted 239 of the 333 forest fire occurrences during the nationwide forest fire period in 2020, a prediction accuracy of approximately 71.8%. Forest fires that occur during such periods are strongly influenced by climatic factors such as temperature, humidity, and precipitation. In Gangwon-do, in addition to these factors, farmland density and stem volume per hectare were also highly correlated with increased forest fire occurrences. The significance of this study is that it presents a customized wildfire occurrence prediction model for use by the administrative units of provinces and cities across the country, which serve as the basic centers for wildfire prevention.
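
As a rough illustration of two points in the abstract, the sketch below shows one simple way to oversample a rare "fire" class and how the reported 71.8% follows from 239 of 333 occurrences. The abstract does not say which oversampling technique was used, so plain random duplication of minority rows is an assumption here, and the function names (`random_oversample`, `detection_rate`) and the toy data are hypothetical, not taken from the study.

```python
import random
from collections import Counter


def random_oversample(features, labels, seed=0):
    """Duplicate randomly chosen rows of each smaller class until every
    class has as many rows as the largest class (assumed technique)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_features = list(features)
    out_labels = list(labels)
    for cls, n in counts.items():
        # Rows of this class in the original data, used as the sampling pool.
        pool = [f for f, y in zip(features, labels) if y == cls]
        for _ in range(target - n):
            out_features.append(rng.choice(pool))
            out_labels.append(cls)
    return out_features, out_labels


def detection_rate(detected, total):
    """Share of actual fire occurrences that the model predicted."""
    return detected / total


# Toy data (made up): each row is (temperature, humidity); 1 = fire, 0 = no fire.
X = [(25.0, 30.0), (12.0, 80.0), (10.0, 75.0), (14.0, 70.0), (9.0, 85.0)]
y = [1, 0, 0, 0, 0]

X_bal, y_bal = random_oversample(X, y)
print(Counter(y_bal))  # both classes now have 4 rows

# Figures from the abstract: 239 of 333 occurrences predicted.
print(f"{detection_rate(239, 333):.1%}")  # 71.8%
```

In practice, oversampling would be applied only to the training split so that duplicated rows do not leak into the evaluation data; the balanced set could then be passed to a logistic regression, gradient boosting, or random forest classifier.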