{"title":"A Hybrid of Seasonal Autoregressive Integrated Moving Average (SARIMA) and Decision Tree for Drought Forecasting","authors":"Yasnita, E. Sutoyo, Ahmad Musnansyah","doi":"10.1145/3429789.3429870","DOIUrl":null,"url":null,"abstract":"Drought is one of the triggers for forest fires due to depletion of surface water reserves. Along with the frequent drought, the incidence of forest fires has also increased. Therefore, it is important to know or forecast drought to take precautions. In this study, drought forecasting was carried out by applying the concept of data mining classification methods and forecasting methods. This classification uses the decision tree (CART) method, which is a method that aims to see the rules resulting from the classification of existing data. While forecasting uses the SARIMA method, this method is used to predict the factors that cause drought (temperature, humidity, and rainfall). Furthermore, the rule of the classification results is used to classify the results of forecasts. Based on the implementation of the CART algorithm which is evaluated using a confusion matrix is able to achieve an accuracy of 91.33%. Based on the implementation of the SARIMA method, a model is obtained for each variable to build forecasting. Each model was selected based on AIC criteria, and evaluated using MSE. The optimal model for temperature (Tx) is SARIMA (1, 1, 0) x (0, 1, 1, 12) with the MSE value of 0.15. While the selected model for humidity (RH_avg) is SARIMA (0, 1, 1) x (1, 1, 1, 12) with the MSE value of 3.85, and the optimal model for rainfall (RR) is SARIMA (0, 1, 1) x (0, 1, 1, 12) with the MSE value of 8.61.","PeriodicalId":416230,"journal":{"name":"Proceedings of the 2021 International Conference on Engineering and Information Technology for Sustainable Industry","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 International Conference on Engineering and Information Technology for Sustainable Industry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3429789.3429870","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Drought is one of the triggers for forest fires due to depletion of surface water reserves. Along with the frequent drought, the incidence of forest fires has also increased. Therefore, it is important to know or forecast drought to take precautions. In this study, drought forecasting was carried out by applying the concept of data mining classification methods and forecasting methods. This classification uses the decision tree (CART) method, which is a method that aims to see the rules resulting from the classification of existing data. While forecasting uses the SARIMA method, this method is used to predict the factors that cause drought (temperature, humidity, and rainfall). Furthermore, the rule of the classification results is used to classify the results of forecasts. Based on the implementation of the CART algorithm which is evaluated using a confusion matrix is able to achieve an accuracy of 91.33%. Based on the implementation of the SARIMA method, a model is obtained for each variable to build forecasting. Each model was selected based on AIC criteria, and evaluated using MSE. The optimal model for temperature (Tx) is SARIMA (1, 1, 0) x (0, 1, 1, 12) with the MSE value of 0.15. While the selected model for humidity (RH_avg) is SARIMA (0, 1, 1) x (1, 1, 1, 12) with the MSE value of 3.85, and the optimal model for rainfall (RR) is SARIMA (0, 1, 1) x (0, 1, 1, 12) with the MSE value of 8.61.
由于地表水储量的枯竭,干旱是引发森林火灾的因素之一。随着干旱的频繁发生,森林火灾的发生率也有所增加。因此,了解或预测干旱,采取预防措施是很重要的。本研究运用数据挖掘的概念、分类方法和预测方法进行干旱预测。这种分类使用决策树(CART)方法,这是一种旨在查看由现有数据分类产生的规则的方法。虽然预报使用SARIMA方法,但该方法用于预测导致干旱的因素(温度、湿度和降雨)。在此基础上,利用分类结果规则对预测结果进行分类。基于CART算法的实现,使用混淆矩阵进行评估,能够达到91.33%的准确率。在SARIMA方法实现的基础上,对每个变量建立模型进行预测。根据AIC标准选择每个模型,并使用MSE进行评估。温度(Tx)的最优模型为SARIMA (1,1,0) x (0,1,1,12), MSE值为0.15。湿度(RH_avg)优选模型为SARIMA (0,1,1) x (1,1,1,12), MSE值为3.85;降雨(RR)优选模型为SARIMA (0,1,1) x (0,1,1,12), MSE值为8.61。