{"title":"基于特征选择的随机森林蒸发皿蒸发量预测模型","authors":"Rakhee, Archana Singh, Mamta Mittal, Amrender Kumar","doi":"10.1109/Confluence47617.2020.9057856","DOIUrl":null,"url":null,"abstract":"Random Forest is a learning method that can be used for classification and regression problems; it operates by constructing decision trees at training time and output the predicted results. In this study, the algorithm is used to predict the Pan Evaporation for Karnal district, India. Random forest is also adopted to select the important features which highly influence the evaporation conditions. The weather of four lag weeks from the week of forecast is used to form indices that are considered for the model development. The algorithm is trained using thirty-one-year data (1973-2003) and subsequent year (2004-05) which is not utilized for model development is used as a testing set. The developed random forest model is further compared with the artificial neural network with backpropagation algorithm. The performance of the models is measured using mean square error, which shows that the predicted values are in close approximation with the observed one but the random forest model has better predictions than the artificial neural network.","PeriodicalId":180005,"journal":{"name":"2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Predictive modeling of Pan Evaporation using Random Forest Algorithm along with Features Selection\",\"authors\":\"Rakhee, Archana Singh, Mamta Mittal, Amrender Kumar\",\"doi\":\"10.1109/Confluence47617.2020.9057856\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Random Forest is a learning method that can be used for classification and regression problems; it operates by constructing decision trees at training time and output the predicted results. In this study, the algorithm is used to predict the Pan Evaporation for Karnal district, India. Random forest is also adopted to select the important features which highly influence the evaporation conditions. The weather of four lag weeks from the week of forecast is used to form indices that are considered for the model development. The algorithm is trained using thirty-one-year data (1973-2003) and subsequent year (2004-05) which is not utilized for model development is used as a testing set. The developed random forest model is further compared with the artificial neural network with backpropagation algorithm. The performance of the models is measured using mean square error, which shows that the predicted values are in close approximation with the observed one but the random forest model has better predictions than the artificial neural network.\",\"PeriodicalId\":180005,\"journal\":{\"name\":\"2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/Confluence47617.2020.9057856\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Confluence47617.2020.9057856","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predictive modeling of Pan Evaporation using Random Forest Algorithm along with Features Selection
Random Forest is a learning method that can be used for classification and regression problems; it operates by constructing decision trees at training time and output the predicted results. In this study, the algorithm is used to predict the Pan Evaporation for Karnal district, India. Random forest is also adopted to select the important features which highly influence the evaporation conditions. The weather of four lag weeks from the week of forecast is used to form indices that are considered for the model development. The algorithm is trained using thirty-one-year data (1973-2003) and subsequent year (2004-05) which is not utilized for model development is used as a testing set. The developed random forest model is further compared with the artificial neural network with backpropagation algorithm. The performance of the models is measured using mean square error, which shows that the predicted values are in close approximation with the observed one but the random forest model has better predictions than the artificial neural network.