{"title":"确定巴西米纳斯吉拉斯州气候数据输入的最佳机器学习算法","authors":"Lucas O. Bayma, Marconi A. Pereira","doi":"10.5753/jidm.2018.2044","DOIUrl":null,"url":null,"abstract":"Climate prediction is a relevant activity for humanity and, for the success of the climate forecast, a good historical database is necessary. However, because of several factors, large historical data gaps are found at different meteorological stations, and studies to determine such missing weather values are still scarce. This work describes a study of a combination of several machine learning techniques to determine missing climatic values. This study extends our previous work, producing a computational framework, formed by three different methods: neural networks, regression bagged trees and random forest. Deep data analysis and a statistical study is conducted to compare these three methods. The study statistically demonstrated that the random forest technique was successful in obtaining missing climatic values for the state of Minas Gerais and can be widely used by the responsible agencies to improve their historical databases, consequently, their climate forecasts.","PeriodicalId":293511,"journal":{"name":"Journal of Information and Data Management","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Identifying Finest Machine Learning Algorithm for Climate Data Imputation in the State of Minas Gerais, Brazil\",\"authors\":\"Lucas O. Bayma, Marconi A. Pereira\",\"doi\":\"10.5753/jidm.2018.2044\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Climate prediction is a relevant activity for humanity and, for the success of the climate forecast, a good historical database is necessary. However, because of several factors, large historical data gaps are found at different meteorological stations, and studies to determine such missing weather values are still scarce. This work describes a study of a combination of several machine learning techniques to determine missing climatic values. This study extends our previous work, producing a computational framework, formed by three different methods: neural networks, regression bagged trees and random forest. Deep data analysis and a statistical study is conducted to compare these three methods. The study statistically demonstrated that the random forest technique was successful in obtaining missing climatic values for the state of Minas Gerais and can be widely used by the responsible agencies to improve their historical databases, consequently, their climate forecasts.\",\"PeriodicalId\":293511,\"journal\":{\"name\":\"Journal of Information and Data Management\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Information and Data Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5753/jidm.2018.2044\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information and Data Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5753/jidm.2018.2044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identifying Finest Machine Learning Algorithm for Climate Data Imputation in the State of Minas Gerais, Brazil
Climate prediction is a relevant activity for humanity and, for the success of the climate forecast, a good historical database is necessary. However, because of several factors, large historical data gaps are found at different meteorological stations, and studies to determine such missing weather values are still scarce. This work describes a study of a combination of several machine learning techniques to determine missing climatic values. This study extends our previous work, producing a computational framework, formed by three different methods: neural networks, regression bagged trees and random forest. Deep data analysis and a statistical study is conducted to compare these three methods. The study statistically demonstrated that the random forest technique was successful in obtaining missing climatic values for the state of Minas Gerais and can be widely used by the responsible agencies to improve their historical databases, consequently, their climate forecasts.