Identifying Finest Machine Learning Algorithm for Climate Data Imputation in the State of Minas Gerais, Brazil

Lucas O. Bayma, Marconi A. Pereira
{"title":"Identifying Finest Machine Learning Algorithm for Climate Data Imputation in the State of Minas Gerais, Brazil","authors":"Lucas O. Bayma, Marconi A. Pereira","doi":"10.5753/jidm.2018.2044","DOIUrl":null,"url":null,"abstract":"Climate prediction is a relevant activity for humanity and, for the success of the climate forecast, a good historical database is necessary. However, because of several factors, large historical data gaps are found at different meteorological stations, and studies to determine such missing weather values are still scarce. This work describes a study of a combination of several machine learning techniques to determine missing climatic values. This study extends our previous work, producing a computational framework, formed by three different methods: neural networks, regression bagged trees and random forest. Deep data analysis and a statistical study is conducted to compare these three methods. The study statistically demonstrated that the random forest technique was successful in obtaining missing climatic values for the state of Minas Gerais and can be widely used by the responsible agencies to improve their historical databases, consequently, their climate forecasts.","PeriodicalId":293511,"journal":{"name":"Journal of Information and Data Management","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information and Data Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5753/jidm.2018.2044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Climate prediction is a relevant activity for humanity and, for the success of the climate forecast, a good historical database is necessary. However, because of several factors, large historical data gaps are found at different meteorological stations, and studies to determine such missing weather values are still scarce. This work describes a study of a combination of several machine learning techniques to determine missing climatic values. This study extends our previous work, producing a computational framework, formed by three different methods: neural networks, regression bagged trees and random forest. Deep data analysis and a statistical study is conducted to compare these three methods. The study statistically demonstrated that the random forest technique was successful in obtaining missing climatic values for the state of Minas Gerais and can be widely used by the responsible agencies to improve their historical databases, consequently, their climate forecasts.
确定巴西米纳斯吉拉斯州气候数据输入的最佳机器学习算法
气候预测是人类的一项重要活动,气候预测的成功离不开良好的历史数据库。然而,由于多种因素的影响,在不同的气象站发现了较大的历史数据缺口,并且确定这种缺失的天气值的研究仍然很少。这项工作描述了几种机器学习技术的组合研究,以确定缺失的气候值。这项研究扩展了我们之前的工作,产生了一个计算框架,由三种不同的方法组成:神经网络、回归袋树和随机森林。对这三种方法进行了深入的数据分析和统计研究。该研究在统计上表明,随机森林技术成功地获得了米纳斯吉拉斯州缺失的气候值,可以被负责机构广泛使用,以改进其历史数据库,从而改进其气候预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信