{"title":"Learning ensembles of Continuous Bayesian Networks: An application to rainfall prediction","authors":"Scott Hellman, A. McGovern, M. Xue","doi":"10.1109/CIDU.2012.6382191","DOIUrl":null,"url":null,"abstract":"We introduce Ensembled Continuous Bayesian Networks (ECBN), an ensemble approach to learning salient dependence relationships and to predicting values for continuous data. By training individual Bayesian networks on both a subset of the data (bagging) and a subset of the attributes in the data (randomization), ECBN produces models for continuous domains that can be used to identify important variables in a dataset and to identify relationships between those variables. We use linear Gaussian distributions within our ensembles, providing efficient network-level inference. By ensembling these networks, we are able to represent nonlinear relationships. We empirically demonstrate that ECBN outperforms the meteorological forecast on a rainfall prediction task across the United States, and performs comparably to results reported for Random Forests.","PeriodicalId":270712,"journal":{"name":"2012 Conference on Intelligent Data Understanding","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Conference on Intelligent Data Understanding","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIDU.2012.6382191","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
We introduce Ensembled Continuous Bayesian Networks (ECBN), an ensemble approach to learning salient dependence relationships and to predicting values for continuous data. By training individual Bayesian networks on both a subset of the data (bagging) and a subset of the attributes in the data (randomization), ECBN produces models for continuous domains that can be used to identify important variables in a dataset and to identify relationships between those variables. We use linear Gaussian distributions within our ensembles, providing efficient network-level inference. By ensembling these networks, we are able to represent nonlinear relationships. We empirically demonstrate that ECBN outperforms the meteorological forecast on a rainfall prediction task across the United States, and performs comparably to results reported for Random Forests.