{"title":"A Novel Bayesian Approach for Construction of Random Forest","authors":"Arpan Dam, Ashish Phophalia, V. Jain","doi":"10.1109/ICIIP53038.2021.9702564","DOIUrl":null,"url":null,"abstract":"Decision tree is one of the commonly used machine learning algorithm. Random Forest (RF) is an ensemble of such decision trees. The construction of optimal Decision Tree and hence Random Forest is NP Hard when data is large. The Bayesian statistics have been used in the past for various machine learning and pattern recognition problems. The Bayesian statistics provide a tool to construct Random Forest when no prior information for data is available. Here a forest is generated based on Bayesian statistics where numerous trees are sampled given the prior distribution without the use of training data, and after that weighted ensemble is performed. In the past, it has been used for classification problems. In this paper, we are proposing construction of RF under Bayesian framework using Tree Strength concept. Also, we extend our proposal to regression problems. The proposal is evaluated on UCI data sets for both classification and regression task and found satisfactory results.","PeriodicalId":431272,"journal":{"name":"2021 Sixth International Conference on Image Information Processing (ICIIP)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Sixth International Conference on Image Information Processing (ICIIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIP53038.2021.9702564","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Decision tree is one of the commonly used machine learning algorithm. Random Forest (RF) is an ensemble of such decision trees. The construction of optimal Decision Tree and hence Random Forest is NP Hard when data is large. The Bayesian statistics have been used in the past for various machine learning and pattern recognition problems. The Bayesian statistics provide a tool to construct Random Forest when no prior information for data is available. Here a forest is generated based on Bayesian statistics where numerous trees are sampled given the prior distribution without the use of training data, and after that weighted ensemble is performed. In the past, it has been used for classification problems. In this paper, we are proposing construction of RF under Bayesian framework using Tree Strength concept. Also, we extend our proposal to regression problems. The proposal is evaluated on UCI data sets for both classification and regression task and found satisfactory results.