{"title":"The Cascade Improved Model Based Deep Forest for Small-scale Datasets Classification","authors":"Yimin Fan, L. Qi, Y. Tie","doi":"10.1109/ISNE.2019.8896445","DOIUrl":null,"url":null,"abstract":"It is very important to classify some small-scale datasets accurately in biology. With the rapid advancement of classification models, support vector machine(SVM), Random Forest(RF), Deep Forest, Convolutional Neural Networks(CNNs) are widely used. However, for small-scale datasets, CNNs always need massive datasets to train. Other methods usually can’t achieve better effects. Therefore, in this paper, a new forest model is proposed to solve the problems in small-scale datasets. It improves the classification performance through integrated learning method. The improved model is different from the primitive model in two important aspects. Firstly, considering the fitting quality of every forest, the standard deviation of some most major features in every forest make up a new feature to be transport in the next cascade layer. Secondly, the sub-layer structure is adapted to the cascade layer to increase the training opportunities. Experiments on five datasets demonstrate that our method has better classification effect than other classification models in the small-scale datasets.","PeriodicalId":405565,"journal":{"name":"2019 8th International Symposium on Next Generation Electronics (ISNE)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 8th International Symposium on Next Generation Electronics (ISNE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISNE.2019.8896445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
It is very important to classify some small-scale datasets accurately in biology. With the rapid advancement of classification models, support vector machine(SVM), Random Forest(RF), Deep Forest, Convolutional Neural Networks(CNNs) are widely used. However, for small-scale datasets, CNNs always need massive datasets to train. Other methods usually can’t achieve better effects. Therefore, in this paper, a new forest model is proposed to solve the problems in small-scale datasets. It improves the classification performance through integrated learning method. The improved model is different from the primitive model in two important aspects. Firstly, considering the fitting quality of every forest, the standard deviation of some most major features in every forest make up a new feature to be transport in the next cascade layer. Secondly, the sub-layer structure is adapted to the cascade layer to increase the training opportunities. Experiments on five datasets demonstrate that our method has better classification effect than other classification models in the small-scale datasets.