{"title":"一种改进的不平衡学习随机森林分类器","authors":"Weiping Lin, Jie Gao, Beizhan Wang, Qingqi Hong","doi":"10.1109/ICAICA52286.2021.9497933","DOIUrl":null,"url":null,"abstract":"There are many application scenarios involving imbalanced datasets, whereas many traditional machine learning methods have limited ability to adapt to this kind of data. These methods usually have a bias to identify the majority classes while the minority classes are more important in many cases. In this study, we propose a variant of the completely random forest called HCRF. To improve the classification performance of imbalanced data, we introduced 2 mechanisms: random hybrid-resampling and a cost function that focuses on the minority classes. Verified on several imbalanced datasets, HCRF outperforms all comparison methods, demonstrating excellent performance on imbalanced learning.","PeriodicalId":121979,"journal":{"name":"2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"An Improved Random Forest Classifier for Imbalanced Learning\",\"authors\":\"Weiping Lin, Jie Gao, Beizhan Wang, Qingqi Hong\",\"doi\":\"10.1109/ICAICA52286.2021.9497933\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There are many application scenarios involving imbalanced datasets, whereas many traditional machine learning methods have limited ability to adapt to this kind of data. These methods usually have a bias to identify the majority classes while the minority classes are more important in many cases. In this study, we propose a variant of the completely random forest called HCRF. To improve the classification performance of imbalanced data, we introduced 2 mechanisms: random hybrid-resampling and a cost function that focuses on the minority classes. Verified on several imbalanced datasets, HCRF outperforms all comparison methods, demonstrating excellent performance on imbalanced learning.\",\"PeriodicalId\":121979,\"journal\":{\"name\":\"2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAICA52286.2021.9497933\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICA52286.2021.9497933","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Improved Random Forest Classifier for Imbalanced Learning
There are many application scenarios involving imbalanced datasets, whereas many traditional machine learning methods have limited ability to adapt to this kind of data. These methods usually have a bias to identify the majority classes while the minority classes are more important in many cases. In this study, we propose a variant of the completely random forest called HCRF. To improve the classification performance of imbalanced data, we introduced 2 mechanisms: random hybrid-resampling and a cost function that focuses on the minority classes. Verified on several imbalanced datasets, HCRF outperforms all comparison methods, demonstrating excellent performance on imbalanced learning.