Preventing customer churn by using random forests modeling
Authors: Weiyun Ying, Xiu Li, Yaya Xie, Ellis L. Johnson
Published in: 2008 IEEE International Conference on Information Reuse and Integration (2008-07-13)
DOI: 10.1109/IRI.2008.4583069
In this paper, we use improved balanced random forests (IBRF) to predict customer churn, integrating a sampling technique and cost-sensitive learning into standard random forests to achieve better performance than most existing algorithms. The essence of IBRF is that the best features are learned iteratively by altering the class distribution and by placing higher penalties on misclassification of the minority class. Applied to a credit debt customer database of an anonymous commercial bank in China, IBRF is shown to significantly improve prediction accuracy compared with other algorithms, such as artificial neural networks, decision trees, and class-weighted core support vector machines (CWC-SVM). We assess and compare these algorithms to analyze their characteristics. The data processing and sampling scheme are also described in detail.
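The two ingredients named in the abstract, altering the class distribution by sampling and penalizing minority-class errors more heavily, can be illustrated with a small sketch. This is not the authors' IBRF algorithm, whose details the abstract does not give. It is a toy ensemble of one-split trees (stumps) that assumes binary labels (1 = churner, the minority class), a hypothetical 1:2 class-weight ratio, and made-up data; every function name below is invented for illustration.

```python
import random
from collections import Counter

def balanced_bootstrap(X, y, rng):
    """Draw a bootstrap sample with equal counts from each class (labels 0/1)."""
    by_class = {0: [], 1: []}
    for i, label in enumerate(y):
        by_class[label].append(i)
    n = min(len(by_class[0]), len(by_class[1]))  # size of the smaller class
    idx = [rng.choice(by_class[c]) for c in (0, 1) for _ in range(n)]
    return [X[i] for i in idx], [y[i] for i in idx]

def weighted_gini(labels, class_weight):
    """Gini impurity in which each sample counts with its class weight."""
    totals = Counter()
    for label in labels:
        totals[label] += class_weight[label]
    total = sum(totals.values())
    if total == 0:
        return 0.0
    return 1.0 - sum((w / total) ** 2 for w in totals.values())

def weighted_vote(labels, class_weight):
    """Weighted majority label; ties and empty leaves go to class 1 (assumption)."""
    score = sum(class_weight[l] if l == 1 else -class_weight[l] for l in labels)
    return 1 if score >= 0 else 0

def fit_stump(X, y, class_weight, rng):
    """Fit a one-split tree on one randomly chosen feature."""
    feature = rng.randrange(len(X[0]))
    best = None
    for threshold in sorted({row[feature] for row in X}):
        left = [lab for row, lab in zip(X, y) if row[feature] <= threshold]
        right = [lab for row, lab in zip(X, y) if row[feature] > threshold]
        w_left = sum(class_weight[l] for l in left)
        w_right = sum(class_weight[l] for l in right)
        score = (w_left * weighted_gini(left, class_weight)
                 + w_right * weighted_gini(right, class_weight))
        if best is None or score < best[0]:
            best = (score, threshold, left, right)
    _, threshold, left, right = best
    return (feature, threshold,
            weighted_vote(left, class_weight),
            weighted_vote(right, class_weight))

def fit_forest(X, y, n_trees=25, class_weight=None, seed=0):
    """Train stumps, each on its own balanced bootstrap sample."""
    class_weight = class_weight or {0: 1.0, 1: 2.0}  # assumed penalty ratio
    rng = random.Random(seed)
    stumps = []
    for _ in range(n_trees):
        Xb, yb = balanced_bootstrap(X, y, rng)
        stumps.append(fit_stump(Xb, yb, class_weight, rng))
    return stumps

def predict(stumps, row):
    """Majority vote over all stumps; ties go to class 1 (assumption)."""
    votes = sum(right if row[f] > t else left for f, t, left, right in stumps)
    return 1 if 2 * votes >= len(stumps) else 0

# Synthetic data for illustration only: 16 non-churners, 4 churners.
X_toy = [[float(i)] for i in range(20)]
y_toy = [0] * 16 + [1] * 4
forest = fit_forest(X_toy, y_toy)
```

Each stump sees as many churners as non-churners, so the minority class is not drowned out during training, and the class weights make a misclassified churner cost more than a misclassified non-churner when splits and leaf labels are chosen. In practice a full tree learner would replace the stumps, and the weights would be tuned on held-out data.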