{"title":"基于Borderline-SMOTE和随机森林的客户流失智能预测模型研究","authors":"L. Feng","doi":"10.1109/ICPICS55264.2022.9873702","DOIUrl":null,"url":null,"abstract":"In the era of big data, with the rapid development of Internet finance and the increasing pressure on banks to compete, banks have turned their goal to retaining old customers, so it is essential to predict customer churn. Around this problem, this paper designs a Borderline-SMOTE-random forest prediction model for unbalanced data such as bank customers. Combined with the oversampling algorithm, it can better solve the unbalanced data and the strong anti-noise ability of random forest. OOB error rate, AUC, Precision, Recall, and F-mean are used as the evaluation indicators of the model, and KNN, decision tree and Naive Bayes are used as comparisons. The experimental results show that the Borderline-SMOTE-random forest prediction model has the best ability to solve the problem of bank customer churn prediction among the models, and its performance is improved by about 4% compared with other models.","PeriodicalId":257180,"journal":{"name":"2022 IEEE 4th International Conference on Power, Intelligent Computing and Systems (ICPICS)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Research on Customer Churn Intelligent Prediction Model based on Borderline-SMOTE and Random Forest\",\"authors\":\"L. Feng\",\"doi\":\"10.1109/ICPICS55264.2022.9873702\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the era of big data, with the rapid development of Internet finance and the increasing pressure on banks to compete, banks have turned their goal to retaining old customers, so it is essential to predict customer churn. Around this problem, this paper designs a Borderline-SMOTE-random forest prediction model for unbalanced data such as bank customers. Combined with the oversampling algorithm, it can better solve the unbalanced data and the strong anti-noise ability of random forest. OOB error rate, AUC, Precision, Recall, and F-mean are used as the evaluation indicators of the model, and KNN, decision tree and Naive Bayes are used as comparisons. The experimental results show that the Borderline-SMOTE-random forest prediction model has the best ability to solve the problem of bank customer churn prediction among the models, and its performance is improved by about 4% compared with other models.\",\"PeriodicalId\":257180,\"journal\":{\"name\":\"2022 IEEE 4th International Conference on Power, Intelligent Computing and Systems (ICPICS)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 4th International Conference on Power, Intelligent Computing and Systems (ICPICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICPICS55264.2022.9873702\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 4th International Conference on Power, Intelligent Computing and Systems (ICPICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPICS55264.2022.9873702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Customer Churn Intelligent Prediction Model based on Borderline-SMOTE and Random Forest
In the era of big data, with the rapid development of Internet finance and the increasing pressure on banks to compete, banks have turned their goal to retaining old customers, so it is essential to predict customer churn. Around this problem, this paper designs a Borderline-SMOTE-random forest prediction model for unbalanced data such as bank customers. Combined with the oversampling algorithm, it can better solve the unbalanced data and the strong anti-noise ability of random forest. OOB error rate, AUC, Precision, Recall, and F-mean are used as the evaluation indicators of the model, and KNN, decision tree and Naive Bayes are used as comparisons. The experimental results show that the Borderline-SMOTE-random forest prediction model has the best ability to solve the problem of bank customer churn prediction among the models, and its performance is improved by about 4% compared with other models.