Customer Churn Prediction by Classification Models in Machine Learning

Heng Zhao, Xumin Zuo, Yuanyuan Xie
{"title":"Customer Churn Prediction by Classification Models in Machine Learning","authors":"Heng Zhao, Xumin Zuo, Yuanyuan Xie","doi":"10.1109/ICEEE55327.2022.9772553","DOIUrl":null,"url":null,"abstract":"The classification model in machine learning has been employed to address different problems. Machine Learning classification is an effective method to realize customer churn prediction. This article provides a comparative study of machine learning from the perspective of predicting customer churn. Customer churn is one of the obstacles hindering the development of companies. Through classification approaches based on machine learning, customer churn can be predicted precisely, thus providing decision-making capabilities to these companies. Customer churn prediction is a typical classification problem and can be addressed by the use of a decision tree or random forest model. In this paper, decision tree and random forest models are employed to predict customer churn, using sales data provided by a chemical company from 2012 to 2020. We analyze the underlying risk of customer churn and the customer churn factors, including the low-priced count (LC), total amount of money (TM), and creation time (CT). The prediction results of the two models are evaluated by calculating various metrics. The experimental results indicate that the low-priced count (LC) is the most essential factor for customer churn. The results of the comparison, considering the training error and generalization error including: confusion matrix, ROC curve, AUC, precision, recall, and F1 score, reveal that the random forest model has better prediction accuracy than the decision tree model.","PeriodicalId":375340,"journal":{"name":"2022 9th International Conference on Electrical and Electronics Engineering (ICEEE)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th International Conference on Electrical and Electronics Engineering (ICEEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEEE55327.2022.9772553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The classification model in machine learning has been employed to address different problems. Machine Learning classification is an effective method to realize customer churn prediction. This article provides a comparative study of machine learning from the perspective of predicting customer churn. Customer churn is one of the obstacles hindering the development of companies. Through classification approaches based on machine learning, customer churn can be predicted precisely, thus providing decision-making capabilities to these companies. Customer churn prediction is a typical classification problem and can be addressed by the use of a decision tree or random forest model. In this paper, decision tree and random forest models are employed to predict customer churn, using sales data provided by a chemical company from 2012 to 2020. We analyze the underlying risk of customer churn and the customer churn factors, including the low-priced count (LC), total amount of money (TM), and creation time (CT). The prediction results of the two models are evaluated by calculating various metrics. The experimental results indicate that the low-priced count (LC) is the most essential factor for customer churn. The results of the comparison, considering the training error and generalization error including: confusion matrix, ROC curve, AUC, precision, recall, and F1 score, reveal that the random forest model has better prediction accuracy than the decision tree model.
基于机器学习分类模型的客户流失预测
机器学习中的分类模型已经被用来解决不同的问题。机器学习分类是实现客户流失预测的有效方法。本文从预测客户流失的角度对机器学习进行了比较研究。客户流失是阻碍公司发展的障碍之一。通过基于机器学习的分类方法,可以准确预测客户流失,从而为这些公司提供决策能力。客户流失预测是一个典型的分类问题,可以通过使用决策树或随机森林模型来解决。本文采用决策树和随机森林模型对某化工公司2012 - 2020年的销售数据进行客户流失预测。我们分析了客户流失的潜在风险和客户流失因素,包括低价计数(LC)、总金额(TM)和创建时间(CT)。通过计算各种指标,对两种模型的预测结果进行了评价。实验结果表明,低价计数(LC)是导致顾客流失的最重要因素。考虑混淆矩阵、ROC曲线、AUC、精度、召回率、F1分数等训练误差和泛化误差的比较结果表明,随机森林模型比决策树模型具有更好的预测精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信