{"title":"Customer Churn Prediction Based on the Decision Tree in Personal Handyphone System Service","authors":"Luo Bin, Shao Peiji, Liu Juan","doi":"10.1109/ICSSSM.2007.4280145","DOIUrl":null,"url":null,"abstract":"Nowadays, churn prediction and management is critical for more and more companies in the fast changing and strongly competitive telecommunication market. In order to improve customer retention, telecommunication companies must be able to predict customers at risk who are prone to switch service provider. In this study, to overcome the limitations of lack of information of customers of Personal Handyphone System Service (PHSS) and to build an effective and accurate customer churn model, three research experimentations (changing sub-periods for training data sets, changing misclassification cost in churn model, changing sample methods for training data sets) are put forward to improve the prediction performance of churn model by using decision tree which is used widely, some optimal parameters (the time of sub-period being 10 days, misclassification cost being 1:5, and random sample method for train set) of models are found under the help of three research experimentations. The empirical evaluation results suggest that customer churn models built have a good performance through the course of model optical selecting, and show that the methods and techniques proposed are effective and feasible under the condition that information of customers is very little and class distribution is skewed. This study benefits not only churn prediction research and practice but also other data mining applications with similar characteristics.","PeriodicalId":153603,"journal":{"name":"2007 International Conference on Service Systems and Service Management","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"65","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Conference on Service Systems and Service Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSSSM.2007.4280145","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 65
Abstract
Nowadays, churn prediction and management is critical for more and more companies in the fast changing and strongly competitive telecommunication market. In order to improve customer retention, telecommunication companies must be able to predict customers at risk who are prone to switch service provider. In this study, to overcome the limitations of lack of information of customers of Personal Handyphone System Service (PHSS) and to build an effective and accurate customer churn model, three research experimentations (changing sub-periods for training data sets, changing misclassification cost in churn model, changing sample methods for training data sets) are put forward to improve the prediction performance of churn model by using decision tree which is used widely, some optimal parameters (the time of sub-period being 10 days, misclassification cost being 1:5, and random sample method for train set) of models are found under the help of three research experimentations. The empirical evaluation results suggest that customer churn models built have a good performance through the course of model optical selecting, and show that the methods and techniques proposed are effective and feasible under the condition that information of customers is very little and class distribution is skewed. This study benefits not only churn prediction research and practice but also other data mining applications with similar characteristics.
如今,在快速变化和竞争激烈的电信市场中,客户流失预测和管理对越来越多的公司来说至关重要。为了提高客户保留率,电信公司必须能够预测有可能更换服务提供商的风险客户。为了克服个人电话系统服务(Personal Handyphone System Service, PHSS)客户信息缺乏的局限性,建立有效准确的客户流失模型,本研究提出了三个研究实验(改变训练数据集子周期、改变流失模型错分类成本、改变训练数据集样本方法),利用决策树提高客户流失模型的预测性能。在三个研究实验的帮助下,找到了模型的一些最优参数(子周期时间为10天,误分类代价为1:5,训练集随机抽样方法)。实证评价结果表明,通过模型光学选择建立的客户流失模型具有良好的性能,并表明所提出的方法和技术在客户信息很少、阶层分布偏斜的情况下是有效可行的。该研究不仅对客户流失预测的研究和实践有益,而且对其他具有类似特征的数据挖掘应用也有借鉴意义。