{"title":"集成算法在电信数据客户流失预测中的分析实现:比较研究","authors":"R. Sari, Ferdy Febriyanto, Ahmad Cahyono Adi","doi":"10.31449/inf.v47i7.4797","DOIUrl":null,"url":null,"abstract":"Globalization and technological advancements in the telecommunication industry have led to a significant rise in the number of operators, leading to intense market competition. This sector has become crucial in developed countries, and companies strive to increase profits by acquiring new customers, up-selling existing ones, and extending the retention period of current clients. In the traditional method of defect prediction, a single classifier is used to build a model on a pre-labeled dataset. However, this approach has limitations in predicting defects accurately under certain circumstances. To overcome these limitations, boosting is applied to combine multiple weak classifiers and create a robust classification model. Among many algorithms used for churn prediction, ensemble techniques have demonstrated greater accuracy than simpler approaches. This study aims to overcome these limitations by experimenting with five ensemble algorithms, including Adaboost, Gradient Boost, XGBoost, CatBoost, and LightGBM. The results indicate that XGBoost outperforms other techniques and is the most suitable algorithm to build the predictive model. Additionally, the study achieves higher accuracy by performing a Grid Search CV hyper-parameter setting with XGBoost, resulting in an accuracy of 81.2%. Povzetek: Študija je primerjala pet ansambelskih algoritmov za napovedovanje prekinitve naročniškega razmerja. Rezultati kažejo, da je XGBoost najboljši algoritem z natančnostjo 81,2 %.","PeriodicalId":35802,"journal":{"name":"Informatica (Slovenia)","volume":"23 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Analysis Implementation of the Ensemble Algorithm in Predicting Customer Churn in Telco Data: A Comparative Study\",\"authors\":\"R. Sari, Ferdy Febriyanto, Ahmad Cahyono Adi\",\"doi\":\"10.31449/inf.v47i7.4797\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Globalization and technological advancements in the telecommunication industry have led to a significant rise in the number of operators, leading to intense market competition. This sector has become crucial in developed countries, and companies strive to increase profits by acquiring new customers, up-selling existing ones, and extending the retention period of current clients. In the traditional method of defect prediction, a single classifier is used to build a model on a pre-labeled dataset. However, this approach has limitations in predicting defects accurately under certain circumstances. To overcome these limitations, boosting is applied to combine multiple weak classifiers and create a robust classification model. Among many algorithms used for churn prediction, ensemble techniques have demonstrated greater accuracy than simpler approaches. This study aims to overcome these limitations by experimenting with five ensemble algorithms, including Adaboost, Gradient Boost, XGBoost, CatBoost, and LightGBM. The results indicate that XGBoost outperforms other techniques and is the most suitable algorithm to build the predictive model. Additionally, the study achieves higher accuracy by performing a Grid Search CV hyper-parameter setting with XGBoost, resulting in an accuracy of 81.2%. Povzetek: Študija je primerjala pet ansambelskih algoritmov za napovedovanje prekinitve naročniškega razmerja. Rezultati kažejo, da je XGBoost najboljši algoritem z natančnostjo 81,2 %.\",\"PeriodicalId\":35802,\"journal\":{\"name\":\"Informatica (Slovenia)\",\"volume\":\"23 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Informatica (Slovenia)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31449/inf.v47i7.4797\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatica (Slovenia)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31449/inf.v47i7.4797","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 1
摘要
电信行业的全球化和技术进步导致运营商数量大幅增加,导致市场竞争激烈。这一领域在发达国家已经变得至关重要,公司努力通过获取新客户、向上销售现有客户和延长现有客户的保留期来增加利润。在传统的缺陷预测方法中,使用单个分类器在预标记的数据集上建立模型。然而,在某些情况下,这种方法在准确预测缺陷方面存在局限性。为了克服这些限制,将增强应用于组合多个弱分类器并创建一个鲁棒分类模型。在许多用于客户流失预测的算法中,集成技术已经证明比简单的方法更准确。本研究旨在通过实验五种集成算法来克服这些限制,包括Adaboost, Gradient Boost, XGBoost, CatBoost和LightGBM。结果表明,XGBoost优于其他技术,是最适合构建预测模型的算法。此外,该研究通过使用XGBoost执行Grid Search CV超参数设置实现了更高的准确性,准确度达到81.2%。Povzetek: Študija je primerjala pet ansambelskih algoritmov za napovedovanje prekinve naročniškega razmerja。Rezultati kažejo, da je XGBoost najboljši算法项目z natan nostojo 81,2 %。
Analysis Implementation of the Ensemble Algorithm in Predicting Customer Churn in Telco Data: A Comparative Study
Globalization and technological advancements in the telecommunication industry have led to a significant rise in the number of operators, leading to intense market competition. This sector has become crucial in developed countries, and companies strive to increase profits by acquiring new customers, up-selling existing ones, and extending the retention period of current clients. In the traditional method of defect prediction, a single classifier is used to build a model on a pre-labeled dataset. However, this approach has limitations in predicting defects accurately under certain circumstances. To overcome these limitations, boosting is applied to combine multiple weak classifiers and create a robust classification model. Among many algorithms used for churn prediction, ensemble techniques have demonstrated greater accuracy than simpler approaches. This study aims to overcome these limitations by experimenting with five ensemble algorithms, including Adaboost, Gradient Boost, XGBoost, CatBoost, and LightGBM. The results indicate that XGBoost outperforms other techniques and is the most suitable algorithm to build the predictive model. Additionally, the study achieves higher accuracy by performing a Grid Search CV hyper-parameter setting with XGBoost, resulting in an accuracy of 81.2%. Povzetek: Študija je primerjala pet ansambelskih algoritmov za napovedovanje prekinitve naročniškega razmerja. Rezultati kažejo, da je XGBoost najboljši algoritem z natančnostjo 81,2 %.
期刊介绍:
Informatica is an international refereed journal with its base in Europe. It has entered its 33th year of publication. It publishes papers addressing all issues of interests to computer professionals: from scientific and technical to educational, commercial and industrial. It also publishes critical examinations of existing publications, news about major practical achievements and innovations in the computer and information industry, as well as conference announcements and reports.