{"title":"基于机器学习的用户流失预警研究","authors":"Zhou Yiran, Wang Lei, Liu Wei, Xiao Tao","doi":"10.1145/3598438.3598461","DOIUrl":null,"url":null,"abstract":"In the current competitive communication industry, how to avoid user churn has become an important issue for enterprises. The development of big data technology has provided a new way for communication companies to predict subscriber churn. This paper predicts user churn based on 900,000 data from QD Mobile using a dataset processed by random sampling. The accuracy of three algorithms, including decision tree, random forest and AdaBoost classifier, is compared for user churn prediction. The random forest algorithm is found to be the most accurate for user churn prediction. Based on this, grid search algorithm in machine learning is used to find the best parameters of the random forest model and improve the prediction accuracy to 81.84%. The result can help communication companies to predict subscriber churn probability and improve competition level.","PeriodicalId":338722,"journal":{"name":"Proceedings of the 2022 3rd International Symposium on Big Data and Artificial Intelligence","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on User Churn Warning based on Machine Learning\",\"authors\":\"Zhou Yiran, Wang Lei, Liu Wei, Xiao Tao\",\"doi\":\"10.1145/3598438.3598461\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the current competitive communication industry, how to avoid user churn has become an important issue for enterprises. The development of big data technology has provided a new way for communication companies to predict subscriber churn. This paper predicts user churn based on 900,000 data from QD Mobile using a dataset processed by random sampling. The accuracy of three algorithms, including decision tree, random forest and AdaBoost classifier, is compared for user churn prediction. The random forest algorithm is found to be the most accurate for user churn prediction. Based on this, grid search algorithm in machine learning is used to find the best parameters of the random forest model and improve the prediction accuracy to 81.84%. The result can help communication companies to predict subscriber churn probability and improve competition level.\",\"PeriodicalId\":338722,\"journal\":{\"name\":\"Proceedings of the 2022 3rd International Symposium on Big Data and Artificial Intelligence\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 3rd International Symposium on Big Data and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3598438.3598461\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 3rd International Symposium on Big Data and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3598438.3598461","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on User Churn Warning based on Machine Learning
In the current competitive communication industry, how to avoid user churn has become an important issue for enterprises. The development of big data technology has provided a new way for communication companies to predict subscriber churn. This paper predicts user churn based on 900,000 data from QD Mobile using a dataset processed by random sampling. The accuracy of three algorithms, including decision tree, random forest and AdaBoost classifier, is compared for user churn prediction. The random forest algorithm is found to be the most accurate for user churn prediction. Based on this, grid search algorithm in machine learning is used to find the best parameters of the random forest model and improve the prediction accuracy to 81.84%. The result can help communication companies to predict subscriber churn probability and improve competition level.