Portraying Fine-Grained Tenant Portrait for Churn Prediction Using Semi-Supervised Graph Convolution and Attention Network

IF 5.7 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Zuodong Jin;Peng Qi;Muyan Yao;Dan Tao
{"title":"Portraying Fine-Grained Tenant Portrait for Churn Prediction Using Semi-Supervised Graph Convolution and Attention Network","authors":"Zuodong Jin;Peng Qi;Muyan Yao;Dan Tao","doi":"10.1109/TBDATA.2025.3527200","DOIUrl":null,"url":null,"abstract":"With the widespread application of Big Data and intelligent information systems, the tenant has become the main form of most scenarios. As a data mining technique, the portrait has been widely used to provide targeted services. Therefore, we transfer the traditional user-driven portrait into tenant driven for churn prediction. To achieve it, this paper first proposes a three-layer architecture and defines the fine-grained features for creating portraits from the perspective of tenants. In a large-scale telecommunication industry dataset of 100,000 tenants, we construct the tenant portrait through the proposed framework, and analyze the influences of the defined features on churn possibility. Then, considering the information missing caused by privacy concerns, we come up with the <i>CrossMatch</i>, a portrait completion model based on semi-supervised and graph convolution, which combines the relation characteristics among tenants for recovering missing information. On this basis, we design the tenant churn prediction method based on a directed attention network. Moreover, we recover missing information on three public node datasets with <i>CrossMatch</i>, achieving around 1-2<inline-formula><tex-math>$\\%$</tex-math></inline-formula> improvement. We then apply the directed attention network for churn prediction and achieve an Accuracy of 75.06<inline-formula><tex-math>$\\%$</tex-math></inline-formula>, Precision of 77.78<inline-formula><tex-math>$\\%$</tex-math></inline-formula>, and F1-score of 71.43<inline-formula><tex-math>$\\%$</tex-math></inline-formula>, which outperforms all the baselines.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"11 5","pages":"2296-2307"},"PeriodicalIF":5.7000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10833781/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

With the widespread application of Big Data and intelligent information systems, the tenant has become the main form of most scenarios. As a data mining technique, the portrait has been widely used to provide targeted services. Therefore, we transfer the traditional user-driven portrait into tenant driven for churn prediction. To achieve it, this paper first proposes a three-layer architecture and defines the fine-grained features for creating portraits from the perspective of tenants. In a large-scale telecommunication industry dataset of 100,000 tenants, we construct the tenant portrait through the proposed framework, and analyze the influences of the defined features on churn possibility. Then, considering the information missing caused by privacy concerns, we come up with the CrossMatch, a portrait completion model based on semi-supervised and graph convolution, which combines the relation characteristics among tenants for recovering missing information. On this basis, we design the tenant churn prediction method based on a directed attention network. Moreover, we recover missing information on three public node datasets with CrossMatch, achieving around 1-2$\%$ improvement. We then apply the directed attention network for churn prediction and achieve an Accuracy of 75.06$\%$, Precision of 77.78$\%$, and F1-score of 71.43$\%$, which outperforms all the baselines.
利用半监督图卷积和注意力网络描绘细粒度租户画像用于客户流失预测
随着大数据和智能信息系统的广泛应用,租户已经成为大多数场景的主要形式。作为一种数据挖掘技术,画像已被广泛用于提供有针对性的服务。因此,我们将传统的用户驱动画像转换为租户驱动的流失预测。为了实现这一目标,本文首先提出了一个三层架构,并定义了从租户角度创建肖像的细粒度特征。在10万租户的大型电信行业数据集中,我们通过提出的框架构建了租户画像,并分析了定义的特征对流失可能性的影响。然后,考虑到隐私问题导致的信息缺失,我们提出了一种基于半监督和图卷积的画像补全模型CrossMatch,该模型结合租户之间的关系特征来恢复缺失的信息。在此基础上,设计了基于定向注意力网络的租户流失预测方法。此外,我们使用CrossMatch在三个公共节点数据集上恢复了缺失的信息,实现了大约1- 2%的改进。然后,我们将定向注意力网络应用于流失预测,并获得了75.06美元的准确度,77.78美元的精确度和71.43美元的f1分数,优于所有基线。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
11.80
自引率
2.80%
发文量
114
期刊介绍: The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信