Churn Identification in Microblogs using Convolutional Neural Networks with Structured Logical Knowledge

NUT@EMNLP Pub Date : 2017-09-01 DOI:10.18653/v1/W17-4403

Mourad Gridach, Hatem Haddad, Hala Mulki

{"title":"Churn Identification in Microblogs using Convolutional Neural Networks with Structured Logical Knowledge","authors":"Mourad Gridach, Hatem Haddad, Hala Mulki","doi":"10.18653/v1/W17-4403","DOIUrl":null,"url":null,"abstract":"For brands, gaining new customer is more expensive than keeping an existing one. Therefore, the ability to keep customers in a brand is becoming more challenging these days. Churn happens when a customer leaves a brand to another competitor. Most of the previous work considers the problem of churn prediction using the Call Detail Records (CDRs). In this paper, we use micro-posts to classify customers into churny or non-churny. We explore the power of convolutional neural networks (CNNs) since they achieved state-of-the-art in various computer vision and NLP applications. However, the robustness of end-to-end models has some limitations such as the availability of a large amount of labeled data and uninterpretability of these models. We investigate the use of CNNs augmented with structured logic rules to overcome or reduce this issue. We developed our system called Churn_teacher by using an iterative distillation method that transfers the knowledge, extracted using just the combination of three logic rules, directly into the weight of the DNNs. Furthermore, we used weight normalization to speed up training our convolutional neural networks. Experimental results showed that with just these three rules, we were able to get state-of-the-art on publicly available Twitter dataset about three Telecom brands.","PeriodicalId":207795,"journal":{"name":"NUT@EMNLP","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NUT@EMNLP","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/W17-4403","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

Abstract

For brands, gaining new customer is more expensive than keeping an existing one. Therefore, the ability to keep customers in a brand is becoming more challenging these days. Churn happens when a customer leaves a brand to another competitor. Most of the previous work considers the problem of churn prediction using the Call Detail Records (CDRs). In this paper, we use micro-posts to classify customers into churny or non-churny. We explore the power of convolutional neural networks (CNNs) since they achieved state-of-the-art in various computer vision and NLP applications. However, the robustness of end-to-end models has some limitations such as the availability of a large amount of labeled data and uninterpretability of these models. We investigate the use of CNNs augmented with structured logic rules to overcome or reduce this issue. We developed our system called Churn_teacher by using an iterative distillation method that transfers the knowledge, extracted using just the combination of three logic rules, directly into the weight of the DNNs. Furthermore, we used weight normalization to speed up training our convolutional neural networks. Experimental results showed that with just these three rules, we were able to get state-of-the-art on publicly available Twitter dataset about three Telecom brands.

查看原文本刊更多论文

基于结构化逻辑知识卷积神经网络的微博用户流失识别

对于品牌来说，获得新客户比保持现有客户的成本更高。因此，如今品牌留住顾客的能力变得越来越具有挑战性。顾客流失发生在顾客离开一个品牌而投向另一个竞争对手的时候。以前的大部分工作都考虑了使用呼叫详细记录(cdr)进行客户流失预测的问题。在本文中，我们使用微博将客户划分为“粘性”和“非粘性”。我们探索卷积神经网络(cnn)的力量，因为它们在各种计算机视觉和NLP应用中达到了最先进的水平。然而，端到端模型的鲁棒性存在一些限制，如大量标记数据的可用性和这些模型的不可解释性。我们研究了使用带有结构化逻辑规则的cnn来克服或减少这个问题。我们开发了一个名为Churn_teacher的系统，通过使用迭代蒸馏方法，将仅使用三个逻辑规则的组合提取的知识直接转移到dnn的权重中。此外，我们使用权值归一化来加速卷积神经网络的训练。实验结果表明，仅凭这三条规则，我们就能够获得有关三个电信品牌的公开可用Twitter数据集的最新技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

NUT@EMNLP

自引率

0.00%

发文量