User Archetype Discovery By Cluster Analysis of Caller Log Data: Tenure Evolution is Stable as Time Period Reduces

Robin Turkington, M. Mulvenna, R. Bond, S. O’neill, C. Armour
{"title":"User Archetype Discovery By Cluster Analysis of Caller Log Data: Tenure Evolution is Stable as Time Period Reduces","authors":"Robin Turkington, M. Mulvenna, R. Bond, S. O’neill, C. Armour","doi":"10.1145/3335082.3335090","DOIUrl":null,"url":null,"abstract":"Clustering analysis, or clustering, is an activity which can be applied to user event log data to determine the types of users which exist within a service, and can be used to gain insights into the client base by their behaviour. However, when applied to longitudinal user event log data, clustering can potentially misclassify regular users as ’one-off’ if their last interaction within their tenure of the service appears at the beginning of the observable data set. The main objective of this study was to investigate whether any impact of user tenure within longitudinal data on k-means clustering accuracy would occur. The current paper subjected a large telephony call log data set from a helpline to a k-means clustering algorithm to determine the types of callers that contact the helpline based on their usage characteristics (number of calls, mean duration of calls and variability of call duration). A threshold of one-month increments were applied to the data (callers appearing before the threshold but not after were removed each time) and then subsequently subjected to k-means clustering. Results showed that cluster structures remained stable after each threshold condition. Significant differences in cluster centers were found in one cluster across tenure conditions.","PeriodicalId":279162,"journal":{"name":"Proceedings of the 31st European Conference on Cognitive Ergonomics","volume":"99 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 31st European Conference on Cognitive Ergonomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3335082.3335090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Clustering analysis, or clustering, is an activity which can be applied to user event log data to determine the types of users which exist within a service, and can be used to gain insights into the client base by their behaviour. However, when applied to longitudinal user event log data, clustering can potentially misclassify regular users as ’one-off’ if their last interaction within their tenure of the service appears at the beginning of the observable data set. The main objective of this study was to investigate whether any impact of user tenure within longitudinal data on k-means clustering accuracy would occur. The current paper subjected a large telephony call log data set from a helpline to a k-means clustering algorithm to determine the types of callers that contact the helpline based on their usage characteristics (number of calls, mean duration of calls and variability of call duration). A threshold of one-month increments were applied to the data (callers appearing before the threshold but not after were removed each time) and then subsequently subjected to k-means clustering. Results showed that cluster structures remained stable after each threshold condition. Significant differences in cluster centers were found in one cluster across tenure conditions.
通过对调用者日志数据的聚类分析发现用户原型:保留期随着时间的缩短而稳定演变
聚类分析或聚类是一种可以应用于用户事件日志数据的活动,以确定服务中存在的用户类型,并且可以通过他们的行为来了解客户群。然而,当应用于纵向用户事件日志数据时,如果常规用户在其服务期限内的最后一次交互出现在可观察数据集的开始,聚类可能会将其错误地分类为“一次性”。本研究的主要目的是调查纵向数据中的用户任期是否会对k-means聚类精度产生影响。本论文将求助热线的大量电话呼叫记录数据集应用于k-means聚类算法,以根据其使用特征(呼叫数量、呼叫平均持续时间和呼叫持续时间的可变性)确定联系求助热线的呼叫者类型。对数据应用一个月增量的阈值(每次删除在阈值之前出现的调用者,而不是在阈值之后出现的调用者),然后进行k-means聚类。结果表明,在每个阈值条件下,聚类结构都保持稳定。在一个集群中,在保留期条件下,集群中心存在显著差异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信