Robin Turkington, M. Mulvenna, R. Bond, S. O’neill, C. Armour
{"title":"通过对调用者日志数据的聚类分析发现用户原型:保留期随着时间的缩短而稳定演变","authors":"Robin Turkington, M. Mulvenna, R. Bond, S. O’neill, C. Armour","doi":"10.1145/3335082.3335090","DOIUrl":null,"url":null,"abstract":"Clustering analysis, or clustering, is an activity which can be applied to user event log data to determine the types of users which exist within a service, and can be used to gain insights into the client base by their behaviour. However, when applied to longitudinal user event log data, clustering can potentially misclassify regular users as ’one-off’ if their last interaction within their tenure of the service appears at the beginning of the observable data set. The main objective of this study was to investigate whether any impact of user tenure within longitudinal data on k-means clustering accuracy would occur. The current paper subjected a large telephony call log data set from a helpline to a k-means clustering algorithm to determine the types of callers that contact the helpline based on their usage characteristics (number of calls, mean duration of calls and variability of call duration). A threshold of one-month increments were applied to the data (callers appearing before the threshold but not after were removed each time) and then subsequently subjected to k-means clustering. Results showed that cluster structures remained stable after each threshold condition. Significant differences in cluster centers were found in one cluster across tenure conditions.","PeriodicalId":279162,"journal":{"name":"Proceedings of the 31st European Conference on Cognitive Ergonomics","volume":"99 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"User Archetype Discovery By Cluster Analysis of Caller Log Data: Tenure Evolution is Stable as Time Period Reduces\",\"authors\":\"Robin Turkington, M. Mulvenna, R. Bond, S. O’neill, C. Armour\",\"doi\":\"10.1145/3335082.3335090\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering analysis, or clustering, is an activity which can be applied to user event log data to determine the types of users which exist within a service, and can be used to gain insights into the client base by their behaviour. However, when applied to longitudinal user event log data, clustering can potentially misclassify regular users as ’one-off’ if their last interaction within their tenure of the service appears at the beginning of the observable data set. The main objective of this study was to investigate whether any impact of user tenure within longitudinal data on k-means clustering accuracy would occur. The current paper subjected a large telephony call log data set from a helpline to a k-means clustering algorithm to determine the types of callers that contact the helpline based on their usage characteristics (number of calls, mean duration of calls and variability of call duration). A threshold of one-month increments were applied to the data (callers appearing before the threshold but not after were removed each time) and then subsequently subjected to k-means clustering. Results showed that cluster structures remained stable after each threshold condition. Significant differences in cluster centers were found in one cluster across tenure conditions.\",\"PeriodicalId\":279162,\"journal\":{\"name\":\"Proceedings of the 31st European Conference on Cognitive Ergonomics\",\"volume\":\"99 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 31st European Conference on Cognitive Ergonomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3335082.3335090\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 31st European Conference on Cognitive Ergonomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3335082.3335090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
User Archetype Discovery By Cluster Analysis of Caller Log Data: Tenure Evolution is Stable as Time Period Reduces
Clustering analysis, or clustering, is an activity which can be applied to user event log data to determine the types of users which exist within a service, and can be used to gain insights into the client base by their behaviour. However, when applied to longitudinal user event log data, clustering can potentially misclassify regular users as ’one-off’ if their last interaction within their tenure of the service appears at the beginning of the observable data set. The main objective of this study was to investigate whether any impact of user tenure within longitudinal data on k-means clustering accuracy would occur. The current paper subjected a large telephony call log data set from a helpline to a k-means clustering algorithm to determine the types of callers that contact the helpline based on their usage characteristics (number of calls, mean duration of calls and variability of call duration). A threshold of one-month increments were applied to the data (callers appearing before the threshold but not after were removed each time) and then subsequently subjected to k-means clustering. Results showed that cluster structures remained stable after each threshold condition. Significant differences in cluster centers were found in one cluster across tenure conditions.