{"title":"Improved K-Means Algorithm and Application in Customer Segmentation","authors":"X. Qin, Shijue Zheng, Ying Huang, Guangsheng Deng","doi":"10.1109/APWCS.2010.63","DOIUrl":null,"url":null,"abstract":"Nowadays, clustering algorithms are widely used in the commercial field, such as customer analysis, and this application has achieved good effect. K-means algorithm is by far the most commonly used method for clustering. Although, the time consumption is fairly high when faced with lager-scale data. In this paper, we improved the K-means algorithm. Our improvement is based on the triangle inequality theorem. We use the improved algorithm to carry out a case study in the customer classification. The experimental results show that the improved method indeed lead to lower time consumption, and therefore more effective for large-scale dataset.","PeriodicalId":354322,"journal":{"name":"2010 Asia-Pacific Conference on Wearable Computing Systems","volume":"394 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Asia-Pacific Conference on Wearable Computing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APWCS.2010.63","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Nowadays, clustering algorithms are widely used in the commercial field, such as customer analysis, and this application has achieved good effect. K-means algorithm is by far the most commonly used method for clustering. Although, the time consumption is fairly high when faced with lager-scale data. In this paper, we improved the K-means algorithm. Our improvement is based on the triangle inequality theorem. We use the improved algorithm to carry out a case study in the customer classification. The experimental results show that the improved method indeed lead to lower time consumption, and therefore more effective for large-scale dataset.