{"title":"The expansibility research of K-Means algorithm under the GPU","authors":"Sujie Zhong, Sheng Lin, Guangping Xu, Kai Shi","doi":"10.1109/ICSESS.2016.7883172","DOIUrl":null,"url":null,"abstract":"K-Means algorithm is one of the most popular clustering analysis algorithm. Since the algorithm can be easily understood and implemented, and its execution is more efficient than common clustering algorithm, it has been used widely. At the same time, with the increasing size of the data sets processed, CPU-based serial K-Means implementation has been unable to meet the people's needof data processing. Parallel computing is considered well with the large data sets tasks. GPU-based concurrent computation can accelerate common tasks, and especially for accelerating the compute-intensive tasks. CUDA (Compute Unified Device Architecture) is one of the methods that achieving the GPU-based concurrent computation. In the paper, the author hope to achieve a K-Means algorithm implementation can handle larger data sets via CUDA and the algorithm can be used on a common computer with NVIDIA graphics cards.","PeriodicalId":175933,"journal":{"name":"2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS.2016.7883172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
K-Means algorithm is one of the most popular clustering analysis algorithm. Since the algorithm can be easily understood and implemented, and its execution is more efficient than common clustering algorithm, it has been used widely. At the same time, with the increasing size of the data sets processed, CPU-based serial K-Means implementation has been unable to meet the people's needof data processing. Parallel computing is considered well with the large data sets tasks. GPU-based concurrent computation can accelerate common tasks, and especially for accelerating the compute-intensive tasks. CUDA (Compute Unified Device Architecture) is one of the methods that achieving the GPU-based concurrent computation. In the paper, the author hope to achieve a K-Means algorithm implementation can handle larger data sets via CUDA and the algorithm can be used on a common computer with NVIDIA graphics cards.