GPU下K-Means算法的可扩展性研究

Sujie Zhong, Sheng Lin, Guangping Xu, Kai Shi
{"title":"GPU下K-Means算法的可扩展性研究","authors":"Sujie Zhong, Sheng Lin, Guangping Xu, Kai Shi","doi":"10.1109/ICSESS.2016.7883172","DOIUrl":null,"url":null,"abstract":"K-Means algorithm is one of the most popular clustering analysis algorithm. Since the algorithm can be easily understood and implemented, and its execution is more efficient than common clustering algorithm, it has been used widely. At the same time, with the increasing size of the data sets processed, CPU-based serial K-Means implementation has been unable to meet the people's needof data processing. Parallel computing is considered well with the large data sets tasks. GPU-based concurrent computation can accelerate common tasks, and especially for accelerating the compute-intensive tasks. CUDA (Compute Unified Device Architecture) is one of the methods that achieving the GPU-based concurrent computation. In the paper, the author hope to achieve a K-Means algorithm implementation can handle larger data sets via CUDA and the algorithm can be used on a common computer with NVIDIA graphics cards.","PeriodicalId":175933,"journal":{"name":"2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"The expansibility research of K-Means algorithm under the GPU\",\"authors\":\"Sujie Zhong, Sheng Lin, Guangping Xu, Kai Shi\",\"doi\":\"10.1109/ICSESS.2016.7883172\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"K-Means algorithm is one of the most popular clustering analysis algorithm. Since the algorithm can be easily understood and implemented, and its execution is more efficient than common clustering algorithm, it has been used widely. At the same time, with the increasing size of the data sets processed, CPU-based serial K-Means implementation has been unable to meet the people's needof data processing. Parallel computing is considered well with the large data sets tasks. GPU-based concurrent computation can accelerate common tasks, and especially for accelerating the compute-intensive tasks. CUDA (Compute Unified Device Architecture) is one of the methods that achieving the GPU-based concurrent computation. In the paper, the author hope to achieve a K-Means algorithm implementation can handle larger data sets via CUDA and the algorithm can be used on a common computer with NVIDIA graphics cards.\",\"PeriodicalId\":175933,\"journal\":{\"name\":\"2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSESS.2016.7883172\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS.2016.7883172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

K-Means算法是目前最流行的聚类分析算法之一。由于该算法易于理解和实现,并且其执行效率比普通聚类算法高,因此得到了广泛的应用。同时,随着处理的数据集规模的不断增大,基于cpu的串行K-Means实现已经不能满足人们对数据处理的需求。并行计算被认为是处理大型数据集任务的好方法。基于gpu的并发计算可以加速普通任务,特别是对计算密集型任务的加速。CUDA(计算统一设备架构)是实现基于gpu的并发计算的方法之一。在本文中,作者希望通过CUDA实现可以处理更大数据集的K-Means算法,并且该算法可以在带有NVIDIA显卡的普通计算机上使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The expansibility research of K-Means algorithm under the GPU
K-Means algorithm is one of the most popular clustering analysis algorithm. Since the algorithm can be easily understood and implemented, and its execution is more efficient than common clustering algorithm, it has been used widely. At the same time, with the increasing size of the data sets processed, CPU-based serial K-Means implementation has been unable to meet the people's needof data processing. Parallel computing is considered well with the large data sets tasks. GPU-based concurrent computation can accelerate common tasks, and especially for accelerating the compute-intensive tasks. CUDA (Compute Unified Device Architecture) is one of the methods that achieving the GPU-based concurrent computation. In the paper, the author hope to achieve a K-Means algorithm implementation can handle larger data sets via CUDA and the algorithm can be used on a common computer with NVIDIA graphics cards.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信