Sara Daoudi, C. Zouaoui, M. C. El-Mezouar, N. Taleb
{"title":"K-Means算法的CPU/GPU并行实现的比较研究","authors":"Sara Daoudi, C. Zouaoui, M. C. El-Mezouar, N. Taleb","doi":"10.1109/ICAEE47123.2019.9014783","DOIUrl":null,"url":null,"abstract":"The K-Means algorithm is one of the most sophisticated and known algorithms for data-clustering. In this study, we will show the K-Means algorithm as it relates to OpenCL, which is a widespread parallel ecosystem that is reliable for processing and mining datasets that are large in scale. Additionally, we propose a comparative study of the three most efficient K-means algorithm implementations: The Lloyd-Forgy’s sequential Method Implementation, a parallel implementation targeting the CPU using OpenMP and finally one of the most complex implementations that uses an OpenCL language. Typically, the measure of performance is done using different data sizes. For large datasets under OpenCL, when comparing the GPU-based parallel algorithm to the CPU-based serial algorithm, the results have shown a good acceleration effect. On the other hand, for small data sets, the OpenMP implementation has turned out to be the best choice.","PeriodicalId":197612,"journal":{"name":"2019 International Conference on Advanced Electrical Engineering (ICAEE)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Comparative study of parallel CPU/GPU implementations of the K-Means Algorithm\",\"authors\":\"Sara Daoudi, C. Zouaoui, M. C. El-Mezouar, N. Taleb\",\"doi\":\"10.1109/ICAEE47123.2019.9014783\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The K-Means algorithm is one of the most sophisticated and known algorithms for data-clustering. In this study, we will show the K-Means algorithm as it relates to OpenCL, which is a widespread parallel ecosystem that is reliable for processing and mining datasets that are large in scale. Additionally, we propose a comparative study of the three most efficient K-means algorithm implementations: The Lloyd-Forgy’s sequential Method Implementation, a parallel implementation targeting the CPU using OpenMP and finally one of the most complex implementations that uses an OpenCL language. Typically, the measure of performance is done using different data sizes. For large datasets under OpenCL, when comparing the GPU-based parallel algorithm to the CPU-based serial algorithm, the results have shown a good acceleration effect. On the other hand, for small data sets, the OpenMP implementation has turned out to be the best choice.\",\"PeriodicalId\":197612,\"journal\":{\"name\":\"2019 International Conference on Advanced Electrical Engineering (ICAEE)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Advanced Electrical Engineering (ICAEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAEE47123.2019.9014783\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Advanced Electrical Engineering (ICAEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAEE47123.2019.9014783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Comparative study of parallel CPU/GPU implementations of the K-Means Algorithm
The K-Means algorithm is one of the most sophisticated and known algorithms for data-clustering. In this study, we will show the K-Means algorithm as it relates to OpenCL, which is a widespread parallel ecosystem that is reliable for processing and mining datasets that are large in scale. Additionally, we propose a comparative study of the three most efficient K-means algorithm implementations: The Lloyd-Forgy’s sequential Method Implementation, a parallel implementation targeting the CPU using OpenMP and finally one of the most complex implementations that uses an OpenCL language. Typically, the measure of performance is done using different data sizes. For large datasets under OpenCL, when comparing the GPU-based parallel algorithm to the CPU-based serial algorithm, the results have shown a good acceleration effect. On the other hand, for small data sets, the OpenMP implementation has turned out to be the best choice.