图形处理单元上的并行模式挖掘

Proceedings of the 14th International Carpathian Control Conference (ICCC) Pub Date : 2013-05-26 DOI:10.1109/CARPATHIANCC.2013.6560525

K. Hryniów

{"title":"图形处理单元上的并行模式挖掘","authors":"K. Hryniów","doi":"10.1109/CARPATHIANCC.2013.6560525","DOIUrl":null,"url":null,"abstract":"Frequent pattern mining is a field with many practical applications, where large computational power and speed are needed. Many state-of-the-art frequent pattern mining applications are an inefficient solutions for both shared memory and multiprocessor systems due to problems with parallelism and memory. One of possible solutions to the problem is the use of Graphics Processing Unit (GPU) in the system along with modification of classical pattern mining algorithms in such a way, that the sequential part of algorithm is run on host and the parallel part on GPU. Such solution allows for considerable speed-up (of up to two orders of magnitude), but for more complicated problems and FPM algorithms it can be hard to achieve. So far there were presented 3 modifications of the most basic Apriori algorithm for solving GPGPU (general-purpose computation on graphics hardware) problems. Each of proposed parallel implementations (PBI, TBI, GPA) is suited only for frequent itemset mining, furthermore even the best of them (TBI) obtains results only 2-5 times faster then CPU-based versions of FP-growth algorithm. On ICCC'12 there was presented parallel version of more complex GSP algorithm adjusted for GPGPU problems, capable of obtaining results 50-100 times faster then CPU-based version, with the same accuracy. Not only is the GPGPU implementation the fastest, but also it is suited for solving more complex frequent pattern mining problems. This paper presents extension of proposed modifications to more complex algorithms from Apriori family. The modifications are evaluated both theoretically and with use of experimental setup consisting of nVidia Tesla card and CUDA parallel computing platform. Solution proposed in the paper uses more complex, faster frequent pattern mining algorithms then GSP, making it well suited for solving real-time GPGPU problems for very large data sets.","PeriodicalId":373601,"journal":{"name":"Proceedings of the 14th International Carpathian Control Conference (ICCC)","volume":"1461 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Parallel pattern mining on Graphics Processing Units\",\"authors\":\"K. Hryniów\",\"doi\":\"10.1109/CARPATHIANCC.2013.6560525\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Frequent pattern mining is a field with many practical applications, where large computational power and speed are needed. Many state-of-the-art frequent pattern mining applications are an inefficient solutions for both shared memory and multiprocessor systems due to problems with parallelism and memory. One of possible solutions to the problem is the use of Graphics Processing Unit (GPU) in the system along with modification of classical pattern mining algorithms in such a way, that the sequential part of algorithm is run on host and the parallel part on GPU. Such solution allows for considerable speed-up (of up to two orders of magnitude), but for more complicated problems and FPM algorithms it can be hard to achieve. So far there were presented 3 modifications of the most basic Apriori algorithm for solving GPGPU (general-purpose computation on graphics hardware) problems. Each of proposed parallel implementations (PBI, TBI, GPA) is suited only for frequent itemset mining, furthermore even the best of them (TBI) obtains results only 2-5 times faster then CPU-based versions of FP-growth algorithm. On ICCC'12 there was presented parallel version of more complex GSP algorithm adjusted for GPGPU problems, capable of obtaining results 50-100 times faster then CPU-based version, with the same accuracy. Not only is the GPGPU implementation the fastest, but also it is suited for solving more complex frequent pattern mining problems. This paper presents extension of proposed modifications to more complex algorithms from Apriori family. The modifications are evaluated both theoretically and with use of experimental setup consisting of nVidia Tesla card and CUDA parallel computing platform. Solution proposed in the paper uses more complex, faster frequent pattern mining algorithms then GSP, making it well suited for solving real-time GPGPU problems for very large data sets.\",\"PeriodicalId\":373601,\"journal\":{\"name\":\"Proceedings of the 14th International Carpathian Control Conference (ICCC)\",\"volume\":\"1461 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 14th International Carpathian Control Conference (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CARPATHIANCC.2013.6560525\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 14th International Carpathian Control Conference (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CARPATHIANCC.2013.6560525","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

频繁模式挖掘是一个具有许多实际应用的领域，需要大量的计算能力和速度。由于并行性和内存问题，许多最先进的频繁模式挖掘应用程序对于共享内存和多处理器系统来说都是低效的解决方案。一种可能的解决方案是在系统中使用图形处理单元(GPU)，并对经典的模式挖掘算法进行修改，使算法的顺序部分在主机上运行，并行部分在GPU上运行。这样的解决方案允许相当大的加速(高达两个数量级)，但对于更复杂的问题和FPM算法，它可能很难实现。到目前为止，针对解决GPGPU(图形硬件上的通用计算)问题，提出了对最基本的Apriori算法的3种修改。每种提出的并行实现(PBI, TBI, GPA)都只适合频繁的项集挖掘，而且即使是最好的并行实现(TBI)也只比基于cpu版本的FP-growth算法快2-5倍。在ICCC'12上，提出了针对GPGPU问题调整的更复杂的GSP算法的并行版本，其获得结果的速度比基于cpu的版本快50-100倍，且精度相同。GPGPU不仅实现速度最快，而且适合解决更复杂的频繁模式挖掘问题。本文将提出的修正扩展到Apriori族中更复杂的算法。利用nVidia Tesla卡和CUDA并行计算平台组成的实验装置对这些改进进行了理论和实验评估。本文提出的解决方案使用了比GSP更复杂、更快的频繁模式挖掘算法，使其非常适合于解决超大数据集的实时GPGPU问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Parallel pattern mining on Graphics Processing Units

Frequent pattern mining is a field with many practical applications, where large computational power and speed are needed. Many state-of-the-art frequent pattern mining applications are an inefficient solutions for both shared memory and multiprocessor systems due to problems with parallelism and memory. One of possible solutions to the problem is the use of Graphics Processing Unit (GPU) in the system along with modification of classical pattern mining algorithms in such a way, that the sequential part of algorithm is run on host and the parallel part on GPU. Such solution allows for considerable speed-up (of up to two orders of magnitude), but for more complicated problems and FPM algorithms it can be hard to achieve. So far there were presented 3 modifications of the most basic Apriori algorithm for solving GPGPU (general-purpose computation on graphics hardware) problems. Each of proposed parallel implementations (PBI, TBI, GPA) is suited only for frequent itemset mining, furthermore even the best of them (TBI) obtains results only 2-5 times faster then CPU-based versions of FP-growth algorithm. On ICCC'12 there was presented parallel version of more complex GSP algorithm adjusted for GPGPU problems, capable of obtaining results 50-100 times faster then CPU-based version, with the same accuracy. Not only is the GPGPU implementation the fastest, but also it is suited for solving more complex frequent pattern mining problems. This paper presents extension of proposed modifications to more complex algorithms from Apriori family. The modifications are evaluated both theoretically and with use of experimental setup consisting of nVidia Tesla card and CUDA parallel computing platform. Solution proposed in the paper uses more complex, faster frequent pattern mining algorithms then GSP, making it well suited for solving real-time GPGPU problems for very large data sets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 14th International Carpathian Control Conference (ICCC)

自引率

0.00%

发文量