使用图形处理器加速XML挖掘

S. Rathi, C. A. Dhote, Vivek Bangera
{"title":"使用图形处理器加速XML挖掘","authors":"S. Rathi, C. A. Dhote, Vivek Bangera","doi":"10.1109/ICCICCT.2014.6992945","DOIUrl":null,"url":null,"abstract":"Mining of association rules is an important research direction of data mining. Extensive use of XML on web makes it an interesting source for data extraction from large data sets. There is a growing demand for modern tools and technologies which can efficiently handle such large data. This paper proposes a collaborative approach to extract association rules from structured XML data with the help of cost effective and energy efficient Graphic Processors. The serial approach comprises of deserialization using XPath followed by parallel sorting. In the parallel model there is parallel deserialization of XML data with the help of graphic processor followed by sorting the converted XML data with the help of in-built multithreaded structure of GPU. An empirical performance study on synthetic data is given, demonstrating a remarkable speed increase on a GPU as compared with fully optimized CPU implementation.","PeriodicalId":6615,"journal":{"name":"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)","volume":"51 1","pages":"144-148"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Accelerating XML mining using graphic processors\",\"authors\":\"S. Rathi, C. A. Dhote, Vivek Bangera\",\"doi\":\"10.1109/ICCICCT.2014.6992945\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Mining of association rules is an important research direction of data mining. Extensive use of XML on web makes it an interesting source for data extraction from large data sets. There is a growing demand for modern tools and technologies which can efficiently handle such large data. This paper proposes a collaborative approach to extract association rules from structured XML data with the help of cost effective and energy efficient Graphic Processors. The serial approach comprises of deserialization using XPath followed by parallel sorting. In the parallel model there is parallel deserialization of XML data with the help of graphic processor followed by sorting the converted XML data with the help of in-built multithreaded structure of GPU. An empirical performance study on synthetic data is given, demonstrating a remarkable speed increase on a GPU as compared with fully optimized CPU implementation.\",\"PeriodicalId\":6615,\"journal\":{\"name\":\"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)\",\"volume\":\"51 1\",\"pages\":\"144-148\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCICCT.2014.6992945\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCICCT.2014.6992945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

关联规则挖掘是数据挖掘的一个重要研究方向。XML在web上的广泛使用使其成为从大型数据集中提取数据的有趣来源。人们对现代工具和技术的需求日益增长,这些工具和技术可以有效地处理如此大的数据。本文提出了一种协作的方法,在高效节能的图形处理器的帮助下,从结构化XML数据中提取关联规则。串行方法包括使用XPath进行反序列化,然后进行并行排序。在并行模型中,首先利用图形处理器对XML数据进行并行反序列化,然后利用GPU内置的多线程结构对转换后的XML数据进行排序。对合成数据的性能进行了实证研究,表明与完全优化的CPU实现相比,GPU上的速度有了显着的提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Accelerating XML mining using graphic processors
Mining of association rules is an important research direction of data mining. Extensive use of XML on web makes it an interesting source for data extraction from large data sets. There is a growing demand for modern tools and technologies which can efficiently handle such large data. This paper proposes a collaborative approach to extract association rules from structured XML data with the help of cost effective and energy efficient Graphic Processors. The serial approach comprises of deserialization using XPath followed by parallel sorting. In the parallel model there is parallel deserialization of XML data with the help of graphic processor followed by sorting the converted XML data with the help of in-built multithreaded structure of GPU. An empirical performance study on synthetic data is given, demonstrating a remarkable speed increase on a GPU as compared with fully optimized CPU implementation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信