Exploiting Graphic Card Processor Technology to Accelerate Data Mining Queries in SAP NetWeaver BIA

2008 IEEE International Conference on Data Mining Workshops Pub Date : 2008-12-15 DOI:10.1109/ICDMW.2008.61

Christoph Weyerhaeuser, Tobias Mindnich, Franz Färber, Wolfgang Lehner

{"title":"Exploiting Graphic Card Processor Technology to Accelerate Data Mining Queries in SAP NetWeaver BIA","authors":"Christoph Weyerhaeuser, Tobias Mindnich, Franz Färber, Wolfgang Lehner","doi":"10.1109/ICDMW.2008.61","DOIUrl":null,"url":null,"abstract":"Within business Intelligence contexts, the importance of data mining algorithms is continuously increasing, particularly from the perspective of applications and users that demand novel algorithms on the one hand and an efficient implementation exploiting novel system architectures on the other hand. Within this paper, we focus on the latter issue and report our experience with the exploitation of graphic card processor technology within the SAP NetWeaver business intelligence accelerator (BIA). The BIA represents a highly distributed analytical engine that supports OLAP and data mining processing primitives. The system organizes data entities in column-wise fashion and its operation is completely main-memory-based. Since case studies have shown that classic data mining queries spend a large portion of their runtime on scanning and filtering the data as a necessary prerequisite to the actual mining step, our main goal was to speed up this expensive scanning and filtering process. In a first step, the paper outlines the basic data mining processing techniques within SAP NetWeaver BIA and illustrates the implementation of scans and filters. In a second step, we give insight into the main features of a hybrid system architecture design exploiting graphic card processor technology. Finally, we sketch the implementation and give details of our vast evaluations.","PeriodicalId":175955,"journal":{"name":"2008 IEEE International Conference on Data Mining Workshops","volume":"15 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Conference on Data Mining Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2008.61","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

Within business Intelligence contexts, the importance of data mining algorithms is continuously increasing, particularly from the perspective of applications and users that demand novel algorithms on the one hand and an efficient implementation exploiting novel system architectures on the other hand. Within this paper, we focus on the latter issue and report our experience with the exploitation of graphic card processor technology within the SAP NetWeaver business intelligence accelerator (BIA). The BIA represents a highly distributed analytical engine that supports OLAP and data mining processing primitives. The system organizes data entities in column-wise fashion and its operation is completely main-memory-based. Since case studies have shown that classic data mining queries spend a large portion of their runtime on scanning and filtering the data as a necessary prerequisite to the actual mining step, our main goal was to speed up this expensive scanning and filtering process. In a first step, the paper outlines the basic data mining processing techniques within SAP NetWeaver BIA and illustrates the implementation of scans and filters. In a second step, we give insight into the main features of a hybrid system architecture design exploiting graphic card processor technology. Finally, we sketch the implementation and give details of our vast evaluations.

查看原文本刊更多论文

在商业智能上下文中，数据挖掘算法的重要性正在不断增加，特别是从应用程序和用户的角度来看，一方面需要新颖的算法，另一方面需要利用新颖的系统架构的有效实现。在本文中，我们将重点讨论后一个问题，并报告我们在SAP NetWeaver商业智能加速器(BIA)中开发图形卡处理器技术的经验。BIA代表了一个高度分布式的分析引擎，它支持OLAP和数据挖掘处理原语。该系统以列方式组织数据实体，其操作完全基于主存。由于案例研究表明，作为实际挖掘步骤的必要先决条件，经典的数据挖掘查询花费了很大一部分运行时用于扫描和过滤数据，因此我们的主要目标是加快这一昂贵的扫描和过滤过程。首先，本文概述了SAP NetWeaver BIA中的基本数据挖掘处理技术，并举例说明了扫描和过滤器的实现。在第二步中，我们深入了解了利用图形卡处理器技术的混合系统架构设计的主要特征。最后，我们概述了执行情况，并给出了我们大量评估的细节。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 IEEE International Conference on Data Mining Workshops

自引率

0.00%

发文量