{"title":"XJoin","authors":"Eugenio Marinelli, Raja Appuswamy","doi":"10.1145/3465998.3466012","DOIUrl":"https://doi.org/10.1145/3465998.3466012","url":null,"abstract":"Modern server hardware is increasingly heterogeneous with a diverse mix of XPU architectures deployed across CPU, GPU, and FPGAs. However, till date, database developers have had to rely on either proprietary, architecture-specific solutions (like CUDA), or low-level, cross-architecture solutions that complicate development (like OpenCL). The lack of portable parallelism caused by the absence of a common high-level programming framework is one of the main reasons preventing a wider adoption of XPUs by database systems. In this paper, we take the first steps towards solving this problem using oneAPI-a cross-industry effort for developing an open, standards-based unified programming model that extends standard C++ to provide portable parallelism across diverse processor architectures. In particular, we port a recently-proposed, highly-optimized, GPU-based hash join algorithm from CUDA to Data Parallel C++ (DPC++). We then execute the hash join on multicore CPUs, integrated GPUs (Intel GEN9), and discrete GPUs (Intel DG1 and NVIDIA GeForce) without changing a single line of kernel code to demonstrate that DPC++ enables portable parallelism. We compare the performance of DPC++ kernels with hand-optimized CUDA kernels and model-based theoretical performance bounds to demonstrate the performance-portability trade off in using DPC++.","PeriodicalId":382224,"journal":{"name":"Proceedings of the 17th International Workshop on Data Management on New Hardware (DaMoN 2021)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125927605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"GalOP","authors":"Nils Boeschen, Carsten Binnig","doi":"10.1145/3465998.3466007","DOIUrl":"https://doi.org/10.1145/3465998.3466007","url":null,"abstract":"In this paper, we present GalOP --- a GPU-accelerated main memory DBMS for OLTP. At the core GalOP is based on a novel deterministic concurrency scheme for GPUs which orders conflicting transactions before the execution on the GPU. In our initial evaluation, we show that GalOP can provide robust performance for high and low conflict scenarios and outperforms recent CPU-based schemes by up to 10x.","PeriodicalId":382224,"journal":{"name":"Proceedings of the 17th International Workshop on Data Management on New Hardware (DaMoN 2021)","volume":"207 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114756560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}