J. Fuentes, Weiyu Chen, Guei-Yuan Lueh, I. Scherson
{"title":"A Lock-Free Skiplist for Integrated Graphics Processing Units","authors":"J. Fuentes, Weiyu Chen, Guei-Yuan Lueh, I. Scherson","doi":"10.1109/IPDPSW.2019.00015","DOIUrl":null,"url":null,"abstract":"With the advent of computing systems with on-die integrated graphics processing unit (iGPU), new general-purpose GPU programming challenges have emerged from these heterogeneous processors. We propose a lock-free skiplist for Intel's integrated graphics processor that is optimized to achieve the best performance using the C for Media framework. To the best of our knowledge, this is the first implementation of a lock-free data structure for iGPU. Experimental results show that our proposal is more compute-efficient than an existing discrete GPU implementation and outperforms state-of-the-art lock-free and lock-based skiplists for multi-core CPU, achieving up to 3.5x speedup. Additionally, energy savings of up to 300% are obtained when running different skiplist workloads on iGPU instead of CPU cores, hence further improving energy efficiency.","PeriodicalId":292054,"journal":{"name":"2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW.2019.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
With the advent of computing systems with on-die integrated graphics processing unit (iGPU), new general-purpose GPU programming challenges have emerged from these heterogeneous processors. We propose a lock-free skiplist for Intel's integrated graphics processor that is optimized to achieve the best performance using the C for Media framework. To the best of our knowledge, this is the first implementation of a lock-free data structure for iGPU. Experimental results show that our proposal is more compute-efficient than an existing discrete GPU implementation and outperforms state-of-the-art lock-free and lock-based skiplists for multi-core CPU, achieving up to 3.5x speedup. Additionally, energy savings of up to 300% are obtained when running different skiplist workloads on iGPU instead of CPU cores, hence further improving energy efficiency.
随着带有片上集成图形处理单元(iGPU)的计算系统的出现,这些异构处理器出现了新的通用GPU编程挑战。我们为英特尔的集成图形处理器提出了一个无锁跳线列表,该跳线列表使用C for Media框架进行了优化,以达到最佳性能。据我们所知,这是第一个实现iGPU的无锁数据结构。实验结果表明,我们的提议比现有的离散GPU实现更具计算效率,并且优于多核CPU的最先进的无锁和基于锁的跳跃器,实现高达3.5倍的加速。此外,在iGPU而不是CPU内核上运行不同的skip - list工作负载,可以节省高达300%的能源,从而进一步提高能源效率。