Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming: Latest Publications

A scalable distance-1 vertex coloring algorithm for power-law graphs
J. Firoz, Marcin Zalewski, A. Lumsdaine
DOI: 10.1145/3178487.3178521 · Published: 2018-02-10
Abstract: We propose a distributed, unordered, label-correcting distance-1 vertex coloring algorithm, called the Distributed Control (DC) coloring algorithm. DC eliminates the need for vertex-centric barriers and global synchronization for color refinement, relying only on atomic operations and local termination detection to update vertex colors. We implement our DC coloring algorithm and the well-known Jones-Plassmann algorithm in the AM++ AMT runtime and compare their performance. We show that, with runtime support, eliminating the waiting time of vertex-centric barriers and investing that time in local ordering yields better execution times for power-law graphs with dense local subgraphs.
Citations: 1
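As context for readers unfamiliar with the problem, a distance-1 coloring assigns colors so that no two adjacent vertices share one. The minimal sequential greedy sketch below (illustrative names, not the paper's distributed DC algorithm) shows the invariant that the distributed algorithm maintains while refining colors asynchronously.

```cpp
#include <vector>

// Minimal sequential sketch of greedy distance-1 coloring (not the paper's
// distributed DC algorithm): each vertex receives the smallest color not
// already used by any of its neighbors, so adjacent vertices never share one.
std::vector<int> greedy_color(const std::vector<std::vector<int>>& adj) {
    const int n = static_cast<int>(adj.size());
    std::vector<int> color(n, -1);
    std::vector<char> used(n, 0);              // scratch: colors seen at v
    for (int v = 0; v < n; ++v) {
        for (int u : adj[v])
            if (color[u] >= 0) used[color[u]] = 1;
        int c = 0;
        while (used[c]) ++c;                   // smallest free color
        color[v] = c;
        for (int u : adj[v])                   // reset scratch for next vertex
            if (color[u] >= 0) used[color[u]] = 0;
    }
    return color;
}
```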
Strong trylocks for reader-writer locks
Andreia Correia, P. Ramalhete
DOI: 10.1145/3178487.3178519 · Published: 2018-02-10
Abstract: A reader-writer lock provides basic methods for shared and exclusive lock acquisition. A thread calling one of these methods may have to wait indefinitely to enter its critical section, with no guarantee of completion. We present two new reader-writer strong trylock algorithms, where a call to a trylock method always completes in a finite number of steps and is guaranteed to succeed unless there is a linearizable history in which another thread holds the lock. The first algorithm, named StrongTryRW, uses a single word of memory to reach consensus, thus yielding reduced scalability for readers. To address read scalability, we designed StrongTryRWRI, which matches the throughput of current state-of-the-art reader-writer lock algorithms.
Citations: 4
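For context, the single-word design point that the paper identifies as limiting reader scalability can be sketched as follows; this toy trylock offers only best-effort semantics under contention, not the strong guarantee proved for StrongTryRW and StrongTryRWRI.

```cpp
#include <atomic>
#include <cstdint>

// Minimal sketch (not the paper's StrongTryRW): a reader-writer trylock
// packed into one atomic word. Bit 31 marks a writer; low bits count readers.
class RWTryLock {
    static constexpr uint32_t WRITER = 1u << 31;
    std::atomic<uint32_t> state{0};
public:
    bool try_lock_shared() {
        uint32_t s = state.load(std::memory_order_relaxed);
        // Fail as soon as a writer is observed; otherwise keep trying to
        // register as a reader.
        while (!(s & WRITER)) {
            if (state.compare_exchange_weak(s, s + 1,
                                            std::memory_order_acquire))
                return true;
        }
        return false;
    }
    void unlock_shared() { state.fetch_sub(1, std::memory_order_release); }

    bool try_lock() {
        uint32_t expected = 0;   // succeeds only if no readers and no writer
        return state.compare_exchange_strong(expected, WRITER,
                                             std::memory_order_acquire);
    }
    void unlock() { state.fetch_and(~WRITER, std::memory_order_release); }
};
```

Because every reader must CAS the same word, read-side throughput collapses under contention, which is exactly the motivation for the second algorithm in the paper.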
Communication-avoiding parallel minimum cuts and connected components
Lukas Gianinazzi, Pavel Kalvoda, A. Palma, Maciej Besta, T. Hoefler
DOI: 10.1145/3178487.3178504 · Published: 2018-02-10
Abstract: We present novel scalable parallel algorithms for finding global minimum cuts and connected components, which are important and fundamental problems in graph processing. To take advantage of future massively parallel architectures, our algorithms are communication-avoiding: they reduce the costs of communication across the network and the cache hierarchy. The fundamental technique underlying our work is the randomized sparsification of a graph: removing a fraction of graph edges, deriving a solution for the sparsified graph, and using the result to obtain a solution for the original input. We design and implement sparsification with O(1) synchronization steps. Our global minimum cut algorithm decreases communication costs and computation compared to the state of the art, while our connected components algorithm incurs few cache misses and synchronization steps. We validate our approach by evaluating MPI implementations of the algorithms on a petascale supercomputer. We also provide an approximate variant of the minimum cut algorithm and show that it approximates the exact solutions well while using a fraction of the cores in a fraction of the time.
Citations: 33
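For reference, the sequential formulation of connected components that the paper's communication-avoiding algorithm accelerates can be expressed with a union-find structure; the sketch below is the baseline building block only, not the sparsification-based parallel algorithm.

```cpp
#include <numeric>
#include <utility>
#include <vector>

// Sequential union-find baseline for connected components (the problem the
// paper parallelizes); path halving plus union by size keep it near-linear.
struct UnionFind {
    std::vector<int> parent, size;
    explicit UnionFind(int n) : parent(n), size(n, 1) {
        std::iota(parent.begin(), parent.end(), 0);
    }
    int find(int x) {
        while (parent[x] != x) {
            parent[x] = parent[parent[x]];    // path halving
            x = parent[x];
        }
        return x;
    }
    void unite(int a, int b) {
        a = find(a); b = find(b);
        if (a == b) return;
        if (size[a] < size[b]) std::swap(a, b);
        parent[b] = a;
        size[a] += size[b];
    }
};

// Label every vertex with a representative of its connected component.
std::vector<int> connected_components(int n,
        const std::vector<std::pair<int, int>>& edges) {
    UnionFind uf(n);
    for (auto [u, v] : edges) uf.unite(u, v);
    std::vector<int> label(n);
    for (int v = 0; v < n; ++v) label[v] = uf.find(v);
    return label;
}
```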
Optimizing N-dimensional, Winograd-based convolution for manycore CPUs
Zhen Jia, A. Zlateski, F. Durand, Kai Li
DOI: 10.1145/3178487.3178496 · Published: 2018-02-10
Abstract: Recent work on Winograd-based convolution allows for a great reduction of computational complexity, but existing implementations are limited to 2D data and a single kernel size of 3 by 3. They achieve only slightly better, and often worse, performance than well-optimized direct convolution implementations. We propose and implement an algorithm for N-dimensional Winograd-based convolution that allows arbitrary kernel sizes and is optimized for manycore CPUs. Our algorithm achieves high hardware utilization through a series of optimizations. Our experiments show that on modern ConvNets, our optimized implementation is on average more than 3x, and sometimes 8x, faster than other state-of-the-art CPU implementations on Intel Xeon Phi manycore processors. Moreover, our implementation on the Xeon Phi achieves competitive performance for 2D ConvNets and superior performance for 3D ConvNets, compared with the best GPU implementations.
Citations: 45
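The arithmetic saving that the paper generalizes to N dimensions and arbitrary kernel sizes comes from Winograd's minimal filtering algorithms. The smallest instance, F(2,3), produces two outputs of a 1-D convolution with a 3-tap kernel using 4 multiplications instead of 6, as the standard textbook transform below illustrates (not the paper's manycore implementation).

```cpp
#include <array>

// Winograd minimal filtering F(2,3): two 1-D convolution outputs from a
// 3-tap kernel g applied to a 4-element input tile d, using 4 multiplies
// instead of the 6 needed by direct convolution.
std::array<float, 2> winograd_f23(const std::array<float, 4>& d,
                                  const std::array<float, 3>& g) {
    const float m1 = (d[0] - d[2]) * g[0];
    const float m2 = (d[1] + d[2]) * 0.5f * (g[0] + g[1] + g[2]);
    const float m3 = (d[2] - d[1]) * 0.5f * (g[0] - g[1] + g[2]);
    const float m4 = (d[1] - d[3]) * g[2];
    return { m1 + m2 + m3,          // = d0*g0 + d1*g1 + d2*g2
             m2 - m3 - m4 };        // = d1*g0 + d2*g1 + d3*g2
}
```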
Register-based implementation of the sparse general matrix-matrix multiplication on GPUs
Junhong Liu, Xinchao He, Weifeng Liu, Guangming Tan
DOI: 10.1145/3178487.3178529 · Published: 2018-02-10
Abstract: General sparse matrix-matrix multiplication (SpGEMM) is an essential building block in a number of applications. In our work, we fully utilize GPU registers and shared memory to implement an efficient and load-balanced SpGEMM in comparison with the existing implementations.
Citations: 13
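As background, the row-by-row (Gustavson) formulation of SpGEMM over CSR matrices is sketched below with a hash-map accumulator; the paper's contribution lies in replacing such accumulators with GPU registers and shared memory and in balancing the load across threads, which this CPU baseline does not attempt.

```cpp
#include <unordered_map>
#include <vector>

// CSR sparse matrix: row_ptr has nrows+1 entries; col/val hold the nonzeros.
struct CSR {
    int nrows = 0, ncols = 0;
    std::vector<int> row_ptr, col;
    std::vector<double> val;
};

// Row-by-row (Gustavson) SpGEMM sketch: C = A * B, accumulating each output
// row in a hash map. Illustrative baseline only, not the register-based GPU
// implementation described in the paper.
CSR spgemm(const CSR& A, const CSR& B) {
    CSR C;
    C.nrows = A.nrows;
    C.ncols = B.ncols;
    C.row_ptr.assign(A.nrows + 1, 0);
    for (int i = 0; i < A.nrows; ++i) {
        std::unordered_map<int, double> acc;   // column index -> partial sum
        for (int ka = A.row_ptr[i]; ka < A.row_ptr[i + 1]; ++ka) {
            const int k = A.col[ka];
            const double a = A.val[ka];
            for (int kb = B.row_ptr[k]; kb < B.row_ptr[k + 1]; ++kb)
                acc[B.col[kb]] += a * B.val[kb];
        }
        for (const auto& [j, v] : acc) {       // emit row i (unsorted columns)
            C.col.push_back(j);
            C.val.push_back(v);
        }
        C.row_ptr[i + 1] = static_cast<int>(C.col.size());
    }
    return C;
}
```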
Layrub: layer-centric GPU memory reuse and data migration in extreme-scale deep learning systems
Bo Liu, Wenbin Jiang, Hai Jin, Xuanhua Shi, Yang Ma
DOI: 10.1145/3178487.3178528 · Published: 2018-02-10
Abstract: The growing accuracy and robustness of Deep Neural Network (DNN) models are accompanied by growing model capacity (going deeper or wider). However, the high memory requirements of these models make it difficult to execute the training process on a single GPU. To address this, we first identify the memory usage characteristics of deep and wide convolutional networks, and demonstrate opportunities for memory reuse at both the intra-layer and inter-layer levels. We then present Layrub, a runtime data placement strategy that orchestrates the execution of the training process. It achieves layer-centric reuse to reduce memory consumption for extreme-scale deep learning that cannot be run on a single GPU.
Citations: 2
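A minimal illustration of the inter-layer reuse idea (hypothetical helper, not Layrub's actual implementation): if activations of different layers are not live at the same time, they can be served from a small shared pool instead of each layer holding a private buffer, bounding peak memory by the number of simultaneously live activations rather than by the number of layers.

```cpp
#include <cstddef>
#include <vector>

// Tiny sketch of inter-layer buffer reuse: layers borrow a scratch buffer
// from a shared pool and return it once their activation is no longer live.
// Hypothetical helper for illustration only, not Layrub itself.
class BufferPool {
    std::vector<std::vector<float>> free_;
public:
    std::vector<float> acquire(std::size_t elems) {
        if (!free_.empty()) {
            std::vector<float> buf = std::move(free_.back());
            free_.pop_back();
            buf.resize(elems);
            return buf;                 // reuse a previously released buffer
        }
        return std::vector<float>(elems);
    }
    void release(std::vector<float> buf) {
        free_.push_back(std::move(buf));
    }
};
```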
High-performance genomic analysis framework with in-memory computing
Xueqi Li, Guangming Tan, Bingchen Wang, Ninghui Sun
DOI: 10.1145/3178487.3178511 · Published: 2018-02-10
Abstract: In this paper, we propose an in-memory computing framework (called GPF) that provides a set of genomic formats, APIs, and a fast genomic engine for large-scale genomic data processing. GPF comprises two main components: (1) scalable genomic data formats and APIs, and (2) an advanced execution engine that supports efficient compression of genomic data and eliminates redundancies in execution. We further present both system-level and algorithm-specific implementations that let users build genomic analysis pipelines without any knowledge of Spark parallel programming. To test the performance of GPF, we built a WGS pipeline on top of it as a test case. Our experimental data indicate that GPF completes whole-genome sequencing (WGS) analysis of the 146.9-gigabase Human Platinum Genome in 24 minutes, with over 50% parallel efficiency on 2048 CPU cores. Together, our GPF framework provides a fast and general engine for large-scale genomic data processing with support for in-memory computing.
Citations: 6
Featherlight on-the-fly false-sharing detection
Milind Chabbi, Shasha Wen, Xu Liu
DOI: 10.1145/3178487.3178499 · Published: 2018-02-10
Abstract: Shared-memory parallel programs routinely suffer from false sharing: a performance degradation caused by different threads accessing different variables that reside on the same CPU cache line, where at least one of the variables is modified. State-of-the-art tools detect false sharing via a heavyweight process of logging memory accesses and feeding the ensuing access traces to an offline cache simulator. We have developed Feather, a lightweight, on-the-fly false-sharing detection tool. Feather achieves low overhead by exploiting two hardware features ubiquitous in commodity CPUs: performance monitoring units (PMUs) and debug registers. Additionally, Feather is a first-of-its-kind tool that detects false sharing in multi-process applications that use shared memory. Feather allowed us to scale false-sharing detection to myriad codes; it detected several false-sharing cases in important multi-core and multi-process codes, including previous PPoPP artifacts. Eliminating false sharing resulted in dramatic (up to 16x) speedups.
Citations: 18
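The pattern Feather detects is easy to reproduce: two threads updating adjacent counters that fall on the same 64-byte cache line slow each other down even though they never touch the same variable, and padding each counter to its own line removes the contention. The sketch below is a standalone illustration, independent of Feather itself.

```cpp
#include <atomic>
#include <thread>

// Two per-thread counters. In the first layout they share a 64-byte cache
// line (false sharing); in the padded layout each counter owns its own line.
struct SharedLine {
    std::atomic<long> a{0};
    std::atomic<long> b{0};              // same cache line as 'a'
};

struct Padded {
    alignas(64) std::atomic<long> a{0};
    alignas(64) std::atomic<long> b{0};  // separate cache line
};

template <typename Counters>
void run(Counters& c, long iters) {
    std::thread t1([&] { for (long i = 0; i < iters; ++i) c.a.fetch_add(1); });
    std::thread t2([&] { for (long i = 0; i < iters; ++i) c.b.fetch_add(1); });
    t1.join();
    t2.join();
}

int main() {
    SharedLine s;
    Padded p;
    run(s, 10'000'000);   // typically much slower: the cache line ping-pongs
    run(p, 10'000'000);   // padded version avoids the coherence traffic
}
```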
A persistent lock-free queue for non-volatile memory
Michal Friedman, M. Herlihy, Virendra J. Marathe, E. Petrank
DOI: 10.1145/3178487.3178490 · Published: 2018-02-10
Abstract: Non-volatile memory is expected to coexist with (or even displace) volatile DRAM as main memory in upcoming architectures. This has led to increasing interest in the problem of designing and specifying durable data structures that can recover from system crashes. Data structures may be designed to satisfy stricter or weaker durability guarantees, balancing the strength of the provided guarantees against performance overhead. This paper proposes three novel implementations of a concurrent lock-free queue. These implementations illustrate the algorithmic challenges in building persistent lock-free data structures with different levels of durability guarantees. In presenting these challenges, the proposed algorithmic designs, and the different durability guarantees, we hope to shed light on ways to build a wide variety of durable data structures. We implemented the various designs and compared their performance overhead to a simple queue design for standard (volatile) memory.
Citations: 110
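The central ordering discipline in a durable lock-free queue is that a node must be persisted before any pointer that makes it reachable, and a link must be persisted before the tail advances past it. The heavily simplified enqueue sketch below (x86 flush intrinsics; dequeue, recovery, and memory reclamation omitted; assumes the tail initially points at a dummy node) illustrates that discipline, not the paper's three algorithms.

```cpp
#include <atomic>
#include <immintrin.h>   // _mm_clflush, _mm_sfence

// Flush a cache line toward the persistence domain and order it before
// later stores become visible.
static void persist(const void* p) {
    _mm_clflush(p);
    _mm_sfence();
}

struct Node {
    int value;
    std::atomic<Node*> next{nullptr};
};

// Simplified durable enqueue in the Michael-Scott style: the node is persisted
// before it is linked, and the link is persisted before the tail is advanced,
// so a crash never exposes a reachable-but-unpersisted node. Sketch of the
// ordering discipline only, not the paper's full designs.
void enqueue(std::atomic<Node*>& tail, Node* node) {
    persist(node);                               // node contents durable first
    for (;;) {
        Node* last = tail.load();
        Node* next = last->next.load();
        if (next == nullptr) {
            if (last->next.compare_exchange_weak(next, node)) {
                persist(&last->next);            // link durable before tail moves
                tail.compare_exchange_strong(last, node);
                return;
            }
        } else {
            persist(&last->next);                // help: persist the pending link
            tail.compare_exchange_strong(last, next);
        }
    }
}
```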
Two concurrent data structures for efficient datalog query processing
Herbert Jordan, Bernhard Scholz, Pavle Subotic
DOI: 10.1145/3178487.3178525 · Published: 2018-02-10
Abstract: In recent years, Datalog has gained popularity for the implementation of advanced data analysis. Applications benefit from Datalog's high-level, declarative syntax and the availability of efficient algorithms for computing solutions. The efficiency of Datalog engines has reached a point where engines such as Soufflé have reported performance comparable to low-level hand-crafted alternatives [3].
Citations: 2