2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW): Latest Papers, Page 2

Dataflow Programming for Stream Processing

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) Pub Date : 2017-10-01 DOI: 10.1109/SBAC-PADW.2017.26

Marcos Paulo Rocha, F. França, A. S. Nery, Leandro S. Guedes

Citations: 1

Efficient Pathfinding Co-Processors for FPGAs

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) Pub Date : 2017-10-01 DOI: 10.1109/SBAC-PADW.2017.25

A. S. Nery, A. Sena, Leandro S. Guedes

Abstract: Pathfinding algorithms are at the heart of several classes of applications, such as network appliances (routing), GPS navigation and autonomous cars, which are related to recent trends in Artificial Intelligence and the Internet of Things (IoT). Moreover, advances in semiconductor miniaturization technologies have enabled the design of efficient Systems-on-Chip (SoC) devices, with demanding performance requirements and energy consumption constraints. Such systems might include Field Programmable Gate Arrays (FPGAs) to allow the design of customized co-processors that yield lower power consumption and higher performance. Therefore, this work aims at designing and evaluating four efficient pathfinding co-processors, each one implementing a different well-known pathfinding algorithm: breadth-first, Dijkstra, greedy and A*. Each co-processor is designed using the Xilinx High-Level Synthesis (HLS) compiler and is implemented in the programmable logic of a Xilinx FPGA embedded with an ARM microprocessor, which is in charge of controlling the set of co-processors. Extensive performance, circuit-area and energy consumption results show that each co-processor can efficiently execute a pathfinding algorithm, paving the way for novel dedicated accelerators.

Citations: 4
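
Illustrative note on the four algorithms named in the abstract of "Efficient Pathfinding Co-Processors for FPGAs": the sketch below is not the authors' HLS design. It is a minimal software illustration, assuming a 4-connected grid with unit step costs and a Manhattan-distance heuristic (both assumptions, not taken from the paper). The function name `find_path` and the `strategy` strings are hypothetical. Under these assumptions the algorithms differ only in how the frontier is ordered, and breadth-first and Dijkstra coincide because every step costs 1.

```python
import heapq


def find_path(grid, start, goal, strategy="a-star"):
    """Best-first search on a 4-connected grid (0 = free cell, 1 = wall).

    strategy is one of "breadth-first", "dijkstra", "greedy", "a-star".
    Returns the list of cells from start to goal, or None if unreachable.
    """
    def heuristic(cell):
        # Manhattan distance to the goal (an assumed heuristic).
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    def priority(cost, cell):
        if strategy == "greedy":
            return heuristic(cell)          # estimated distance only
        if strategy == "a-star":
            return cost + heuristic(cell)   # path cost plus estimate
        return cost                         # breadth-first / Dijkstra

    rows, cols = len(grid), len(grid[0])
    frontier = [(priority(0, start), 0, start)]
    best_cost = {start: 0}
    parent = {start: None}

    while frontier:
        _, cost, cell = heapq.heappop(frontier)
        if cell == goal:
            # Walk the parent links back to the start.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1]
        if cost > best_cost[cell]:
            continue  # stale queue entry; a cheaper route was found later
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                new_cost = cost + 1
                if new_cost < best_cost.get(nxt, float("inf")):
                    best_cost[nxt] = new_cost
                    parent[nxt] = cell
                    heapq.heappush(frontier, (priority(new_cost, nxt), new_cost, nxt))
    return None
```

For example, `find_path([[0, 0], [0, 0]], (0, 0), (1, 1), "dijkstra")` returns a three-cell path from corner to corner. The greedy strategy is usually the cheapest to run but, unlike the other three, does not guarantee a shortest path in general.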

Automatic Scan Parallelization in OpenMP

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) Pub Date : 2017-10-01 DOI: 10.1109/SBAC-PADW.2017.23

Maicol Zegarra, M. Pereira, X. Martorell, G. Araújo

Abstract: Prefix Scan (or simply scan) is an operator that computes all the partial sums of a vector. A scan operation results in a vector where each element is the sum of the preceding elements in the original vector up to the corresponding position. Scan is a key operation in many relevant problems such as sorting, lexical analysis, string comparison and image filtering, among others. Although there are libraries that provide hand-parallelized implementations of scan in CUDA and OpenCL, no automatic parallelization solution exists for this operator in OpenMP. This paper proposes a new clause for OpenMP which enables the automatic synthesis of the parallel scan. By using the proposed clause, a programmer can considerably reduce the complexity of designing scan-based algorithms, and can thus focus on the problem rather than on learning new parallel programming models or languages. Scan was designed in AClang, an open-source LLVM/Clang compiler framework that implements the recently released OpenMP 4.X Accelerator Programming Model. Experiments running a set of typical scan-based algorithms on NVIDIA, Intel, and ARM GPUs reveal that the performance of the proposed OpenMP clause is equivalent to that achieved when using OpenCL library calls, with the advantage of lower programming complexity.

Citations: 1
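
Illustrative note on the scan operator defined in the abstract of "Automatic Scan Parallelization in OpenMP": the sketch below is not the proposed OpenMP clause and not AClang's implementation. It is a minimal Python illustration, with hypothetical function names, of an inclusive scan and of one common blocked decomposition, which shows why the operation can be parallelized even though each output depends on all earlier inputs.

```python
from itertools import accumulate


def inclusive_scan(values):
    """Sequential reference: element i is the sum of values[0..i]."""
    return list(accumulate(values))


def blocked_scan(values, num_blocks=4):
    """Three-phase scan; phases 1 and 3 are independent per block.

    This version runs the blocks one after another. It only shows the
    decomposition that a parallel implementation could exploit.
    """
    n = len(values)
    if n == 0:
        return []
    size = -(-n // num_blocks)  # ceiling division: elements per block
    blocks = [values[i:i + size] for i in range(0, n, size)]

    # Phase 1: scan each block on its own (parallelizable).
    local = [list(accumulate(block)) for block in blocks]

    # Phase 2: exclusive scan over the block totals (small, sequential).
    totals = [block[-1] for block in local]
    offsets = [0] + list(accumulate(totals))[:-1]

    # Phase 3: add each block's offset to its elements (parallelizable).
    return [x + off for block, off in zip(local, offsets) for x in block]
```

For the input `[3, 1, 4, 1, 5, 9, 2, 6]`, both functions return `[3, 4, 8, 9, 14, 23, 25, 31]`.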

A Case Study of Performance Optimization in a Heterogeneous Environment

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) Pub Date : 2017-10-01 DOI: 10.1109/SBAC-PADW.2017.11

Leandro Pereira, C. Bentes, Maria Clicia Stelling de Castro, E. Garcia

Citations: 0

HPSM: A Programming Framework for Multi-CPU and Multi-GPU Systems

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) Pub Date : 2017-10-01 DOI: 10.1109/SBAC-PADW.2017.14

J. F. Lima, D. D. Domenico

Citations: 3

A Communication Protocol for Fog Computing Based on Network Coding Applied to Wireless Sensors

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) Pub Date : 2017-10-01 DOI: 10.1109/SBAC-PADW.2017.27

B. Marques, I. M. Coelho, A. Sena, M. D. Castro

Citations: 11

Assessing Sparse Triangular Linear System Solvers on GPUs

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) Pub Date : 2017-10-01 DOI: 10.1109/SBAC-PADW.2017.15

Daniel Erguiz, Ernesto Dufrechu, P. Ezzatti

Citations: 14

Automatic Partitioning of Stencil Computations on Heterogeneous Systems

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) Pub Date : 2017-10-01 DOI: 10.1109/SBAC-PADW.2017.16

Alyson D. Pereira, Rodrigo C. O. Rocha, Luiz E. Ramos, M. Castro, L. F. Góes

Citations: 4