基于线程MPI的Epiphany多核协处理器并行编程模型

Proceedings of the 3rd International Workshop on Many-core Embedded Systems Pub Date : 2015-06-13 DOI:10.1145/2768177.2768183

J. Ross, D. Richie, S. Park, D. Shires

{"title":"基于线程MPI的Epiphany多核协处理器并行编程模型","authors":"J. Ross, D. Richie, S. Park, D. Shires","doi":"10.1145/2768177.2768183","DOIUrl":null,"url":null,"abstract":"The Adapteva Epiphany many-core architecture comprises a 2D tiled mesh Network-on-Chip (NoC) of low-power RISC cores with minimal uncore functionality. It offers high computational energy efficiency for both integer and floating point calculations as well as parallel scalability. Yet despite the interesting architectural features, a compelling programming model has not been presented to date. This paper demonstrates an efficient parallel programming model for the Epiphany architecture based on the Message Passing Interface (MPI) standard. Using MPI exploits the similarities between the Epiphany architecture and a conventional parallel distributed cluster of serial cores. Our approach enables MPI codes to execute on the RISC array processor with little modification and achieve high performance. We report benchmark results for the threaded MPI implementation of four algorithms (dense matrix-matrix multiplication, N-body particle interaction, a five-point 2D stencil update, and 2D FFT) and highlight the importance of fast inter-core communication for the architecture.","PeriodicalId":374555,"journal":{"name":"Proceedings of the 3rd International Workshop on Many-core Embedded Systems","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Parallel Programming Model for the Epiphany Many-Core Coprocessor Using Threaded MPI\",\"authors\":\"J. Ross, D. Richie, S. Park, D. Shires\",\"doi\":\"10.1145/2768177.2768183\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Adapteva Epiphany many-core architecture comprises a 2D tiled mesh Network-on-Chip (NoC) of low-power RISC cores with minimal uncore functionality. It offers high computational energy efficiency for both integer and floating point calculations as well as parallel scalability. Yet despite the interesting architectural features, a compelling programming model has not been presented to date. This paper demonstrates an efficient parallel programming model for the Epiphany architecture based on the Message Passing Interface (MPI) standard. Using MPI exploits the similarities between the Epiphany architecture and a conventional parallel distributed cluster of serial cores. Our approach enables MPI codes to execute on the RISC array processor with little modification and achieve high performance. We report benchmark results for the threaded MPI implementation of four algorithms (dense matrix-matrix multiplication, N-body particle interaction, a five-point 2D stencil update, and 2D FFT) and highlight the importance of fast inter-core communication for the architecture.\",\"PeriodicalId\":374555,\"journal\":{\"name\":\"Proceedings of the 3rd International Workshop on Many-core Embedded Systems\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd International Workshop on Many-core Embedded Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2768177.2768183\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd International Workshop on Many-core Embedded Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2768177.2768183","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 25

摘要

Adapteva Epiphany多核架构包括一个2D平铺网格片上网络(NoC)的低功耗RISC内核，具有最小的非核心功能。它为整数和浮点计算以及并行可扩展性提供了很高的计算能效。然而，尽管有有趣的体系结构特性，迄今为止还没有一个引人注目的编程模型。本文提出了一种基于消息传递接口(Message Passing Interface, MPI)标准的Epiphany架构的高效并行编程模型。使用MPI利用了Epiphany架构与传统的串行核并行分布式集群之间的相似性。我们的方法使MPI代码能够在RISC阵列处理器上执行，几乎没有修改，并实现高性能。我们报告了四种算法(密集矩阵-矩阵乘法、n体粒子相互作用、五点二维模板更新和二维FFT)的线程MPI实现的基准测试结果，并强调了快速核间通信对该架构的重要性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Parallel Programming Model for the Epiphany Many-Core Coprocessor Using Threaded MPI

The Adapteva Epiphany many-core architecture comprises a 2D tiled mesh Network-on-Chip (NoC) of low-power RISC cores with minimal uncore functionality. It offers high computational energy efficiency for both integer and floating point calculations as well as parallel scalability. Yet despite the interesting architectural features, a compelling programming model has not been presented to date. This paper demonstrates an efficient parallel programming model for the Epiphany architecture based on the Message Passing Interface (MPI) standard. Using MPI exploits the similarities between the Epiphany architecture and a conventional parallel distributed cluster of serial cores. Our approach enables MPI codes to execute on the RISC array processor with little modification and achieve high performance. We report benchmark results for the threaded MPI implementation of four algorithms (dense matrix-matrix multiplication, N-body particle interaction, a five-point 2D stencil update, and 2D FFT) and highlight the importance of fast inter-core communication for the architecture.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 3rd International Workshop on Many-core Embedded Systems

自引率

0.00%

发文量