Speedup analysis of data-parallel applications on Multi-core NoCs

2009 IEEE 8th International Conference on ASIC Pub Date : 2009-12-11 DOI:10.1109/ASICON.2009.5351597

Xiaowen Chen, Zhonghai Lu, A. Jantsch, Shuming Chen

引用次数: 11

Abstract

As more computing cores are integrated onto a single chip, the effect of network communication latency is becoming more and more significant on Multi-core Network-on-Chips (NoCs). For data-parallel applications, we study the model of parallel speedup by including network communication latency in Amdahl's law. The speedup analysis considers the effect of network topology, network size, traffic model and computation/communication ratio. We also study the speedup efficiency. In our Multi-core NoC platform, a real data-parallel application, i.e. matrix multiplication, is used to validate the analysis. Our theoretical analysis and the application results show that the speedup improvement is nonlinear and the speedup efficiency decreases as the system size is scaled up. Such analysis can be used to guide architects and programmers to improve parallel processing efficiency by reducing network latency with optimized network design and increasing computation proportion in the program.

查看原文本刊更多论文

多核noc上数据并行应用的加速分析

随着越来越多的计算核心被集成到单个芯片上，网络通信延迟对多核片上网络(noc)的影响越来越显著。对于数据并行应用，我们研究了在Amdahl定律中包含网络通信延迟的并行加速模型。加速分析考虑了网络拓扑结构、网络规模、流量模型和计算/通信比的影响。我们还研究了加速效率。在我们的多核NoC平台上，一个真实的数据并行应用，即矩阵乘法，被用来验证分析。理论分析和应用结果表明，加速提升是非线性的，加速效率随着系统规模的增大而降低。这种分析可以指导架构师和程序员通过优化网络设计来降低网络延迟，提高程序中的计算比例，从而提高并行处理效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE 8th International Conference on ASIC

自引率

0.00%

发文量