Address interleaving for low-cost NoCs

2016 11th International Symposium on Reconfigurable Communication-centric Systems-on-Chip (ReCoSoC) Pub Date : 2016-06-27 DOI:10.1109/ReCoSoC.2016.7533892

M. Grammatikakis, Kyprianos Papademetriou, P. Petrakis, M. Coppola, Michael Soulie

{"title":"Address interleaving for low-cost NoCs","authors":"M. Grammatikakis, Kyprianos Papademetriou, P. Petrakis, M. Coppola, Michael Soulie","doi":"10.1109/ReCoSoC.2016.7533892","DOIUrl":null,"url":null,"abstract":"New generations of NoC-based platforms incorporate address interleaving, which enables balancing transactions between the memory nodes. The memory space is distributed in different nodes of the NoC, and accessed alternately by each on-chip initiator. A memory node is accessed depending on the transaction request address through a memory map. Interleaving can allow for efficient use of NoC bandwidth and congestion reduction, and we study whether its gains scale over system size. In this work we concentrate on an instance of a customizable point-to-point interconnect from STMicroelectronics called STNoC. We first evaluate a setup with 4 CPU initiators and 4 memories, and show that interleaving relieves the NoC from congestion and permits higher packet injection rates. We also show that this depends on the number of packets sent per transaction by an initiator prior to changing destination memory node; this is called interleaving step. We then enriched the setup with several DMA engines, which is in accordance with industry roadmap. We experimented with MPSoCs having up to 32-nodes and for various link-widths of the STNoC. When link-width was 32 Bytes, the aggregate throughput gain from address interleaving was 20.8%, but when we set it 8 Bytes the throughput gain reached 69.64%. This implies silicon savings in SoCs, as it is not always necessary to configure NoCs with wide link-widths.","PeriodicalId":248789,"journal":{"name":"2016 11th International Symposium on Reconfigurable Communication-centric Systems-on-Chip (ReCoSoC)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 11th International Symposium on Reconfigurable Communication-centric Systems-on-Chip (ReCoSoC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ReCoSoC.2016.7533892","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

New generations of NoC-based platforms incorporate address interleaving, which enables balancing transactions between the memory nodes. The memory space is distributed in different nodes of the NoC, and accessed alternately by each on-chip initiator. A memory node is accessed depending on the transaction request address through a memory map. Interleaving can allow for efficient use of NoC bandwidth and congestion reduction, and we study whether its gains scale over system size. In this work we concentrate on an instance of a customizable point-to-point interconnect from STMicroelectronics called STNoC. We first evaluate a setup with 4 CPU initiators and 4 memories, and show that interleaving relieves the NoC from congestion and permits higher packet injection rates. We also show that this depends on the number of packets sent per transaction by an initiator prior to changing destination memory node; this is called interleaving step. We then enriched the setup with several DMA engines, which is in accordance with industry roadmap. We experimented with MPSoCs having up to 32-nodes and for various link-widths of the STNoC. When link-width was 32 Bytes, the aggregate throughput gain from address interleaving was 20.8%, but when we set it 8 Bytes the throughput gain reached 69.64%. This implies silicon savings in SoCs, as it is not always necessary to configure NoCs with wide link-widths.

查看原文本刊更多论文

低成本noc的地址交错

新一代的基于noc的平台整合了地址交错，这使得在内存节点之间平衡事务成为可能。内存空间分布在NoC的不同节点上，由每个片上启动器交替访问。通过内存映射根据事务请求地址访问内存节点。交错可以有效地利用NoC带宽和减少拥塞，我们研究了它的收益是否随系统大小而变化。在这项工作中，我们专注于意法半导体的一个可定制的点对点互连实例，称为STNoC。我们首先评估了一个有4个CPU启动器和4个内存的设置，并表明交错可以缓解NoC的拥塞，并允许更高的数据包注入速率。我们还表明，这取决于在更改目标内存节点之前发起者发送的每个事务的数据包数量;这被称为交错步骤。然后我们用几个DMA引擎丰富了设置，这与行业路线图是一致的。我们对多达32个节点的mpsoc和STNoC的各种链路宽度进行了实验。当链路宽度为32字节时，地址交错的总吞吐量增益为20.8%，但当我们将其设置为8字节时，吞吐量增益达到69.64%。这意味着soc中的硅节省，因为并不总是需要配置具有宽链路宽度的noc。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 11th International Symposium on Reconfigurable Communication-centric Systems-on-Chip (ReCoSoC)

自引率

0.00%

发文量