Optimizing network efficiency of dataflow architectures through dynamic packet merging
Yujing Feng, Han Li, Xu Tan, Xiaochun Ye, Dongrui Fan, Zhimin Tang
2018 Ninth International Green and Sustainable Computing Conference (IGSC)
Publication date: 2018-10-01
DOI: 10.1109/IGCC.2018.8752155 (https://doi.org/10.1109/IGCC.2018.8752155)
Citation count: 0
Abstract
Dataflow processors have shown unique advantages in executing high-performance computing applications, owing to their communication-exposed microarchitecture. In dataflow processors, large amounts of data are transferred directly between instructions through a network-on-chip. The efficiency of data transfer is therefore a critical performance metric that needs to be optimized in most dataflow processors. Based on the specific features of the dataflow network, we propose a mechanism for dynamically merging packets in the routers. Experiments on workloads with varying characteristics show that the average latency of data transfer is reduced by 11.8% and the performance of the dataflow accelerator is improved by 14.0%.