{"title":"Data Communication Analysis and Algorithm for the Efficiency of Data Distribution in High Performance Computing","authors":"Jinchuan Ke, M. Jiang","doi":"10.1109/ICACTE.2008.157","DOIUrl":null,"url":null,"abstract":"To alleviate the data transfer communication cost among the processor elements, many of the hardware interconnects enable data transfer to be performed at the same time as computation. The assignment of tasks to processors problem is well known to be NP-complete except in a few special cases. To improve the overall performance of high performance computing (HPC), this paper develops a scheme of HPC code generator and presents a data partitioning algorithm for the efficiency of data distribution. The algorithm generates a efficient data partitions, from which the optimized assignments can be selected for reducing the processor element communications, and the complexity is greatly reduced from exponential to polynomial. The algorithm is tested and integrated in HPC tools running on CRAY-T3E, YMP, IBM Regatta, and SGI workstation.","PeriodicalId":364568,"journal":{"name":"2008 International Conference on Advanced Computer Theory and Engineering","volume":"19 S6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Conference on Advanced Computer Theory and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACTE.2008.157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
To alleviate the data transfer communication cost among the processor elements, many of the hardware interconnects enable data transfer to be performed at the same time as computation. The assignment of tasks to processors problem is well known to be NP-complete except in a few special cases. To improve the overall performance of high performance computing (HPC), this paper develops a scheme of HPC code generator and presents a data partitioning algorithm for the efficiency of data distribution. The algorithm generates a efficient data partitions, from which the optimized assignments can be selected for reducing the processor element communications, and the complexity is greatly reduced from exponential to polynomial. The algorithm is tested and integrated in HPC tools running on CRAY-T3E, YMP, IBM Regatta, and SGI workstation.