{"title":"具有厚重叠层的并行细胞内粒子模拟的低成本负载平衡","authors":"Y. Miyake, H. Nakashima","doi":"10.1109/TrustCom.2013.134","DOIUrl":null,"url":null,"abstract":"This paper describes a parallel implementation of our practical particle-in-cell (PIC) simulator with the OhHelp dynamic load-balancing algorithm. Although the code parallelization is based on simple block domain decomposition, OhHelp accomplishes load balancing and thus the scalability in terms of the number of particles by making each computation node help another heavily loaded node. In addition to the OhHelp application, a number of additional layers overlapping with adjacent domains are newly introduced outside the boundaries of each subdomain for the purpose of minimizing overhead costs of OhHelp. The optimization can drastically reduce overhead costs for particle transfer among nodes, whereas it leads to increase in domain size which each node is responsible for. Despite this trade-off feature, the overlapping layer attachment and a further lower-level optimization exert 1.8-fold improvement of the PIC simulator performance. Consequently, the optimized simulator exhibits a good scalability and a stable efficiency in parallel executions using up to 4096 cores, showing small parallel efficiency degradation of 3% from 16- to 4096-core parallel executions.","PeriodicalId":206739,"journal":{"name":"2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Low-Cost Load Balancing for Parallel Particle-in-Cell Simulations with Thick Overlapping Layers\",\"authors\":\"Y. Miyake, H. Nakashima\",\"doi\":\"10.1109/TrustCom.2013.134\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a parallel implementation of our practical particle-in-cell (PIC) simulator with the OhHelp dynamic load-balancing algorithm. Although the code parallelization is based on simple block domain decomposition, OhHelp accomplishes load balancing and thus the scalability in terms of the number of particles by making each computation node help another heavily loaded node. In addition to the OhHelp application, a number of additional layers overlapping with adjacent domains are newly introduced outside the boundaries of each subdomain for the purpose of minimizing overhead costs of OhHelp. The optimization can drastically reduce overhead costs for particle transfer among nodes, whereas it leads to increase in domain size which each node is responsible for. Despite this trade-off feature, the overlapping layer attachment and a further lower-level optimization exert 1.8-fold improvement of the PIC simulator performance. Consequently, the optimized simulator exhibits a good scalability and a stable efficiency in parallel executions using up to 4096 cores, showing small parallel efficiency degradation of 3% from 16- to 4096-core parallel executions.\",\"PeriodicalId\":206739,\"journal\":{\"name\":\"2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TrustCom.2013.134\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TrustCom.2013.134","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Low-Cost Load Balancing for Parallel Particle-in-Cell Simulations with Thick Overlapping Layers
This paper describes a parallel implementation of our practical particle-in-cell (PIC) simulator with the OhHelp dynamic load-balancing algorithm. Although the code parallelization is based on simple block domain decomposition, OhHelp accomplishes load balancing and thus the scalability in terms of the number of particles by making each computation node help another heavily loaded node. In addition to the OhHelp application, a number of additional layers overlapping with adjacent domains are newly introduced outside the boundaries of each subdomain for the purpose of minimizing overhead costs of OhHelp. The optimization can drastically reduce overhead costs for particle transfer among nodes, whereas it leads to increase in domain size which each node is responsible for. Despite this trade-off feature, the overlapping layer attachment and a further lower-level optimization exert 1.8-fold improvement of the PIC simulator performance. Consequently, the optimized simulator exhibits a good scalability and a stable efficiency in parallel executions using up to 4096 cores, showing small parallel efficiency degradation of 3% from 16- to 4096-core parallel executions.