G. Filippone, W. Spataro, D. D'Ambrosio, D. Spataro, D. Marocco, G. Trunfio
{"title":"加速泥石流模拟的CUDA动态活动线程列表策略","authors":"G. Filippone, W. Spataro, D. D'Ambrosio, D. Spataro, D. Marocco, G. Trunfio","doi":"10.1109/PDP.2015.103","DOIUrl":null,"url":null,"abstract":"Cellular Automata represent a formal frame for dynamical systems which evolve on the base of local interactions. We here present first results of the CUDA parallelization of the SCIDDICA S3-hex Complex Cellular Automata model for simulating debris flows. In particular, a first strategy for the parallelization of the model is based on a straightforward one thread - one cell approach, where each cell in the cellular space is computed by a CUDA thread. A second approach concerns the adoption of a list of CA computational active cells which is handled step by step by an efficient stream compaction algorithm, in order to reduce the excessive use of computationally inactive threads. First results performed on different graphic processors have shown that, by adopting the different CUDA strategies, this kind of hardware can be effective for landslide risk mitigation.","PeriodicalId":285111,"journal":{"name":"2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"CUDA Dynamic Active Thread List Strategy to Accelerate Debris Flow Simulations\",\"authors\":\"G. Filippone, W. Spataro, D. D'Ambrosio, D. Spataro, D. Marocco, G. Trunfio\",\"doi\":\"10.1109/PDP.2015.103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cellular Automata represent a formal frame for dynamical systems which evolve on the base of local interactions. We here present first results of the CUDA parallelization of the SCIDDICA S3-hex Complex Cellular Automata model for simulating debris flows. In particular, a first strategy for the parallelization of the model is based on a straightforward one thread - one cell approach, where each cell in the cellular space is computed by a CUDA thread. A second approach concerns the adoption of a list of CA computational active cells which is handled step by step by an efficient stream compaction algorithm, in order to reduce the excessive use of computationally inactive threads. First results performed on different graphic processors have shown that, by adopting the different CUDA strategies, this kind of hardware can be effective for landslide risk mitigation.\",\"PeriodicalId\":285111,\"journal\":{\"name\":\"2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-03-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PDP.2015.103\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDP.2015.103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CUDA Dynamic Active Thread List Strategy to Accelerate Debris Flow Simulations
Cellular Automata represent a formal frame for dynamical systems which evolve on the base of local interactions. We here present first results of the CUDA parallelization of the SCIDDICA S3-hex Complex Cellular Automata model for simulating debris flows. In particular, a first strategy for the parallelization of the model is based on a straightforward one thread - one cell approach, where each cell in the cellular space is computed by a CUDA thread. A second approach concerns the adoption of a list of CA computational active cells which is handled step by step by an efficient stream compaction algorithm, in order to reduce the excessive use of computationally inactive threads. First results performed on different graphic processors have shown that, by adopting the different CUDA strategies, this kind of hardware can be effective for landslide risk mitigation.