Parallel stepwise stochastic simulation: harnessing GPUs to explore possible futures states of a chromosome folding model thanks to the possible futures algorithm (PFA)

Proceedings of the 1st ACM SIGSIM Conference on Principles of Advanced Discrete Simulation Pub Date : 2013-05-19 DOI:10.1145/2486092.2486114

Jonathan Passerat-Palmbach, Jonathan Caux, Y. Pennec, R. Reuillon, I. Junier, F. Képès, D. Hill

{"title":"Parallel stepwise stochastic simulation: harnessing GPUs to explore possible futures states of a chromosome folding model thanks to the possible futures algorithm (PFA)","authors":"Jonathan Passerat-Palmbach, Jonathan Caux, Y. Pennec, R. Reuillon, I. Junier, F. Képès, D. Hill","doi":"10.1145/2486092.2486114","DOIUrl":null,"url":null,"abstract":"For the sake of software compatibility, simulations are often parallelized without much code rewriting. Performances can be further improved by optimizing codes so that to use the maximum power offered by parallel architectures. While this approach can provide some speed-up, performance of parallelized codes can be strongly limited a priori because traditional algorithms have been designed for sequential technologies. Thus, additional increase of performance should ultimately rely on some redesign of algorithms. Here, we redesign an algorithm that has traditionally been used to simulate the folding properties of polymers. We address the issue of performance in the context of biological applications, more particularly in the active field of chromosome modelling. Due to the strong confinement of chromosomes in the cells, simulation of their motion is slowed down by the laborious search for the next valid states to progress. Our redesign, that we call the Possible Futures Algorithm (PFA), relies on the parallel computation of possible evolutions of the same state, which effectively increases the probability to obtain a valid state at each step. We apply PFA on a GPU-based architecture, allowing us to optimally reduce the latency induced by the computation overhead of possible futures. We show that compared to the initial sequential model the acceptance rate of new states significantly increases without impacting the execution time. In particular, the stronger the confinement of the chromosome, the more efficient PFA becomes, making our approach appealing for biological applications. While most of our results were obtained using Fermi architecture GPUs from NVIDIA, we highlight improved performance on the cutting-edge Kepler architecture K20 GPUs.","PeriodicalId":115341,"journal":{"name":"Proceedings of the 1st ACM SIGSIM Conference on Principles of Advanced Discrete Simulation","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st ACM SIGSIM Conference on Principles of Advanced Discrete Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2486092.2486114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

For the sake of software compatibility, simulations are often parallelized without much code rewriting. Performances can be further improved by optimizing codes so that to use the maximum power offered by parallel architectures. While this approach can provide some speed-up, performance of parallelized codes can be strongly limited a priori because traditional algorithms have been designed for sequential technologies. Thus, additional increase of performance should ultimately rely on some redesign of algorithms. Here, we redesign an algorithm that has traditionally been used to simulate the folding properties of polymers. We address the issue of performance in the context of biological applications, more particularly in the active field of chromosome modelling. Due to the strong confinement of chromosomes in the cells, simulation of their motion is slowed down by the laborious search for the next valid states to progress. Our redesign, that we call the Possible Futures Algorithm (PFA), relies on the parallel computation of possible evolutions of the same state, which effectively increases the probability to obtain a valid state at each step. We apply PFA on a GPU-based architecture, allowing us to optimally reduce the latency induced by the computation overhead of possible futures. We show that compared to the initial sequential model the acceptance rate of new states significantly increases without impacting the execution time. In particular, the stronger the confinement of the chromosome, the more efficient PFA becomes, making our approach appealing for biological applications. While most of our results were obtained using Fermi architecture GPUs from NVIDIA, we highlight improved performance on the cutting-edge Kepler architecture K20 GPUs.

查看原文本刊更多论文

并行逐步随机模拟:利用可能未来算法(PFA)利用gpu探索染色体折叠模型的可能未来状态

为了软件的兼容性，模拟通常是并行的，而不需要重写太多的代码。通过优化代码，可以进一步提高性能，从而使用并行架构提供的最大功率。虽然这种方法可以提供一些加速，但并行代码的性能可能会受到先天的严重限制，因为传统算法是为顺序技术设计的。因此，性能的额外提高最终应该依赖于对算法的重新设计。在这里，我们重新设计了一种传统上用于模拟聚合物折叠特性的算法。我们在生物学应用的背景下解决性能问题，特别是在染色体建模的活跃领域。由于染色体在细胞中的强烈限制，它们的运动模拟由于费力地寻找下一个有效状态而减慢了速度。我们的重新设计，我们称之为可能未来算法(PFA)，依赖于同一状态的可能演化的并行计算，这有效地增加了在每一步获得有效状态的概率。我们在基于gpu的架构上应用PFA，使我们能够最佳地减少由可能的未来的计算开销引起的延迟。我们表明，与初始顺序模型相比，新状态的接受率显著提高，而不会影响执行时间。特别是，染色体的约束越强，PFA的效率就越高，这使得我们的方法对生物学应用具有吸引力。虽然我们的大部分结果是使用NVIDIA的费米架构gpu获得的，但我们强调了尖端的开普勒架构K20 gpu的性能改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 1st ACM SIGSIM Conference on Principles of Advanced Discrete Simulation

自引率

0.00%

发文量