Cell automaton modelling algorithms: Implementation and testing in GPU systems

2011 15th IEEE International Conference on Intelligent Engineering Systems Pub Date : 2011-06-23 DOI:10.1109/INES.2011.5954741

Tamas Bajzat, E. Hajnal

引用次数: 4

Abstract

The architecture of today's video cards is able to execute up to hundreds of thousands of operation in parallel. This ability creates the possibility to solve computationally intensive tasks with minimal effort. Our research aims to investigate how to use the graphics hardware for general computing ability in biological models. In the development we have used a re-thought, and upgraded successor of the Nvidia G80 architecture, Fermi-GF104 architecture, and the associated CUDA programming environment in C/C++ language environment. After the developer machine and the test environment were complied, a general cellular automaton modelling framework was developed. It is solved partly by parallel algorithm because it calculates on matrix data structure. Several parallel algorithms and data were tested using the system. The speed of program execution was measured and the CGMA (compute to global memory access) ratio was determined. Compared to the performance of the serial execution we experienced an order of magnitude increase.

查看原文本刊更多论文

单元自动机建模算法:在GPU系统中的实现和测试

当今视频卡的架构能够并行执行多达数十万个操作。这种能力创造了以最小的努力解决计算密集型任务的可能性。我们的研究目的是探讨如何在生物模型中使用图形硬件来实现通用计算能力。在开发中，我们使用了重新思考，并升级了Nvidia G80架构的继承者，Fermi-GF104架构，以及相关的CUDA编程环境中的C/ c++语言环境。在编写了开发机和测试环境后，开发了通用元胞自动机建模框架。由于它是在矩阵数据结构上进行计算的，所以部分地采用并行算法求解。利用该系统对几种并行算法和数据进行了测试。测量了程序的执行速度，并确定了CGMA(计算与全局内存访问)的比率。与串行执行的性能相比，我们经历了一个数量级的增长。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 15th IEEE International Conference on Intelligent Engineering Systems

自引率

0.00%

发文量