Cell automaton modelling algorithms: Implementation and testing in GPU systems

Tamas Bajzat, E. Hajnal
{"title":"Cell automaton modelling algorithms: Implementation and testing in GPU systems","authors":"Tamas Bajzat, E. Hajnal","doi":"10.1109/INES.2011.5954741","DOIUrl":null,"url":null,"abstract":"The architecture of today's video cards is able to execute up to hundreds of thousands of operation in parallel. This ability creates the possibility to solve computationally intensive tasks with minimal effort. Our research aims to investigate how to use the graphics hardware for general computing ability in biological models. In the development we have used a re-thought, and upgraded successor of the Nvidia G80 architecture, Fermi-GF104 architecture, and the associated CUDA programming environment in C/C++ language environment. After the developer machine and the test environment were complied, a general cellular automaton modelling framework was developed. It is solved partly by parallel algorithm because it calculates on matrix data structure. Several parallel algorithms and data were tested using the system. The speed of program execution was measured and the CGMA (compute to global memory access) ratio was determined. Compared to the performance of the serial execution we experienced an order of magnitude increase.","PeriodicalId":414812,"journal":{"name":"2011 15th IEEE International Conference on Intelligent Engineering Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 15th IEEE International Conference on Intelligent Engineering Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INES.2011.5954741","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The architecture of today's video cards is able to execute up to hundreds of thousands of operation in parallel. This ability creates the possibility to solve computationally intensive tasks with minimal effort. Our research aims to investigate how to use the graphics hardware for general computing ability in biological models. In the development we have used a re-thought, and upgraded successor of the Nvidia G80 architecture, Fermi-GF104 architecture, and the associated CUDA programming environment in C/C++ language environment. After the developer machine and the test environment were complied, a general cellular automaton modelling framework was developed. It is solved partly by parallel algorithm because it calculates on matrix data structure. Several parallel algorithms and data were tested using the system. The speed of program execution was measured and the CGMA (compute to global memory access) ratio was determined. Compared to the performance of the serial execution we experienced an order of magnitude increase.
单元自动机建模算法:在GPU系统中的实现和测试
当今视频卡的架构能够并行执行多达数十万个操作。这种能力创造了以最小的努力解决计算密集型任务的可能性。我们的研究目的是探讨如何在生物模型中使用图形硬件来实现通用计算能力。在开发中,我们使用了重新思考,并升级了Nvidia G80架构的继承者,Fermi-GF104架构,以及相关的CUDA编程环境中的C/ c++语言环境。在编写了开发机和测试环境后,开发了通用元胞自动机建模框架。由于它是在矩阵数据结构上进行计算的,所以部分地采用并行算法求解。利用该系统对几种并行算法和数据进行了测试。测量了程序的执行速度,并确定了CGMA(计算与全局内存访问)的比率。与串行执行的性能相比,我们经历了一个数量级的增长。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信