{"title":"Cell automaton modelling algorithms: Implementation and testing in GPU systems","authors":"Tamas Bajzat, E. Hajnal","doi":"10.1109/INES.2011.5954741","DOIUrl":null,"url":null,"abstract":"The architecture of today's video cards is able to execute up to hundreds of thousands of operation in parallel. This ability creates the possibility to solve computationally intensive tasks with minimal effort. Our research aims to investigate how to use the graphics hardware for general computing ability in biological models. In the development we have used a re-thought, and upgraded successor of the Nvidia G80 architecture, Fermi-GF104 architecture, and the associated CUDA programming environment in C/C++ language environment. After the developer machine and the test environment were complied, a general cellular automaton modelling framework was developed. It is solved partly by parallel algorithm because it calculates on matrix data structure. Several parallel algorithms and data were tested using the system. The speed of program execution was measured and the CGMA (compute to global memory access) ratio was determined. Compared to the performance of the serial execution we experienced an order of magnitude increase.","PeriodicalId":414812,"journal":{"name":"2011 15th IEEE International Conference on Intelligent Engineering Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 15th IEEE International Conference on Intelligent Engineering Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INES.2011.5954741","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The architecture of today's video cards is able to execute up to hundreds of thousands of operation in parallel. This ability creates the possibility to solve computationally intensive tasks with minimal effort. Our research aims to investigate how to use the graphics hardware for general computing ability in biological models. In the development we have used a re-thought, and upgraded successor of the Nvidia G80 architecture, Fermi-GF104 architecture, and the associated CUDA programming environment in C/C++ language environment. After the developer machine and the test environment were complied, a general cellular automaton modelling framework was developed. It is solved partly by parallel algorithm because it calculates on matrix data structure. Several parallel algorithms and data were tested using the system. The speed of program execution was measured and the CGMA (compute to global memory access) ratio was determined. Compared to the performance of the serial execution we experienced an order of magnitude increase.