{"title":"异构体系结构(CPU+GPU)的一些规范对OpenCL三维锥束CT图像重建的重要性","authors":"T. Nouioua, A. H. Belbachir","doi":"10.46300/91011.2021.15.33","DOIUrl":null,"url":null,"abstract":"Medical imaging has found an important way for routine daily practice using cone-beam computed tomography to reconstruct a 3D volume image using the Feldkamp-Davis-Kress (FDK) algorithm. This way can minimize the patient’s time exposure to X-rays. However, its implementation is very costly in computation time, which constitutes a handicap problem in practice. For this reason, the use of acceleration methods on GPU becomes a real solution. For the acceleration of the FDK algorithm, we have used the GPU on heterogeneous platforms. To take full advantage of the GPU, we have chosen useful features of the GPUs and, we have launched the acceleration of the reconstruction according to some technical criteria, namely the work-groups and the work-items. We have found that the number of parallel cores, as well as the memory bandwidth, have no effect on runtimes speedup without being rough in the choice of the number of work-items, which represents a real challenge to master in order to be able to divide them efficiently into work-groups according to the device specifications considered as principal difficulties if we do not study technically the GPU as a hardware device. After an optimized implementation using kernels launched optimally on GPU, we have deduced that the high capacities of the devices must be chosen with a rough optimization of the work-items which are divided into several work-groups according to the hardware limitations.","PeriodicalId":53488,"journal":{"name":"International Journal of Biology and Biomedical Engineering","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Importance of Some Specifications of Heterogeneous Architectures (CPU+GPU) for 3D Cone-Beam-CT Image Reconstruction Using OpenCL\",\"authors\":\"T. Nouioua, A. H. Belbachir\",\"doi\":\"10.46300/91011.2021.15.33\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Medical imaging has found an important way for routine daily practice using cone-beam computed tomography to reconstruct a 3D volume image using the Feldkamp-Davis-Kress (FDK) algorithm. This way can minimize the patient’s time exposure to X-rays. However, its implementation is very costly in computation time, which constitutes a handicap problem in practice. For this reason, the use of acceleration methods on GPU becomes a real solution. For the acceleration of the FDK algorithm, we have used the GPU on heterogeneous platforms. To take full advantage of the GPU, we have chosen useful features of the GPUs and, we have launched the acceleration of the reconstruction according to some technical criteria, namely the work-groups and the work-items. We have found that the number of parallel cores, as well as the memory bandwidth, have no effect on runtimes speedup without being rough in the choice of the number of work-items, which represents a real challenge to master in order to be able to divide them efficiently into work-groups according to the device specifications considered as principal difficulties if we do not study technically the GPU as a hardware device. After an optimized implementation using kernels launched optimally on GPU, we have deduced that the high capacities of the devices must be chosen with a rough optimization of the work-items which are divided into several work-groups according to the hardware limitations.\",\"PeriodicalId\":53488,\"journal\":{\"name\":\"International Journal of Biology and Biomedical Engineering\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Biology and Biomedical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.46300/91011.2021.15.33\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biology and Biomedical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46300/91011.2021.15.33","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
Importance of Some Specifications of Heterogeneous Architectures (CPU+GPU) for 3D Cone-Beam-CT Image Reconstruction Using OpenCL
Medical imaging has found an important way for routine daily practice using cone-beam computed tomography to reconstruct a 3D volume image using the Feldkamp-Davis-Kress (FDK) algorithm. This way can minimize the patient’s time exposure to X-rays. However, its implementation is very costly in computation time, which constitutes a handicap problem in practice. For this reason, the use of acceleration methods on GPU becomes a real solution. For the acceleration of the FDK algorithm, we have used the GPU on heterogeneous platforms. To take full advantage of the GPU, we have chosen useful features of the GPUs and, we have launched the acceleration of the reconstruction according to some technical criteria, namely the work-groups and the work-items. We have found that the number of parallel cores, as well as the memory bandwidth, have no effect on runtimes speedup without being rough in the choice of the number of work-items, which represents a real challenge to master in order to be able to divide them efficiently into work-groups according to the device specifications considered as principal difficulties if we do not study technically the GPU as a hardware device. After an optimized implementation using kernels launched optimally on GPU, we have deduced that the high capacities of the devices must be chosen with a rough optimization of the work-items which are divided into several work-groups according to the hardware limitations.
期刊介绍:
Topics: Molecular Dynamics, Biochemistry, Biophysics, Quantum Chemistry, Molecular Biology, Cell Biology, Immunology, Neurophysiology, Genetics, Population Dynamics, Dynamics of Diseases, Bioecology, Epidemiology, Social Dynamics, PhotoBiology, PhotoChemistry, Plant Biology, Microbiology, Immunology, Bioinformatics, Signal Transduction, Environmental Systems, Psychological and Cognitive Systems, Pattern Formation, Evolution, Game Theory and Adaptive Dynamics, Bioengineering, Biotechnolgies, Medical Imaging, Medical Signal Processing, Feedback Control in Biology and Chemistry, Fluid Mechanics and Applications in Biomedicine, Space Medicine and Biology, Nuclear Biology and Medicine.