{"title":"Implementation of the CA-CFAR algorithm for pulsed-Doppler radar on a GPU architecture","authors":"C. J. Venter, H. Grobler, K. Almalki","doi":"10.1109/AEECT.2011.6132514","DOIUrl":null,"url":null,"abstract":"The Cell-Averaging Constant False-Alarm Rate (CA-CFAR) algorithm was implemented and optimized in software on the NVIDIA Tesla C1060 GPU architecture for application in pulsed-Doppler radar signal processors. A systematic approach was followed to gradually explore opportunities for parallel execution and optimization by implementing the algorithm first in MATLAB (CPU), followed by native C (CPU) and finally NVIDIA CUDA (GPU) environments. Three techniques for implementing the CA-CFAR in software were identified and implemented, namely a na¨ıve technique, sliding window technique and a new variant which employs the Summed-Area Table (SAT) algorithm. The na¨ıve technique performed best on the GPU architecture. The SAT technique shows potential, especially for cases where very large CFAR windows are required. However, the results do not justify using the GPU architecture instead of the CPU architecture for this application when data transfer to and from the GPU is taken into consideration.","PeriodicalId":408446,"journal":{"name":"2011 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEECT.2011.6132514","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
The Cell-Averaging Constant False-Alarm Rate (CA-CFAR) algorithm was implemented and optimized in software on the NVIDIA Tesla C1060 GPU architecture for application in pulsed-Doppler radar signal processors. A systematic approach was followed to gradually explore opportunities for parallel execution and optimization by implementing the algorithm first in MATLAB (CPU), followed by native C (CPU) and finally NVIDIA CUDA (GPU) environments. Three techniques for implementing the CA-CFAR in software were identified and implemented, namely a na¨ıve technique, sliding window technique and a new variant which employs the Summed-Area Table (SAT) algorithm. The na¨ıve technique performed best on the GPU architecture. The SAT technique shows potential, especially for cases where very large CFAR windows are required. However, the results do not justify using the GPU architecture instead of the CPU architecture for this application when data transfer to and from the GPU is taken into consideration.
为了在脉冲多普勒雷达信号处理器中应用,在NVIDIA Tesla C1060 GPU架构上对CA-CFAR算法进行了软件实现和优化。通过首先在MATLAB (CPU)中实现算法,然后在本机C (CPU)中实现算法,最后在NVIDIA CUDA (GPU)环境中实现算法,采用系统的方法逐步探索并行执行和优化的机会。确定并实施了在软件中实现CA-CFAR的三种技术,即na¨ıve技术、滑动窗口技术和采用sum - area Table (SAT)算法的新变体。na¨ıve技术在GPU架构上表现最好。SAT技术显示出潜力,特别是在需要非常大的CFAR窗口的情况下。然而,当考虑到从GPU传输数据时,结果并不能证明使用GPU架构而不是CPU架构用于此应用程序。