{"title":"OpenCL GPU物理光学100倍加速","authors":"A. Densmore, Y. Rahmat-Samii","doi":"10.1109/APS.2014.6905136","DOIUrl":null,"url":null,"abstract":"The application of the graphics processor to computing antenna radiation patterns provides a substantial acceleration in speed. The same brute force physical optics (PO) code; e.g., a symmetric single reflector of two hundred forty wavelengths diameter, that runs single-threaded C/C++ on an Intel Core 2 Q8200 CPU and compiled to be optimized for speed, runs one hundred times faster when ported to OpenCL and run on an AMD HD5850 GPU hosted by the CPU system. Porting the code to OpenCL involves opening the processing loops to be run in parallel and some additional code to host the GPU in the CPU system. The ability to accelerate antenna pattern computations by a factor of one hundred provides real substantial benefits when many patterns need to be computed repeatedly, such as when optimizing a reflector system.","PeriodicalId":6663,"journal":{"name":"2014 IEEE Antennas and Propagation Society International Symposium (APSURSI)","volume":"7 1","pages":"1620-1621"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"OpenCL GPU 100× acceleration of physical optics\",\"authors\":\"A. Densmore, Y. Rahmat-Samii\",\"doi\":\"10.1109/APS.2014.6905136\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The application of the graphics processor to computing antenna radiation patterns provides a substantial acceleration in speed. The same brute force physical optics (PO) code; e.g., a symmetric single reflector of two hundred forty wavelengths diameter, that runs single-threaded C/C++ on an Intel Core 2 Q8200 CPU and compiled to be optimized for speed, runs one hundred times faster when ported to OpenCL and run on an AMD HD5850 GPU hosted by the CPU system. Porting the code to OpenCL involves opening the processing loops to be run in parallel and some additional code to host the GPU in the CPU system. The ability to accelerate antenna pattern computations by a factor of one hundred provides real substantial benefits when many patterns need to be computed repeatedly, such as when optimizing a reflector system.\",\"PeriodicalId\":6663,\"journal\":{\"name\":\"2014 IEEE Antennas and Propagation Society International Symposium (APSURSI)\",\"volume\":\"7 1\",\"pages\":\"1620-1621\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IEEE Antennas and Propagation Society International Symposium (APSURSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APS.2014.6905136\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Antennas and Propagation Society International Symposium (APSURSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APS.2014.6905136","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The application of the graphics processor to computing antenna radiation patterns provides a substantial acceleration in speed. The same brute force physical optics (PO) code; e.g., a symmetric single reflector of two hundred forty wavelengths diameter, that runs single-threaded C/C++ on an Intel Core 2 Q8200 CPU and compiled to be optimized for speed, runs one hundred times faster when ported to OpenCL and run on an AMD HD5850 GPU hosted by the CPU system. Porting the code to OpenCL involves opening the processing loops to be run in parallel and some additional code to host the GPU in the CPU system. The ability to accelerate antenna pattern computations by a factor of one hundred provides real substantial benefits when many patterns need to be computed repeatedly, such as when optimizing a reflector system.