{"title":"Scalable multi-core sonar beamforming with Computational Process Networks","authors":"J. Bridgman, G. E. Allen, B. Evans","doi":"10.1109/ACSSC.2010.5757732","DOIUrl":null,"url":null,"abstract":"This paper evaluates the scalability with respect to processor cores of a three-dimensional sonar beamforming kernel implemented on a multi-core workstation. Beamforming is an example of an extremely parallelizable problem. This implementation is instrumented with OpenMP to exploit multi-core computer systems. However, when executed on a 16-core machine, this kernel scales much less than expected. We implement this beamformer system within the scalable framework of Computational Process Networks to achieve additional performance and processor utilization for a larger number of cores. On our benchmark machine, the implementation with Computational Process Networks obtains a throughput speedup of more than two times over OpenMP with the default settings, and 13% improvement in throughput over OpenMP with optimized settings.","PeriodicalId":170947,"journal":{"name":"2010 Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACSSC.2010.5757732","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper evaluates the scalability with respect to processor cores of a three-dimensional sonar beamforming kernel implemented on a multi-core workstation. Beamforming is an example of an extremely parallelizable problem. This implementation is instrumented with OpenMP to exploit multi-core computer systems. However, when executed on a 16-core machine, this kernel scales much less than expected. We implement this beamformer system within the scalable framework of Computational Process Networks to achieve additional performance and processor utilization for a larger number of cores. On our benchmark machine, the implementation with Computational Process Networks obtains a throughput speedup of more than two times over OpenMP with the default settings, and 13% improvement in throughput over OpenMP with optimized settings.
本文评估了在多核工作站上实现的三维声纳波束形成内核在处理器内核方面的可扩展性。波束形成是一个极端并行问题的例子。该实现使用OpenMP来利用多核计算机系统。然而,当在16核机器上执行时,这个内核的可伸缩性远远小于预期。我们在计算过程网络的可扩展框架内实现该波束形成系统,以实现更多内核的额外性能和处理器利用率。在我们的基准机器上,使用Computational Process Networks的实现比使用默认设置的OpenMP获得了两倍以上的吞吐量加速,并且比使用优化设置的OpenMP提高了13%的吞吐量。