Accelerating radio astronomy imaging with RICK

IF 1.9 4区 物理与天体物理 Q2 ASTRONOMY & ASTROPHYSICS
E. De Rubeis , G. Lacopo , C. Gheller , L. Tornatore , G. Taffoni
{"title":"Accelerating radio astronomy imaging with RICK","authors":"E. De Rubeis ,&nbsp;G. Lacopo ,&nbsp;C. Gheller ,&nbsp;L. Tornatore ,&nbsp;G. Taffoni","doi":"10.1016/j.ascom.2024.100895","DOIUrl":null,"url":null,"abstract":"<div><div>This paper presents an implementation of radio astronomy imaging algorithms on modern High Performance Computing (HPC) infrastructures, exploiting distributed memory parallelism and acceleration throughout multiple GPUs. Our code, called RICK (Radio Imaging Code Kernels), is capable of performing the major steps of the <span><math><mi>w</mi></math></span>-stacking algorithm presented in Offringa et al. (2014) both inter- and intra-node, and in particular has the possibility to run entirely on the GPU memory, minimising the number of data transfers between CPU and GPU. This feature, especially among multiple GPUs, is critical given the huge sizes of radio datasets involved.</div><div>After a detailed description of the new implementations of the code with respect to the first version presented in Gheller et al. (2023), we analyse the performances of the code for each step involved in its execution. We also discuss the pros and cons related to an accelerated approach to this problem and its impact on the overall behaviour of the code. Such approach to the problem results in a significant improvement in terms of runtime with respect to the CPU version of the code, as long as the amount of computational resources does not exceed the one requested by the size of the problem: the code, in fact, is now limited by the communication costs, with the computation that gets heavily reduced by the capabilities of the accelerators.</div></div>","PeriodicalId":48757,"journal":{"name":"Astronomy and Computing","volume":"50 ","pages":"Article 100895"},"PeriodicalIF":1.9000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astronomy and Computing","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2213133724001100","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

This paper presents an implementation of radio astronomy imaging algorithms on modern High Performance Computing (HPC) infrastructures, exploiting distributed memory parallelism and acceleration throughout multiple GPUs. Our code, called RICK (Radio Imaging Code Kernels), is capable of performing the major steps of the w-stacking algorithm presented in Offringa et al. (2014) both inter- and intra-node, and in particular has the possibility to run entirely on the GPU memory, minimising the number of data transfers between CPU and GPU. This feature, especially among multiple GPUs, is critical given the huge sizes of radio datasets involved.
After a detailed description of the new implementations of the code with respect to the first version presented in Gheller et al. (2023), we analyse the performances of the code for each step involved in its execution. We also discuss the pros and cons related to an accelerated approach to this problem and its impact on the overall behaviour of the code. Such approach to the problem results in a significant improvement in terms of runtime with respect to the CPU version of the code, as long as the amount of computational resources does not exceed the one requested by the size of the problem: the code, in fact, is now limited by the communication costs, with the computation that gets heavily reduced by the capabilities of the accelerators.
利用 RICK 加速射电天文学成像
本文介绍了射电天文学成像算法在现代高性能计算(HPC)基础设施上的实现,利用了分布式内存并行性和多 GPU 加速。我们的代码名为RICK(射电成像代码内核),能够在节点间和节点内执行Offringa等人(2014年)提出的w-stacking算法的主要步骤,特别是可以完全在GPU内存上运行,最大限度地减少CPU和GPU之间的数据传输次数。在详细描述了与 Gheller 等人(2023 年)所介绍的第一版代码相比的新代码实现之后,我们分析了代码执行过程中每个步骤的性能。我们还讨论了与加速处理该问题相关的利弊及其对代码整体行为的影响。与 CPU 版本的代码相比,只要计算资源量不超过问题规模所需的资源量,采用这种方法处理问题就能显著改善代码的运行时间:事实上,代码现在受到通信成本的限制,而计算量则因加速器的能力而大大减少。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Astronomy and Computing
Astronomy and Computing ASTRONOMY & ASTROPHYSICSCOMPUTER SCIENCE,-COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
CiteScore
4.10
自引率
8.00%
发文量
67
期刊介绍: Astronomy and Computing is a peer-reviewed journal that focuses on the broad area between astronomy, computer science and information technology. The journal aims to publish the work of scientists and (software) engineers in all aspects of astronomical computing, including the collection, analysis, reduction, visualisation, preservation and dissemination of data, and the development of astronomical software and simulations. The journal covers applications for academic computer science techniques to astronomy, as well as novel applications of information technologies within astronomy.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信