Low-frequency Radio Interferometric Array Imaging Pipeline Optimization Based on Distributed Execution Framework
Authors: WEI Yao-jie, FU Jie-lin, LAO Bao-qiang
Journal: Chinese Astronomy and Astrophysics, Volume 48, Issue 2, Pages 389-412 (JCR Q4, Physics and Astronomy)
Publication date: 2024-04-01
Publication type: Journal Article
DOI: 10.1016/j.chinastron.2024.05.008
URL: https://www.sciencedirect.com/science/article/pii/S0275106224000341
Open access: No
Citations: 0
Abstract
The Square Kilometre Array (SKA) project is an international collaboration to build the world’s largest radio telescope, whose sensitivity and measurement speed will be an order of magnitude higher than those of all current radio telescopes. The radio continuum survey is one of the main observation modes of the SKA, and a standard map of the survey area built from continuum imaging will provide an important foundation for subsequent astronomical science. The GaLactic and Extragalactic All-sky Murchison Widefield Array survey eXtended (GLEAM-X) is a new radio continuum survey project of the Murchison Widefield Array (MWA), an SKA precursor telescope, using the MWA Phase II extended array over 2018-2020. Experience gained from optimizing its imaging pipeline on a distributed execution framework will help to solve the problem of massive data processing. In this paper, we describe the processing steps of the GLEAM-X imaging pipeline, integrate and improve it, and run multiple pipelines in parallel on the China SKA Regional Centre Prototype (CSRC-P). GLEAM-X observation data were used to validate the deployment of the imaging pipeline system and to test its correctness. Then, to optimize the pipelines and improve processing efficiency, the imaging pipelines were integrated into the Data Activated Liu Graph Engine (DALiuGE) execution framework, which automates their distributed parallel processing. Performance tests and analysis of the results show that the optimized imaging pipeline based on the DALiuGE execution framework has better performance, more flexible adaptability, and better scalability than the traditional parallel approach, and can support future large-scale continuum imaging experiments during the first phase of SKA commissioning.
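The idea of running several per-observation pipelines in parallel, with each step starting as soon as its input data exist, can be illustrated with a small toy scheduler. This is only a sketch under stated assumptions: it uses the Python standard library rather than DALiuGE, it is not the authors' implementation, and the stage names (`calibrate`, `image`, `mosaic`) and observation IDs are hypothetical placeholders, not steps taken from the paper.

```python
from concurrent.futures import ThreadPoolExecutor, wait, FIRST_COMPLETED


def run_graph(tasks, deps, max_workers=4):
    """Run a task graph; a task is submitted once all of its inputs exist.

    tasks: name -> callable taking a dict {dependency name: result}
    deps:  name -> list of dependency names
    """
    results = {}          # finished task name -> its output
    pending = set(tasks)  # tasks not yet submitted
    running = {}          # future -> task name
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        while pending or running:
            # A task becomes ready when every dependency has produced data.
            ready = [n for n in pending
                     if all(d in results for d in deps.get(n, []))]
            for name in ready:
                pending.remove(name)
                inputs = {d: results[d] for d in deps.get(name, [])}
                running[pool.submit(tasks[name], inputs)] = name
            if not running:
                raise ValueError("cycle or missing dependency in graph")
            done, _ = wait(running, return_when=FIRST_COMPLETED)
            for fut in done:
                results[running.pop(fut)] = fut.result()
    return results


def build_graph(obs_ids):
    """Build one hypothetical calibrate -> image chain per observation,
    followed by a single mosaic step that needs every image."""
    tasks, deps = {}, {}
    for obs in obs_ids:
        cal, img = f"{obs}/calibrate", f"{obs}/image"
        tasks[cal] = lambda inputs, obs=obs: f"cal({obs})"
        deps[cal] = []
        tasks[img] = lambda inputs, cal=cal: f"img({inputs[cal]})"
        deps[img] = [cal]
    tasks["mosaic"] = lambda inputs: "+".join(inputs[k] for k in sorted(inputs))
    deps["mosaic"] = [f"{obs}/image" for obs in obs_ids]
    return tasks, deps


if __name__ == "__main__":
    tasks, deps = build_graph(["obs1", "obs2", "obs3"])  # placeholder IDs
    print(run_graph(tasks, deps)["mosaic"])
```

In this sketch the chains for different observations share no dependencies, so they proceed concurrently, while the final step waits for all of them. A real framework would additionally distribute tasks across nodes and manage the data products themselves, which this toy does not attempt.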
About the journal:
The vigorous growth of astronomical and astrophysical science in China led to an increase in papers on astrophysics that Acta Astronomica Sinica could no longer absorb. Translations of papers from two new journals, the Chinese Journal of Space Science and Acta Astrophysica Sinica, were added to the translation of Acta Astronomica Sinica to form the new journal Chinese Astronomy and Astrophysics. Chinese Astronomy and Astrophysics brings English translations of notable articles to astronomers and astrophysicists outside China.