有损数据压缩对高红移21cm信号功率谱的影响

IF 5.8 2区 物理与天体物理 Q1 ASTRONOMY & ASTROPHYSICS
J. K. Chege, L. V. E. Koopmans, A. R. Offringa, B. K. Gehlot, S. A. Brackenhoff, E. Ceccotti, S. Ghosh, C. Höfer, F. G. Mertens, M. Mevius, S. Munshi
{"title":"有损数据压缩对高红移21cm信号功率谱的影响","authors":"J. K. Chege, L. V. E. Koopmans, A. R. Offringa, B. K. Gehlot, S. A. Brackenhoff, E. Ceccotti, S. Ghosh, C. Höfer, F. G. Mertens, M. Mevius, S. Munshi","doi":"10.1051/0004-6361/202451367","DOIUrl":null,"url":null,"abstract":"<i>Context.<i/> Current radio interferometers output multi-petabyte-scale volumes of data per year, making the storage, transfer, and processing of these data a sizeable challenge. This challenge is expected to grow with next-generation telescopes such as the Square Kilometre Array (SKA), which will produce a considerably larger data volume than current instruments. Lossy compression of interferometric data post-correlation can abate this challenge, but any drawbacks from the compression should be well understood in advance.<i>Aims.<i/> Lossy data compression reduces the precision of data, introducing additional noise. Since high-redshift (e.g., cosmic dawn or epoch of reionization) 21 cm studies impose strict precision requirements, the impact of this effect on the 21 cm signal power spectrum statistic is investigated in a bid to rule out unwanted systematics.<i>Methods.<i/> We applied DYSCO visibility compression, a technique for normalizing and quantizing specifically designed for radio interferometric data, to observed visibilities datasets from the LOFAR telescope as well as simulated ones. The power spectrum of these data was analyzed, and we establish the level of the compression noise in the power spectrum in comparison to the thermal noise. We also examined its coherency behavior by employing the cross-coherence metric. Finally, for optimal compression results, we compared the compression noise obtained from different compression settings to a nominal 21 cm signal power.<i>Results.<i/> From a single night of observation, we find that the noise introduced due to the compression is more than five orders of magnitude lower than the thermal noise level in the power spectrum. The noise does not affect calibration. Furthermore, the noise remains subdominant to the noise introduced by the nonlinear calibration algorithm used following random parameter initialization across different runs. The compression noise shows no correlation with the sky signal and has no measurable coherent component, therefore averaging down optimally with the integration of more data. The level of compression error in the power spectrum ultimately depends on the compression settings.<i>Conclusions.<i/> DYSCO visibility compression is found to be an insignificant concern for 21 cm power spectrum studies. Hence, data volumes can be safely reduced by factors of ∼4 with insignificant bias to the final power spectrum. Data from SKA-Low will likely be compressible by the same factor as data from LOFAR owing to the similarities of the two instruments. The same technique can be used to compress data from other telescopes, but a small adjustment of the compression parameters might be required.","PeriodicalId":8571,"journal":{"name":"Astronomy & Astrophysics","volume":"3 1","pages":""},"PeriodicalIF":5.8000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The impact of lossy data compression on the power spectrum of the high-redshift 21 cm signal with LOFAR\",\"authors\":\"J. K. Chege, L. V. E. Koopmans, A. R. Offringa, B. K. Gehlot, S. A. Brackenhoff, E. Ceccotti, S. Ghosh, C. Höfer, F. G. Mertens, M. Mevius, S. Munshi\",\"doi\":\"10.1051/0004-6361/202451367\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<i>Context.<i/> Current radio interferometers output multi-petabyte-scale volumes of data per year, making the storage, transfer, and processing of these data a sizeable challenge. This challenge is expected to grow with next-generation telescopes such as the Square Kilometre Array (SKA), which will produce a considerably larger data volume than current instruments. Lossy compression of interferometric data post-correlation can abate this challenge, but any drawbacks from the compression should be well understood in advance.<i>Aims.<i/> Lossy data compression reduces the precision of data, introducing additional noise. Since high-redshift (e.g., cosmic dawn or epoch of reionization) 21 cm studies impose strict precision requirements, the impact of this effect on the 21 cm signal power spectrum statistic is investigated in a bid to rule out unwanted systematics.<i>Methods.<i/> We applied DYSCO visibility compression, a technique for normalizing and quantizing specifically designed for radio interferometric data, to observed visibilities datasets from the LOFAR telescope as well as simulated ones. The power spectrum of these data was analyzed, and we establish the level of the compression noise in the power spectrum in comparison to the thermal noise. We also examined its coherency behavior by employing the cross-coherence metric. Finally, for optimal compression results, we compared the compression noise obtained from different compression settings to a nominal 21 cm signal power.<i>Results.<i/> From a single night of observation, we find that the noise introduced due to the compression is more than five orders of magnitude lower than the thermal noise level in the power spectrum. The noise does not affect calibration. Furthermore, the noise remains subdominant to the noise introduced by the nonlinear calibration algorithm used following random parameter initialization across different runs. The compression noise shows no correlation with the sky signal and has no measurable coherent component, therefore averaging down optimally with the integration of more data. The level of compression error in the power spectrum ultimately depends on the compression settings.<i>Conclusions.<i/> DYSCO visibility compression is found to be an insignificant concern for 21 cm power spectrum studies. Hence, data volumes can be safely reduced by factors of ∼4 with insignificant bias to the final power spectrum. Data from SKA-Low will likely be compressible by the same factor as data from LOFAR owing to the similarities of the two instruments. The same technique can be used to compress data from other telescopes, but a small adjustment of the compression parameters might be required.\",\"PeriodicalId\":8571,\"journal\":{\"name\":\"Astronomy & Astrophysics\",\"volume\":\"3 1\",\"pages\":\"\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2024-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Astronomy & Astrophysics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1051/0004-6361/202451367\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astronomy & Astrophysics","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1051/0004-6361/202451367","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

摘要

背景。目前的射电干涉仪每年输出的数据量高达几千兆字节,这使得这些数据的存储、传输和处理成为一项巨大的挑战。随着平方千米阵列(SKA)等下一代望远镜的出现,这一挑战预计会越来越大,因为下一代望远镜产生的数据量将大大超过目前的仪器。对干涉测量数据进行相关后有损压缩可以减轻这一挑战,但应事先充分了解压缩带来的任何弊端。有损数据压缩会降低数据的精度,带来额外的噪声。由于高红移(如宇宙黎明或再电离纪)21 厘米研究对精度有严格要求,因此我们研究了这一影响对 21 厘米信号功率谱统计的影响,以排除不必要的系统性。我们对 LOFAR 望远镜观测到的可见度数据集和模拟数据集采用了 DYSCO 可见度压缩技术,这是一种专门为射电干涉测量数据设计的归一化和量化技术。我们对这些数据的功率谱进行了分析,并确定了功率谱中压缩噪声与热噪声的比较水平。我们还利用交叉相干度量检验了其相干性。最后,为了获得最佳压缩效果,我们将不同压缩设置获得的压缩噪声与标称的 21 厘米信号功率进行了比较。通过一个晚上的观测,我们发现压缩带来的噪声比功率谱中的热噪声低五个数量级以上。噪声不会影响校准。此外,在不同的运行中,该噪声仍然低于随机参数初始化后使用的非线性校准算法所引入的噪声。压缩噪声与天空信号没有相关性,也没有可测量的相干成分,因此随着更多数据的整合,其平均值会以最佳方式下降。功率谱的压缩误差水平最终取决于压缩设置。在 21 厘米功率谱研究中,DYSCO 可见度压缩是一个无关紧要的问题。因此,数据量可以安全地减少 ∼ 4 倍,而对最终功率谱的影响不大。由于 SKA-Low 和 LOFAR 仪器的相似性,SKA-Low 的数据可能会与 LOFAR 的数据压缩相同的系数。同样的技术也可用于压缩其他望远镜的数据,但可能需要对压缩参数稍作调整。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The impact of lossy data compression on the power spectrum of the high-redshift 21 cm signal with LOFAR
Context. Current radio interferometers output multi-petabyte-scale volumes of data per year, making the storage, transfer, and processing of these data a sizeable challenge. This challenge is expected to grow with next-generation telescopes such as the Square Kilometre Array (SKA), which will produce a considerably larger data volume than current instruments. Lossy compression of interferometric data post-correlation can abate this challenge, but any drawbacks from the compression should be well understood in advance.Aims. Lossy data compression reduces the precision of data, introducing additional noise. Since high-redshift (e.g., cosmic dawn or epoch of reionization) 21 cm studies impose strict precision requirements, the impact of this effect on the 21 cm signal power spectrum statistic is investigated in a bid to rule out unwanted systematics.Methods. We applied DYSCO visibility compression, a technique for normalizing and quantizing specifically designed for radio interferometric data, to observed visibilities datasets from the LOFAR telescope as well as simulated ones. The power spectrum of these data was analyzed, and we establish the level of the compression noise in the power spectrum in comparison to the thermal noise. We also examined its coherency behavior by employing the cross-coherence metric. Finally, for optimal compression results, we compared the compression noise obtained from different compression settings to a nominal 21 cm signal power.Results. From a single night of observation, we find that the noise introduced due to the compression is more than five orders of magnitude lower than the thermal noise level in the power spectrum. The noise does not affect calibration. Furthermore, the noise remains subdominant to the noise introduced by the nonlinear calibration algorithm used following random parameter initialization across different runs. The compression noise shows no correlation with the sky signal and has no measurable coherent component, therefore averaging down optimally with the integration of more data. The level of compression error in the power spectrum ultimately depends on the compression settings.Conclusions. DYSCO visibility compression is found to be an insignificant concern for 21 cm power spectrum studies. Hence, data volumes can be safely reduced by factors of ∼4 with insignificant bias to the final power spectrum. Data from SKA-Low will likely be compressible by the same factor as data from LOFAR owing to the similarities of the two instruments. The same technique can be used to compress data from other telescopes, but a small adjustment of the compression parameters might be required.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Astronomy & Astrophysics
Astronomy & Astrophysics 地学天文-天文与天体物理
CiteScore
10.20
自引率
27.70%
发文量
2105
审稿时长
1-2 weeks
期刊介绍: Astronomy & Astrophysics is an international Journal that publishes papers on all aspects of astronomy and astrophysics (theoretical, observational, and instrumental) independently of the techniques used to obtain the results.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信