Astronomical big data processing using machine learning: A comprehensive review

IF 2.7 3区 物理与天体物理 Q2 ASTRONOMY & ASTROPHYSICS
Snigdha Sen, Sonali Agarwal, Pavan Chakraborty, Krishna Pratap Singh
{"title":"Astronomical big data processing using machine learning: A comprehensive review","authors":"Snigdha Sen,&nbsp;Sonali Agarwal,&nbsp;Pavan Chakraborty,&nbsp;Krishna Pratap Singh","doi":"10.1007/s10686-021-09827-4","DOIUrl":null,"url":null,"abstract":"<div><p>Astronomy, being one of the oldest observational sciences, has collected a lot of data over the ages. In recent times, it is experiencing a huge data surge due to advancements in telescopic technologies with automated digital outputs. The main driver behind this article is to present various relevant Machine Learning (ML) algorithms and big data frameworks or tools being applied and can be employed in large astronomical data-set analysis to assist astronomers in solving multiple vital intriguing problems. Throughout this survey, we attempt to review, evaluate and summarize diverse astronomical data sources, gain knowledge of structure, the complexity of the data, and challenges in the data processing. Additionally, we discuss ample technologies being developed to handle and process this voluminous data. We also look at numerous activities being carried out all over the world enriching this domain. While going through existing literature, we perceived a limited number of comprehensive studies reported so far analyzing astronomy data-sets from the viewpoint of parallel processing and machine learning collectively. This motivated us to pursue this extensive literature review task by outlining up-to-date contributions and opportunities available in this area. Besides, this article also discusses briefly a cloud-based machine learning approach to estimate the extra-galactic object redshifts considering photometric data as input features. As the intersection of big data, machine learning and astronomy is a quite new paradigm, this article will create a strong awareness among interested young scientists for future research and provide an appropriate insight on how these algorithms and tools are becoming inevitable to the astronomy community day by day.</p></div>","PeriodicalId":551,"journal":{"name":"Experimental Astronomy","volume":"53 1","pages":"1 - 43"},"PeriodicalIF":2.7000,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Experimental Astronomy","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1007/s10686-021-09827-4","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 19

Abstract

Astronomy, being one of the oldest observational sciences, has collected a lot of data over the ages. In recent times, it is experiencing a huge data surge due to advancements in telescopic technologies with automated digital outputs. The main driver behind this article is to present various relevant Machine Learning (ML) algorithms and big data frameworks or tools being applied and can be employed in large astronomical data-set analysis to assist astronomers in solving multiple vital intriguing problems. Throughout this survey, we attempt to review, evaluate and summarize diverse astronomical data sources, gain knowledge of structure, the complexity of the data, and challenges in the data processing. Additionally, we discuss ample technologies being developed to handle and process this voluminous data. We also look at numerous activities being carried out all over the world enriching this domain. While going through existing literature, we perceived a limited number of comprehensive studies reported so far analyzing astronomy data-sets from the viewpoint of parallel processing and machine learning collectively. This motivated us to pursue this extensive literature review task by outlining up-to-date contributions and opportunities available in this area. Besides, this article also discusses briefly a cloud-based machine learning approach to estimate the extra-galactic object redshifts considering photometric data as input features. As the intersection of big data, machine learning and astronomy is a quite new paradigm, this article will create a strong awareness among interested young scientists for future research and provide an appropriate insight on how these algorithms and tools are becoming inevitable to the astronomy community day by day.

Abstract Image

利用机器学习处理天文大数据:综述
天文学是最古老的观测科学之一,多年来收集了大量数据。近年来,由于具有自动数字输出的望远镜技术的进步,它正在经历巨大的数据激增。本文背后的主要驱动力是介绍各种相关的机器学习(ML)算法和大数据框架或工具,可以用于大型天文数据集分析,以帮助天文学家解决多个重要的有趣问题。在整个调查过程中,我们试图对各种天文数据源进行审查,评估和总结,了解数据的结构,数据的复杂性以及数据处理中的挑战。此外,我们还讨论了为处理和处理这些海量数据而开发的大量技术。我们还看到世界各地正在开展的丰富这一领域的许多活动。在查阅现有文献的同时,我们发现,迄今为止,从并行处理和机器学习的角度分析天文数据集的综合研究报告数量有限。这促使我们通过概述该领域最新的贡献和可用的机会来进行这项广泛的文献综述任务。此外,本文还简要讨论了一种基于云的机器学习方法,以光度数据作为输入特征来估计星系外物体的红移。由于大数据、机器学习和天文学的交叉是一个相当新的范式,本文将在感兴趣的年轻科学家中建立对未来研究的强烈意识,并提供适当的见解,说明这些算法和工具如何日益成为天文学界不可避免的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Experimental Astronomy
Experimental Astronomy 地学天文-天文与天体物理
CiteScore
5.30
自引率
3.30%
发文量
57
审稿时长
6-12 weeks
期刊介绍: Many new instruments for observing astronomical objects at a variety of wavelengths have been and are continually being developed. Furthermore, a vast amount of effort is being put into the development of new techniques for data analysis in order to cope with great streams of data collected by these instruments. Experimental Astronomy acts as a medium for the publication of papers of contemporary scientific interest on astrophysical instrumentation and methods necessary for the conduct of astronomy at all wavelength fields. Experimental Astronomy publishes full-length articles, research letters and reviews on developments in detection techniques, instruments, and data analysis and image processing techniques. Occasional special issues are published, giving an in-depth presentation of the instrumentation and/or analysis connected with specific projects, such as satellite experiments or ground-based telescopes, or of specialized techniques.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信