Redshift-Agnostic Machine Learning Classification: Unveiling Peak Performance in Galaxy, Star, and Quasar Classification (Using SDSS DR17)

IF 1 4区 物理与天体物理 Q3 ASTRONOMY & ASTROPHYSICS
Debashis Chatterjee, Prithwish Ghosh
{"title":"Redshift-Agnostic Machine Learning Classification: Unveiling Peak Performance in Galaxy, Star, and Quasar Classification (Using SDSS DR17)","authors":"Debashis Chatterjee,&nbsp;Prithwish Ghosh","doi":"10.1002/asna.20240057","DOIUrl":null,"url":null,"abstract":"<p>Classification of galaxies, stars, and quasars using spectral data is fundamental to astronomy, but often relies heavily on redshift. This study evaluates the performance of 10 machine learning algorithms on SDSS data to classify these objects, with a particular focus on scenarios where redshift information is unavailable. Leveraging features such as “z,” “u,” “g,” “r,” “i,” and redshift, we assess the accuracy of various algorithms, including XGBoost, Random Forest, and recurrent neural networks (RNNs). Our analysis demonstrates the superior accuracy of the Random Forest classifier when redshift is included. The feature importance analysis reveals that “redshift” is the most critical feature, contributing 64.7% to the classification accuracy, followed by the “z” band (10.0%) and the “g” band (7.95%). However, even in the absence of redshift, XGBoost, Random Forest, and RNNs exhibit promising results, indicating the potential of photometric data for accurate classification. We systematically compare classification outcomes with and without redshift, revealing the relative importance of different features and identifying the most robust classifiers for redshift-limited scenarios. This research not only highlights the power of machine learning for astronomical classification but also provides a framework for reliable classification when redshift data is lacking. By uncovering the distinguishing spectral characteristics of galaxies, stars, and quasars that are independent of redshift, we open new avenues for efficient and accurate classification in large-scale photometric surveys and the study of faint, high-redshift objects.</p>","PeriodicalId":55442,"journal":{"name":"Astronomische Nachrichten","volume":"346 5","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asna.20240057","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astronomische Nachrichten","FirstCategoryId":"101","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/asna.20240057","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

Classification of galaxies, stars, and quasars using spectral data is fundamental to astronomy, but often relies heavily on redshift. This study evaluates the performance of 10 machine learning algorithms on SDSS data to classify these objects, with a particular focus on scenarios where redshift information is unavailable. Leveraging features such as “z,” “u,” “g,” “r,” “i,” and redshift, we assess the accuracy of various algorithms, including XGBoost, Random Forest, and recurrent neural networks (RNNs). Our analysis demonstrates the superior accuracy of the Random Forest classifier when redshift is included. The feature importance analysis reveals that “redshift” is the most critical feature, contributing 64.7% to the classification accuracy, followed by the “z” band (10.0%) and the “g” band (7.95%). However, even in the absence of redshift, XGBoost, Random Forest, and RNNs exhibit promising results, indicating the potential of photometric data for accurate classification. We systematically compare classification outcomes with and without redshift, revealing the relative importance of different features and identifying the most robust classifiers for redshift-limited scenarios. This research not only highlights the power of machine learning for astronomical classification but also provides a framework for reliable classification when redshift data is lacking. By uncovering the distinguishing spectral characteristics of galaxies, stars, and quasars that are independent of redshift, we open new avenues for efficient and accurate classification in large-scale photometric surveys and the study of faint, high-redshift objects.

Abstract Image

红移不可知机器学习分类:揭示银河系、恒星和类星体分类中的峰值性能(使用SDSS DR17)
利用光谱数据对星系、恒星和类星体进行分类是天文学的基础,但往往严重依赖于红移。本研究评估了10种机器学习算法在SDSS数据上的性能,以对这些对象进行分类,特别关注红移信息不可用的场景。利用“z”、“u”、“g”、“r”、“i”和红移等特征,我们评估了各种算法的准确性,包括XGBoost、随机森林和循环神经网络(rnn)。我们的分析表明,当考虑红移时,随机森林分类器的精度更高。特征重要性分析显示,“红移”是最关键的特征,对分类准确率的贡献率为64.7%,其次是“z”波段(10.0%)和“g”波段(7.95%)。然而,即使在没有红移的情况下,XGBoost、Random Forest和rnn也表现出了很好的结果,这表明了光度数据在准确分类方面的潜力。我们系统地比较了有红移和没有红移的分类结果,揭示了不同特征的相对重要性,并确定了红移受限场景下最鲁棒的分类器。这项研究不仅突出了机器学习在天文分类中的力量,而且在缺乏红移数据的情况下,为可靠的分类提供了一个框架。通过揭示独立于红移的星系、恒星和类星体的独特光谱特征,我们为大规模光度调查和研究微弱、高红移天体的有效和准确分类开辟了新的途径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Astronomische Nachrichten
Astronomische Nachrichten 地学天文-天文与天体物理
CiteScore
1.80
自引率
11.10%
发文量
57
审稿时长
4-8 weeks
期刊介绍: Astronomische Nachrichten, founded in 1821 by H. C. Schumacher, is the oldest astronomical journal worldwide still being published. Famous astronomical discoveries and important papers on astronomy and astrophysics published in more than 300 volumes of the journal give an outstanding representation of the progress of astronomical research over the last 180 years. Today, Astronomical Notes/ Astronomische Nachrichten publishes articles in the field of observational and theoretical astrophysics and related topics in solar-system and solar physics. Additional, papers on astronomical instrumentation ground-based and space-based as well as papers about numerical astrophysical techniques and supercomputer modelling are covered. Papers can be completed by short video sequences in the electronic version. Astronomical Notes/ Astronomische Nachrichten also publishes special issues of meeting proceedings.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信