红移不可知机器学习分类：揭示银河系、恒星和类星体分类中的峰值性能（使用SDSS DR17）

IF 1 4区物理与天体物理 Q3 ASTRONOMY & ASTROPHYSICS

Astronomische Nachrichten Pub Date : 2025-02-18 DOI:10.1002/asna.20240057

Debashis Chatterjee, Prithwish Ghosh

{"title":"红移不可知机器学习分类：揭示银河系、恒星和类星体分类中的峰值性能（使用SDSS DR17）","authors":"Debashis Chatterjee, Prithwish Ghosh","doi":"10.1002/asna.20240057","DOIUrl":null,"url":null,"abstract":"<p>Classification of galaxies, stars, and quasars using spectral data is fundamental to astronomy, but often relies heavily on redshift. This study evaluates the performance of 10 machine learning algorithms on SDSS data to classify these objects, with a particular focus on scenarios where redshift information is unavailable. Leveraging features such as “z,” “u,” “g,” “r,” “i,” and redshift, we assess the accuracy of various algorithms, including XGBoost, Random Forest, and recurrent neural networks (RNNs). Our analysis demonstrates the superior accuracy of the Random Forest classifier when redshift is included. The feature importance analysis reveals that “redshift” is the most critical feature, contributing 64.7% to the classification accuracy, followed by the “z” band (10.0%) and the “g” band (7.95%). However, even in the absence of redshift, XGBoost, Random Forest, and RNNs exhibit promising results, indicating the potential of photometric data for accurate classification. We systematically compare classification outcomes with and without redshift, revealing the relative importance of different features and identifying the most robust classifiers for redshift-limited scenarios. This research not only highlights the power of machine learning for astronomical classification but also provides a framework for reliable classification when redshift data is lacking. By uncovering the distinguishing spectral characteristics of galaxies, stars, and quasars that are independent of redshift, we open new avenues for efficient and accurate classification in large-scale photometric surveys and the study of faint, high-redshift objects.</p>","PeriodicalId":55442,"journal":{"name":"Astronomische Nachrichten","volume":"346 5","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asna.20240057","citationCount":"0","resultStr":"{\"title\":\"Redshift-Agnostic Machine Learning Classification: Unveiling Peak Performance in Galaxy, Star, and Quasar Classification (Using SDSS DR17)\",\"authors\":\"Debashis Chatterjee, Prithwish Ghosh\",\"doi\":\"10.1002/asna.20240057\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Classification of galaxies, stars, and quasars using spectral data is fundamental to astronomy, but often relies heavily on redshift. This study evaluates the performance of 10 machine learning algorithms on SDSS data to classify these objects, with a particular focus on scenarios where redshift information is unavailable. Leveraging features such as “z,” “u,” “g,” “r,” “i,” and redshift, we assess the accuracy of various algorithms, including XGBoost, Random Forest, and recurrent neural networks (RNNs). Our analysis demonstrates the superior accuracy of the Random Forest classifier when redshift is included. The feature importance analysis reveals that “redshift” is the most critical feature, contributing 64.7% to the classification accuracy, followed by the “z” band (10.0%) and the “g” band (7.95%). However, even in the absence of redshift, XGBoost, Random Forest, and RNNs exhibit promising results, indicating the potential of photometric data for accurate classification. We systematically compare classification outcomes with and without redshift, revealing the relative importance of different features and identifying the most robust classifiers for redshift-limited scenarios. This research not only highlights the power of machine learning for astronomical classification but also provides a framework for reliable classification when redshift data is lacking. By uncovering the distinguishing spectral characteristics of galaxies, stars, and quasars that are independent of redshift, we open new avenues for efficient and accurate classification in large-scale photometric surveys and the study of faint, high-redshift objects.</p>\",\"PeriodicalId\":55442,\"journal\":{\"name\":\"Astronomische Nachrichten\",\"volume\":\"346 5\",\"pages\":\"\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2025-02-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asna.20240057\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Astronomische Nachrichten\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/asna.20240057\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astronomische Nachrichten","FirstCategoryId":"101","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/asna.20240057","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}

引用次数: 0

摘要

利用光谱数据对星系、恒星和类星体进行分类是天文学的基础，但往往严重依赖于红移。本研究评估了10种机器学习算法在SDSS数据上的性能，以对这些对象进行分类，特别关注红移信息不可用的场景。利用“z”、“u”、“g”、“r”、“i”和红移等特征，我们评估了各种算法的准确性，包括XGBoost、随机森林和循环神经网络（rnn）。我们的分析表明，当考虑红移时，随机森林分类器的精度更高。特征重要性分析显示，“红移”是最关键的特征，对分类准确率的贡献率为64.7%，其次是“z”波段（10.0%）和“g”波段（7.95%）。然而，即使在没有红移的情况下，XGBoost、Random Forest和rnn也表现出了很好的结果，这表明了光度数据在准确分类方面的潜力。我们系统地比较了有红移和没有红移的分类结果，揭示了不同特征的相对重要性，并确定了红移受限场景下最鲁棒的分类器。这项研究不仅突出了机器学习在天文分类中的力量，而且在缺乏红移数据的情况下，为可靠的分类提供了一个框架。通过揭示独立于红移的星系、恒星和类星体的独特光谱特征，我们为大规模光度调查和研究微弱、高红移天体的有效和准确分类开辟了新的途径。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Redshift-Agnostic Machine Learning Classification: Unveiling Peak Performance in Galaxy, Star, and Quasar Classification (Using SDSS DR17)

查看原文本刊更多论文

Redshift-Agnostic Machine Learning Classification: Unveiling Peak Performance in Galaxy, Star, and Quasar Classification (Using SDSS DR17)

Classification of galaxies, stars, and quasars using spectral data is fundamental to astronomy, but often relies heavily on redshift. This study evaluates the performance of 10 machine learning algorithms on SDSS data to classify these objects, with a particular focus on scenarios where redshift information is unavailable. Leveraging features such as “z,” “u,” “g,” “r,” “i,” and redshift, we assess the accuracy of various algorithms, including XGBoost, Random Forest, and recurrent neural networks (RNNs). Our analysis demonstrates the superior accuracy of the Random Forest classifier when redshift is included. The feature importance analysis reveals that “redshift” is the most critical feature, contributing 64.7% to the classification accuracy, followed by the “z” band (10.0%) and the “g” band (7.95%). However, even in the absence of redshift, XGBoost, Random Forest, and RNNs exhibit promising results, indicating the potential of photometric data for accurate classification. We systematically compare classification outcomes with and without redshift, revealing the relative importance of different features and identifying the most robust classifiers for redshift-limited scenarios. This research not only highlights the power of machine learning for astronomical classification but also provides a framework for reliable classification when redshift data is lacking. By uncovering the distinguishing spectral characteristics of galaxies, stars, and quasars that are independent of redshift, we open new avenues for efficient and accurate classification in large-scale photometric surveys and the study of faint, high-redshift objects.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Astronomische Nachrichten 地学天文-天文与天体物理

CiteScore

1.80

自引率

11.10%

发文量

审稿时长

4-8 weeks

期刊介绍： Astronomische Nachrichten, founded in 1821 by H. C. Schumacher, is the oldest astronomical journal worldwide still being published. Famous astronomical discoveries and important papers on astronomy and astrophysics published in more than 300 volumes of the journal give an outstanding representation of the progress of astronomical research over the last 180 years. Today, Astronomical Notes/ Astronomische Nachrichten publishes articles in the field of observational and theoretical astrophysics and related topics in solar-system and solar physics. Additional, papers on astronomical instrumentation ground-based and space-based as well as papers about numerical astrophysical techniques and supercomputer modelling are covered. Papers can be completed by short video sequences in the electronic version. Astronomical Notes/ Astronomische Nachrichten also publishes special issues of meeting proceedings.