Asteroid family classification with machine learning: Investigative analysis of a novel two-step approach for categorizing known small asteroid families

IF 2.7 · Zone 3 (Physics and Astrophysics) · Q2 Astronomy & Astrophysics
Fatin Abrar Shams, Abdullah Al Mahmud Nafiz, Md. Salman Mohosheu, Maheen Mashrur Hoque, Samiur Rashid Abir, Rashed Hasan Ratul, Md. Mushfiqur Rahman Mushfique, Aftab Ibn Nazim, Rubaiat Rehman Khan, Md Mahmudunnobe, Mohsinul Kabir
Experimental Astronomy, volume 59, issue 1. Published 2025-01-31. DOI: 10.1007/s10686-025-09982-y
https://link.springer.com/article/10.1007/s10686-025-09982-y

Abstract

The term “asteroid family” refers to a group of asteroids that share similar proper orbital elements such as semi-major axis, eccentricity, and orbital inclination. Detecting small asteroid families has long been a challenge because of their extremely small sample sizes. In general, standalone machine learning classifiers tend to be biased towards classes with larger sample sizes, so asteroid families with limited data are classified poorly. This paper proposes a two-step supervised model for classifying asteroid families, aimed especially at the tiny, small, and lower-medium families. The two-step design is an attempt to resolve the challenges of the imbalanced dataset: in the first step, an XGB (Extreme Gradient Boosting) classifier performs a binary classification into small and large families; in the second step, a Random Forest classifier is used alongside the previously identified binary features to classify the asteroid families. The proposed model achieved higher F1 scores for tiny and small asteroid families than the other algorithms tested in this work, and it achieved a perfect F1 score for 90 of the 112 families tested. For the lower group of medium-sized asteroid families, it performed slightly worse than previously used machine learning algorithms. In addition, four dataset imbalance handling techniques were employed in this work and compared with the proposed algorithm.
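
The data flow of the two-step idea can be illustrated with a minimal sketch. This is not the authors' implementation: a plain 1-nearest-neighbour rule stands in for both the XGB and the Random Forest classifier so that the example needs only the Python standard library, and the family names, the proper-element values, and the helper names `nearest_label` and `two_step_predict` are all hypothetical.

```python
import math

def nearest_label(train_X, train_y, row):
    """Return the label of the training row closest to `row` (1-NN stand-in)."""
    best = min(range(len(train_X)), key=lambda i: math.dist(train_X[i], row))
    return train_y[best]

def two_step_predict(train_X, train_family, small_families, row):
    """Two-step prediction: small/large flag first, then the family label."""
    # Step 1: binary classification into small (1.0) and large (0.0) families.
    size_flags = [1.0 if fam in small_families else 0.0 for fam in train_family]
    flag = nearest_label(train_X, size_flags, row)

    # Step 2: append the binary flag as an extra feature and classify the family.
    train_aug = [x + [f] for x, f in zip(train_X, size_flags)]
    family = nearest_label(train_aug, train_family, row + [flag])
    return flag, family

# Hypothetical proper elements: [semi-major axis (au), eccentricity, sin(inclination)].
train_X = [
    [2.20, 0.10, 0.05], [2.21, 0.11, 0.05], [2.19, 0.10, 0.06], [2.22, 0.09, 0.05],
    [2.60, 0.15, 0.20], [2.61, 0.15, 0.21],
    [3.10, 0.05, 0.10], [3.11, 0.06, 0.10],
]
train_family = ["Large_A"] * 4 + ["Small_B"] * 2 + ["Small_C"] * 2
small_families = {"Small_B", "Small_C"}  # assumed size split, for illustration only

small_result = two_step_predict(train_X, train_family, small_families, [2.605, 0.15, 0.20])
large_result = two_step_predict(train_X, train_family, small_families, [2.205, 0.10, 0.05])
```

In this sketch the flag from the first step becomes an additional input to the second step, which is the structural point of the approach; the choice of classifier in each step is independent of that structure.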

Source journal: Experimental Astronomy
Category: Earth Sciences and Astronomy, Astronomy and Astrophysics
CiteScore: 5.30
Self-citation rate: 3.30%
Articles published: 57
Review time: 6-12 weeks
About the journal: Many new instruments for observing astronomical objects at a variety of wavelengths have been and are continually being developed. Furthermore, a vast amount of effort is being put into the development of new techniques for data analysis in order to cope with great streams of data collected by these instruments. Experimental Astronomy acts as a medium for the publication of papers of contemporary scientific interest on astrophysical instrumentation and methods necessary for the conduct of astronomy at all wavelength fields. Experimental Astronomy publishes full-length articles, research letters and reviews on developments in detection techniques, instruments, and data analysis and image processing techniques. Occasional special issues are published, giving an in-depth presentation of the instrumentation and/or analysis connected with specific projects, such as satellite experiments or ground-based telescopes, or of specialized techniques.