How effective oversampling techniques are in classifying potentially hazardous asteroids

IF 2.9 3区 物理与天体物理 Q2 PHYSICS, MULTIDISCIPLINARY
Md. Sadman, Mir Sakhawat Hossain
{"title":"How effective oversampling techniques are in classifying potentially hazardous asteroids","authors":"Md. Sadman,&nbsp;Mir Sakhawat Hossain","doi":"10.1140/epjp/s13360-025-06783-2","DOIUrl":null,"url":null,"abstract":"<div><p>Potentially hazardous asteroids require strict surveillance to ensure the safety of our planet. However, the vast amount of increasing astronomical data makes it troublesome for humans to study these asteroids. Hence, machine learning techniques are used to classify these hazardous asteroids. However, machine learning models are not robust for distinguishing imbalanced classes. Various undersampling and oversampling techniques are used to address this problem. In our study, we refrained from using any undersampling technique as we did not want to lose any valuable information. Instead, we employed various oversampling techniques, including random Oversampling, SMOTE (Synthetic minority over-sampling technique), ADASYN (Adaptive Synthetic Sampling), BorderlineSmote, KMeansSmote, and SVMSmote. For each oversampling technique, we trained the Random Forest, XGBoost, LightGBM, HistGradientBoosting, and AdaBoost classifiers. Our research presents a detailed study of these oversampling techniques to determine which one is more suitable for our dataset.</p></div>","PeriodicalId":792,"journal":{"name":"The European Physical Journal Plus","volume":"140 9","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The European Physical Journal Plus","FirstCategoryId":"4","ListUrlMain":"https://link.springer.com/article/10.1140/epjp/s13360-025-06783-2","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Potentially hazardous asteroids require strict surveillance to ensure the safety of our planet. However, the vast amount of increasing astronomical data makes it troublesome for humans to study these asteroids. Hence, machine learning techniques are used to classify these hazardous asteroids. However, machine learning models are not robust for distinguishing imbalanced classes. Various undersampling and oversampling techniques are used to address this problem. In our study, we refrained from using any undersampling technique as we did not want to lose any valuable information. Instead, we employed various oversampling techniques, including random Oversampling, SMOTE (Synthetic minority over-sampling technique), ADASYN (Adaptive Synthetic Sampling), BorderlineSmote, KMeansSmote, and SVMSmote. For each oversampling technique, we trained the Random Forest, XGBoost, LightGBM, HistGradientBoosting, and AdaBoost classifiers. Our research presents a detailed study of these oversampling techniques to determine which one is more suitable for our dataset.

过采样技术在分类潜在危险小行星方面有多有效
潜在的危险小行星需要严格的监视,以确保我们星球的安全。然而,大量不断增加的天文数据给人类研究这些小行星带来了麻烦。因此,机器学习技术被用来对这些危险的小行星进行分类。然而,机器学习模型在区分不平衡类别方面并不健壮。各种欠采样和过采样技术被用来解决这个问题。在我们的研究中,我们避免使用任何欠采样技术,因为我们不想丢失任何有价值的信息。相反,我们采用了各种过采样技术,包括随机过采样、SMOTE(合成少数过采样技术)、ADASYN(自适应合成采样)、BorderlineSmote、KMeansSmote和SVMSmote。对于每种过采样技术,我们训练了Random Forest、XGBoost、LightGBM、HistGradientBoosting和AdaBoost分类器。我们的研究对这些过采样技术进行了详细的研究,以确定哪一个更适合我们的数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
The European Physical Journal Plus
The European Physical Journal Plus PHYSICS, MULTIDISCIPLINARY-
CiteScore
5.40
自引率
8.80%
发文量
1150
审稿时长
4-8 weeks
期刊介绍: The aims of this peer-reviewed online journal are to distribute and archive all relevant material required to document, assess, validate and reconstruct in detail the body of knowledge in the physical and related sciences. The scope of EPJ Plus encompasses a broad landscape of fields and disciplines in the physical and related sciences - such as covered by the topical EPJ journals and with the explicit addition of geophysics, astrophysics, general relativity and cosmology, mathematical and quantum physics, classical and fluid mechanics, accelerator and medical physics, as well as physics techniques applied to any other topics, including energy, environment and cultural heritage.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信