Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach

IF 1.6 4区 物理与天体物理 Q3 ASTRONOMY & ASTROPHYSICS
Ernesto Lee, F. Rustam, Wajdi Aljedaani, Abid Ishaq, Vaibhav Rupapara, I. Ashraf
{"title":"Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach","authors":"Ernesto Lee, F. Rustam, Wajdi Aljedaani, Abid Ishaq, Vaibhav Rupapara, I. Ashraf","doi":"10.1155/2021/4916494","DOIUrl":null,"url":null,"abstract":"Pulsar stars, usually neutron stars, are spherical and compact objects containing a large quantity of mass. Each pulsar star possesses a magnetic field and emits a slightly different pattern of electromagnetic radiation which is used to identify the potential candidates for a real pulsar star. Pulsar stars are considered an important cosmic phenomenon, and scientists use them to study nuclear physics, gravitational waves, and collisions between black holes. Defining the process of automatic detection of pulsar stars can accelerate the study of pulsar stars by scientists. This study contrives an accurate and efficient approach for true pulsar detection using supervised machine learning. For experiments, the high time-resolution (HTRU2) dataset is used in this study. To resolve the data imbalance problem and overcome model overfitting, a hybrid resampling approach is presented in this study. Experiments are performed with imbalanced and balanced datasets using well-known machine learning algorithms. Results demonstrate that the proposed hybrid resampling approach proves highly influential to avoid model overfitting and increase the prediction accuracy. With the proposed hybrid resampling approach, the extra tree classifier achieves a 0.993 accuracy score for true pulsar star prediction.","PeriodicalId":48962,"journal":{"name":"Advances in Astronomy","volume":" ","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Astronomy","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1155/2021/4916494","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 5

Abstract

Pulsar stars, usually neutron stars, are spherical and compact objects containing a large quantity of mass. Each pulsar star possesses a magnetic field and emits a slightly different pattern of electromagnetic radiation which is used to identify the potential candidates for a real pulsar star. Pulsar stars are considered an important cosmic phenomenon, and scientists use them to study nuclear physics, gravitational waves, and collisions between black holes. Defining the process of automatic detection of pulsar stars can accelerate the study of pulsar stars by scientists. This study contrives an accurate and efficient approach for true pulsar detection using supervised machine learning. For experiments, the high time-resolution (HTRU2) dataset is used in this study. To resolve the data imbalance problem and overcome model overfitting, a hybrid resampling approach is presented in this study. Experiments are performed with imbalanced and balanced datasets using well-known machine learning algorithms. Results demonstrate that the proposed hybrid resampling approach proves highly influential to avoid model overfitting and increase the prediction accuracy. With the proposed hybrid resampling approach, the extra tree classifier achieves a 0.993 accuracy score for true pulsar star prediction.
利用混合重采样方法预测不平衡数据集中的脉冲星
脉冲星,通常是中子星,是球状的致密天体,含有大量的质量。每颗脉冲星都有一个磁场,并发射出一种略有不同的电磁辐射模式,这种模式被用来识别真正脉冲星的潜在候选者。脉冲星被认为是一种重要的宇宙现象,科学家们利用它们来研究核物理学、引力波和黑洞之间的碰撞。确定脉冲星自动探测的过程,可以加快科学家对脉冲星的研究。本研究设计了一种使用监督机器学习进行真正脉冲星探测的准确有效的方法。实验采用高时间分辨率(HTRU2)数据集。为了解决数据不平衡问题和克服模型过拟合问题,本文提出了一种混合重采样方法。使用著名的机器学习算法对不平衡和平衡数据集进行实验。结果表明,混合重采样方法对避免模型过拟合和提高预测精度有很大的影响。采用混合重采样方法,额外树分类器对真实脉冲星的预测准确率达到0.993。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advances in Astronomy
Advances in Astronomy ASTRONOMY & ASTROPHYSICS-
CiteScore
2.70
自引率
7.10%
发文量
10
审稿时长
22 weeks
期刊介绍: Advances in Astronomy publishes articles in all areas of astronomy, astrophysics, and cosmology. The journal accepts both observational and theoretical investigations into celestial objects and the wider universe, as well as the reports of new methods and instrumentation for their study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信