Ensemble Feature Selection Method for Single Pulse Classification

Q4 Physics and Astronomy
ZHANG Jin-qu , LING Yu , DU Ping , LI Xiang-ru , LI Hui
{"title":"Ensemble Feature Selection Method for Single Pulse Classification","authors":"ZHANG Jin-qu ,&nbsp;LING Yu ,&nbsp;DU Ping ,&nbsp;LI Xiang-ru ,&nbsp;LI Hui","doi":"10.1016/j.chinastron.2024.03.010","DOIUrl":null,"url":null,"abstract":"<div><p>Affected by a large number of radio frequency interference signals, it has become an important task for astronomical data processing to quickly and accurately identify single pulse signals from massive observation data. Designing and extracting effective data features is the key issue for efficient identification of single pulse signals using machine learning. This paper proposes an ensemble feature selection method for single pulse signal classification. The method first mixed three types of features, including the parametric features, statistical features, and abstract features of single pulse signals, and then used five individual feature selection methods to select the corresponding optimal feature set, respectively. At last, the features selected by the five individual methods are mixed, and the greedy strategy was used to select the optimal ensemble feature set. The experimental results show that the ensemble feature set can improve F1-score by a value of 1.8% at most, and can obtain higher accuracy than the features selected by individual methods. Under the background of high-speed and large-scale sky survey, the ensemble feature selection method plays an important role in reducing the number of features, improving classification performance, and speeding up data processing.</p></div>","PeriodicalId":35730,"journal":{"name":"Chinese Astronomy and Astrophysics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0275106224000109/pdfft?md5=b5424318b054c6cb876d5942fc61ea19&pid=1-s2.0-S0275106224000109-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Astronomy and Astrophysics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0275106224000109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Physics and Astronomy","Score":null,"Total":0}
引用次数: 0

Abstract

Affected by a large number of radio frequency interference signals, it has become an important task for astronomical data processing to quickly and accurately identify single pulse signals from massive observation data. Designing and extracting effective data features is the key issue for efficient identification of single pulse signals using machine learning. This paper proposes an ensemble feature selection method for single pulse signal classification. The method first mixed three types of features, including the parametric features, statistical features, and abstract features of single pulse signals, and then used five individual feature selection methods to select the corresponding optimal feature set, respectively. At last, the features selected by the five individual methods are mixed, and the greedy strategy was used to select the optimal ensemble feature set. The experimental results show that the ensemble feature set can improve F1-score by a value of 1.8% at most, and can obtain higher accuracy than the features selected by individual methods. Under the background of high-speed and large-scale sky survey, the ensemble feature selection method plays an important role in reducing the number of features, improving classification performance, and speeding up data processing.

用于单脉冲分类的集合特征选择方法
受大量射频干扰信号的影响,如何从海量观测数据中快速、准确地识别单脉冲信号已成为天文数据处理的一项重要任务。设计和提取有效的数据特征是利用机器学习高效识别单脉冲信号的关键问题。本文提出了一种用于单脉冲信号分类的集合特征选择方法。该方法首先混合了单脉冲信号的参数特征、统计特征和抽象特征等三类特征,然后使用五种单个特征选择方法分别选择相应的最优特征集。最后,将五种单独方法选出的特征进行混合,并使用贪婪策略选出最优的集合特征集。实验结果表明,集合特征集最多可将 F1 分数提高 1.8%,比单个方法选择的特征获得更高的精度。在高速、大规模巡天背景下,集合特征选择方法在减少特征数量、提高分类性能、加快数据处理速度等方面发挥了重要作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Chinese Astronomy and Astrophysics
Chinese Astronomy and Astrophysics Physics and Astronomy-Astronomy and Astrophysics
CiteScore
0.70
自引率
0.00%
发文量
20
期刊介绍: The vigorous growth of astronomical and astrophysical science in China led to an increase in papers on astrophysics which Acta Astronomica Sinica could no longer absorb. Translations of papers from two new journals the Chinese Journal of Space Science and Acta Astrophysica Sinica are added to the translation of Acta Astronomica Sinica to form the new journal Chinese Astronomy and Astrophysics. Chinese Astronomy and Astrophysics brings English translations of notable articles to astronomers and astrophysicists outside China.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信