ZHANG Jin-qu , LING Yu , DU Ping , LI Xiang-ru , LI Hui
{"title":"Ensemble Feature Selection Method for Single Pulse Classification","authors":"ZHANG Jin-qu , LING Yu , DU Ping , LI Xiang-ru , LI Hui","doi":"10.1016/j.chinastron.2024.03.010","DOIUrl":null,"url":null,"abstract":"<div><p>Affected by a large number of radio frequency interference signals, it has become an important task for astronomical data processing to quickly and accurately identify single pulse signals from massive observation data. Designing and extracting effective data features is the key issue for efficient identification of single pulse signals using machine learning. This paper proposes an ensemble feature selection method for single pulse signal classification. The method first mixed three types of features, including the parametric features, statistical features, and abstract features of single pulse signals, and then used five individual feature selection methods to select the corresponding optimal feature set, respectively. At last, the features selected by the five individual methods are mixed, and the greedy strategy was used to select the optimal ensemble feature set. The experimental results show that the ensemble feature set can improve F1-score by a value of 1.8% at most, and can obtain higher accuracy than the features selected by individual methods. Under the background of high-speed and large-scale sky survey, the ensemble feature selection method plays an important role in reducing the number of features, improving classification performance, and speeding up data processing.</p></div>","PeriodicalId":35730,"journal":{"name":"Chinese Astronomy and Astrophysics","volume":"48 1","pages":"Pages 84-99"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0275106224000109/pdfft?md5=b5424318b054c6cb876d5942fc61ea19&pid=1-s2.0-S0275106224000109-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Astronomy and Astrophysics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0275106224000109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Physics and Astronomy","Score":null,"Total":0}
引用次数: 0
Abstract
Affected by a large number of radio frequency interference signals, it has become an important task for astronomical data processing to quickly and accurately identify single pulse signals from massive observation data. Designing and extracting effective data features is the key issue for efficient identification of single pulse signals using machine learning. This paper proposes an ensemble feature selection method for single pulse signal classification. The method first mixed three types of features, including the parametric features, statistical features, and abstract features of single pulse signals, and then used five individual feature selection methods to select the corresponding optimal feature set, respectively. At last, the features selected by the five individual methods are mixed, and the greedy strategy was used to select the optimal ensemble feature set. The experimental results show that the ensemble feature set can improve F1-score by a value of 1.8% at most, and can obtain higher accuracy than the features selected by individual methods. Under the background of high-speed and large-scale sky survey, the ensemble feature selection method plays an important role in reducing the number of features, improving classification performance, and speeding up data processing.
受大量射频干扰信号的影响,如何从海量观测数据中快速、准确地识别单脉冲信号已成为天文数据处理的一项重要任务。设计和提取有效的数据特征是利用机器学习高效识别单脉冲信号的关键问题。本文提出了一种用于单脉冲信号分类的集合特征选择方法。该方法首先混合了单脉冲信号的参数特征、统计特征和抽象特征等三类特征,然后使用五种单个特征选择方法分别选择相应的最优特征集。最后,将五种单独方法选出的特征进行混合,并使用贪婪策略选出最优的集合特征集。实验结果表明,集合特征集最多可将 F1 分数提高 1.8%,比单个方法选择的特征获得更高的精度。在高速、大规模巡天背景下,集合特征选择方法在减少特征数量、提高分类性能、加快数据处理速度等方面发挥了重要作用。
期刊介绍:
The vigorous growth of astronomical and astrophysical science in China led to an increase in papers on astrophysics which Acta Astronomica Sinica could no longer absorb. Translations of papers from two new journals the Chinese Journal of Space Science and Acta Astrophysica Sinica are added to the translation of Acta Astronomica Sinica to form the new journal Chinese Astronomy and Astrophysics. Chinese Astronomy and Astrophysics brings English translations of notable articles to astronomers and astrophysicists outside China.