R. C. Salvador, E. Dadios, Irister M. Javel, Antipas T. Teologo
{"title":"PULSE: A Pulsar Searching Model with Genetic Algorithm Implementation for Best Pipeline Selection and Hyperparameters Optimization","authors":"R. C. Salvador, E. Dadios, Irister M. Javel, Antipas T. Teologo","doi":"10.1109/HNICEM48295.2019.9072764","DOIUrl":null,"url":null,"abstract":"Pulsars enabled astronomers to study neutron stars and verify general relativity under intense gravitational field conditions. However, finding pulsars is not as easy as it seems because most of them have weak pulses that get drowned in the background noise and hence do not get detected. This paper presents a novel way of classifying radio emission patterns collected from a radio telescope whether it is from a pulsar or not through machine learning and genetic algorithm. The dataset was acquired from the High Time Resolution Universe (HTRU) survey two which contains eight numerical features and one target variable describing the pulse profile. Synthetic Minority Oversampling Technique (SMOTE) was applied to the dataset to fix the imbalance between classes. A genetic algorithm library was used to automatically select the best feature preprocessing method, feature selection/reduction technique, machine learning model inside the scikit-learn library, and hyperparameter settings. The genetic algorithm suggested using a single stack and multiple stack classifiers for different sets of features. The optimum level of hyperparameters was also given with the help of the same algorithm. The selected pipelines consistently reported a score of more than 99% in all the evaluation metrics used.","PeriodicalId":6733,"journal":{"name":"2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management ( HNICEM )","volume":"35 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management ( HNICEM )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM48295.2019.9072764","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Pulsars enabled astronomers to study neutron stars and verify general relativity under intense gravitational field conditions. However, finding pulsars is not as easy as it seems because most of them have weak pulses that get drowned in the background noise and hence do not get detected. This paper presents a novel way of classifying radio emission patterns collected from a radio telescope whether it is from a pulsar or not through machine learning and genetic algorithm. The dataset was acquired from the High Time Resolution Universe (HTRU) survey two which contains eight numerical features and one target variable describing the pulse profile. Synthetic Minority Oversampling Technique (SMOTE) was applied to the dataset to fix the imbalance between classes. A genetic algorithm library was used to automatically select the best feature preprocessing method, feature selection/reduction technique, machine learning model inside the scikit-learn library, and hyperparameter settings. The genetic algorithm suggested using a single stack and multiple stack classifiers for different sets of features. The optimum level of hyperparameters was also given with the help of the same algorithm. The selected pipelines consistently reported a score of more than 99% in all the evaluation metrics used.