Jiaming Yang , Liangping Tu , Jianxi Li , Jiawei Miao
{"title":"DRsm: Star spectral classification algorithm based on multi-feature extraction","authors":"Jiaming Yang , Liangping Tu , Jianxi Li , Jiawei Miao","doi":"10.1016/j.newast.2024.102349","DOIUrl":null,"url":null,"abstract":"<div><div>With the development of information technology, data-driven astronomical research has become a very popular subject. In view of the huge amount of spectral data from the sky, it is necessary to find suitable automatic processing methods to meet the needs of the time. Based on DenseNet model and ResNet model, DRsm (DenseNet ResNet SoftMax) algorithm is built in this paper, which realizes the automatic classification of stellar spectra. There are 6 steps to the DRsm algorithm: (1) Normalization processing: The Min–max normalization function is used to normalize the stellar spectrum to speed up the algorithm. (2) Denoising processing: The Ces algorithm is employed to denoise the stellar spectrum by reducing the photon noise that affects the spectral observations. (3) Composite RGB image: Three channels of an RGB image, corresponding to the gray image generated by the same spectrum. By superimposing the same spectrum, the effective distinguishing features of the stellar spectrum become more apparent and subsequent work is made easier. Here, we have normalized the continuous spectrum of the stellar spectrum, so that the content shown in the RGB image is basically the spectral line information of the star spectrum. At the same time, we analyze the feasibility of data conversion (synthetic RGB image) : using the main spectral line information of the star spectrum as a reference, we investigate whether the relevant pixel position of the synthesized RGB image contains these features. (4) Data enhancement: The Bottom-hat transformation (Top-hat transformation, contrast enhancement algorithm) is used to enhance the converted data, so that the main distinguishing features of the star spectrum are more obvious. (5) Feature extraction: The ResNet model and DenseNet models are used to extract features from stellar spectra, and the RGB image with a scale of 64 × 64 is extracted as a one-dimensional feature vector. (6) Automatic classification: The feature vector is then sent to the SoftMax module where it is automatically classified. The loss function used by the SoftMax module is ‘data set loss + regular term loss’. When the DRsm algorithm is used to automatically classify the spectra of A, B, dM, F, G, gM and K-type stars with R-band signal-to-noise ratio greater than 30, the classification accuracy is 0.96. The classification accuracy of this method is notably higher than that of the CNN(Convolutional Neural Networks)+Bayes, CNN+KNN, CNN+SVM, CNN+AdaBoost, and CNN+RF algorithms, which achieved accuracies of 0.862, 0.876, 0.894, 0.868, and 0.889, respectively.</div></div>","PeriodicalId":54727,"journal":{"name":"New Astronomy","volume":"116 ","pages":"Article 102349"},"PeriodicalIF":1.9000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Astronomy","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1384107624001635","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of information technology, data-driven astronomical research has become a very popular subject. In view of the huge amount of spectral data from the sky, it is necessary to find suitable automatic processing methods to meet the needs of the time. Based on DenseNet model and ResNet model, DRsm (DenseNet ResNet SoftMax) algorithm is built in this paper, which realizes the automatic classification of stellar spectra. There are 6 steps to the DRsm algorithm: (1) Normalization processing: The Min–max normalization function is used to normalize the stellar spectrum to speed up the algorithm. (2) Denoising processing: The Ces algorithm is employed to denoise the stellar spectrum by reducing the photon noise that affects the spectral observations. (3) Composite RGB image: Three channels of an RGB image, corresponding to the gray image generated by the same spectrum. By superimposing the same spectrum, the effective distinguishing features of the stellar spectrum become more apparent and subsequent work is made easier. Here, we have normalized the continuous spectrum of the stellar spectrum, so that the content shown in the RGB image is basically the spectral line information of the star spectrum. At the same time, we analyze the feasibility of data conversion (synthetic RGB image) : using the main spectral line information of the star spectrum as a reference, we investigate whether the relevant pixel position of the synthesized RGB image contains these features. (4) Data enhancement: The Bottom-hat transformation (Top-hat transformation, contrast enhancement algorithm) is used to enhance the converted data, so that the main distinguishing features of the star spectrum are more obvious. (5) Feature extraction: The ResNet model and DenseNet models are used to extract features from stellar spectra, and the RGB image with a scale of 64 × 64 is extracted as a one-dimensional feature vector. (6) Automatic classification: The feature vector is then sent to the SoftMax module where it is automatically classified. The loss function used by the SoftMax module is ‘data set loss + regular term loss’. When the DRsm algorithm is used to automatically classify the spectra of A, B, dM, F, G, gM and K-type stars with R-band signal-to-noise ratio greater than 30, the classification accuracy is 0.96. The classification accuracy of this method is notably higher than that of the CNN(Convolutional Neural Networks)+Bayes, CNN+KNN, CNN+SVM, CNN+AdaBoost, and CNN+RF algorithms, which achieved accuracies of 0.862, 0.876, 0.894, 0.868, and 0.889, respectively.
期刊介绍:
New Astronomy publishes articles in all fields of astronomy and astrophysics, with a particular focus on computational astronomy: mathematical and astronomy techniques and methodology, simulations, modelling and numerical results and computational techniques in instrumentation.
New Astronomy includes full length research articles and review articles. The journal covers solar, stellar, galactic and extragalactic astronomy and astrophysics. It reports on original research in all wavelength bands, ranging from radio to gamma-ray.