Yucong Jin , Fengye Chen , Chen Sun , Beiyi Zhang , Yunfei Rao , Tianyang Sun , Ding Li , Haoyu Yang , Li Wang , Yu-Yan Sara Zhao , Olivier Forni , Agnès Cousin , Sylvestre Maurice , Jin Yu
{"title":"基于轮廓优化特征选择和神经网络的含碳有机化合物矩阵不敏感分类","authors":"Yucong Jin , Fengye Chen , Chen Sun , Beiyi Zhang , Yunfei Rao , Tianyang Sun , Ding Li , Haoyu Yang , Li Wang , Yu-Yan Sara Zhao , Olivier Forni , Agnès Cousin , Sylvestre Maurice , Jin Yu","doi":"10.1016/j.sab.2025.107252","DOIUrl":null,"url":null,"abstract":"<div><div>The presence of carbon-containing organic matter on Mars is closely linked to the planet's potential habitability and the possibility of past life. Laser-Induced Breakdown Spectroscopy (LIBS) has emerged as a key technology in Martian organic studies due to its advantages, including real-time detection, on-site analysis, and elimination of sample preparation requirements. However, practical implementation of LIBS still faces challenges such as interference from Mars' background CO<span><math><msub><mrow></mrow><mn>2</mn></msub></math></span> gas and significant chemical matrix effects caused by the planet's diverse geological environment. In this study, we constructed a laboratory-based simulation chamber that mimics the Martian environment and prepared a sample library consisting of four possible Martian matrices doped with three different organic compounds. Three matrices were used as training sets while another kind of Mars soil simulator JMSS-1 served as an independent test set to simulate the real world exploration on Mars. In order to overcome the matrix effect and influence of CO<span><math><msub><mrow></mrow><mn>2</mn></msub></math></span> background, we introduced a novel strategy called SC-SKB-BPNN. Silhouette coefficent(SC) is employed to evaluate the clustering effect after the SelectKBest(SKB) feature selection and t-SNE dimensionality reduction, which enabled us to optimize the number of features and empowered the backward propagation neutral network(BPNN) classification model. Finally, after implementing the SC-SKB-BPNN algorithm, we successfully achieved precise classification of carbon-containing organic compound samples in a simulated Mars environment, attaining a sensitivity and specificity of 95.6 % and 97.8 % for the training set, while obtaining values of 93.3 % and 96.7 % for the test set.</div></div>","PeriodicalId":21890,"journal":{"name":"Spectrochimica Acta Part B: Atomic Spectroscopy","volume":"231 ","pages":"Article 107252"},"PeriodicalIF":3.2000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A matrix-insensitive classification of carbon-containing organic compounds via silhouette-optimized feature selection and neural network\",\"authors\":\"Yucong Jin , Fengye Chen , Chen Sun , Beiyi Zhang , Yunfei Rao , Tianyang Sun , Ding Li , Haoyu Yang , Li Wang , Yu-Yan Sara Zhao , Olivier Forni , Agnès Cousin , Sylvestre Maurice , Jin Yu\",\"doi\":\"10.1016/j.sab.2025.107252\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The presence of carbon-containing organic matter on Mars is closely linked to the planet's potential habitability and the possibility of past life. Laser-Induced Breakdown Spectroscopy (LIBS) has emerged as a key technology in Martian organic studies due to its advantages, including real-time detection, on-site analysis, and elimination of sample preparation requirements. However, practical implementation of LIBS still faces challenges such as interference from Mars' background CO<span><math><msub><mrow></mrow><mn>2</mn></msub></math></span> gas and significant chemical matrix effects caused by the planet's diverse geological environment. In this study, we constructed a laboratory-based simulation chamber that mimics the Martian environment and prepared a sample library consisting of four possible Martian matrices doped with three different organic compounds. Three matrices were used as training sets while another kind of Mars soil simulator JMSS-1 served as an independent test set to simulate the real world exploration on Mars. In order to overcome the matrix effect and influence of CO<span><math><msub><mrow></mrow><mn>2</mn></msub></math></span> background, we introduced a novel strategy called SC-SKB-BPNN. Silhouette coefficent(SC) is employed to evaluate the clustering effect after the SelectKBest(SKB) feature selection and t-SNE dimensionality reduction, which enabled us to optimize the number of features and empowered the backward propagation neutral network(BPNN) classification model. Finally, after implementing the SC-SKB-BPNN algorithm, we successfully achieved precise classification of carbon-containing organic compound samples in a simulated Mars environment, attaining a sensitivity and specificity of 95.6 % and 97.8 % for the training set, while obtaining values of 93.3 % and 96.7 % for the test set.</div></div>\",\"PeriodicalId\":21890,\"journal\":{\"name\":\"Spectrochimica Acta Part B: Atomic Spectroscopy\",\"volume\":\"231 \",\"pages\":\"Article 107252\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spectrochimica Acta Part B: Atomic Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0584854725001375\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part B: Atomic Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0584854725001375","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
A matrix-insensitive classification of carbon-containing organic compounds via silhouette-optimized feature selection and neural network
The presence of carbon-containing organic matter on Mars is closely linked to the planet's potential habitability and the possibility of past life. Laser-Induced Breakdown Spectroscopy (LIBS) has emerged as a key technology in Martian organic studies due to its advantages, including real-time detection, on-site analysis, and elimination of sample preparation requirements. However, practical implementation of LIBS still faces challenges such as interference from Mars' background CO gas and significant chemical matrix effects caused by the planet's diverse geological environment. In this study, we constructed a laboratory-based simulation chamber that mimics the Martian environment and prepared a sample library consisting of four possible Martian matrices doped with three different organic compounds. Three matrices were used as training sets while another kind of Mars soil simulator JMSS-1 served as an independent test set to simulate the real world exploration on Mars. In order to overcome the matrix effect and influence of CO background, we introduced a novel strategy called SC-SKB-BPNN. Silhouette coefficent(SC) is employed to evaluate the clustering effect after the SelectKBest(SKB) feature selection and t-SNE dimensionality reduction, which enabled us to optimize the number of features and empowered the backward propagation neutral network(BPNN) classification model. Finally, after implementing the SC-SKB-BPNN algorithm, we successfully achieved precise classification of carbon-containing organic compound samples in a simulated Mars environment, attaining a sensitivity and specificity of 95.6 % and 97.8 % for the training set, while obtaining values of 93.3 % and 96.7 % for the test set.
期刊介绍:
Spectrochimica Acta Part B: Atomic Spectroscopy, is intended for the rapid publication of both original work and reviews in the following fields:
Atomic Emission (AES), Atomic Absorption (AAS) and Atomic Fluorescence (AFS) spectroscopy;
Mass Spectrometry (MS) for inorganic analysis covering Spark Source (SS-MS), Inductively Coupled Plasma (ICP-MS), Glow Discharge (GD-MS), and Secondary Ion Mass Spectrometry (SIMS).
Laser induced atomic spectroscopy for inorganic analysis, including non-linear optical laser spectroscopy, covering Laser Enhanced Ionization (LEI), Laser Induced Fluorescence (LIF), Resonance Ionization Spectroscopy (RIS) and Resonance Ionization Mass Spectrometry (RIMS); Laser Induced Breakdown Spectroscopy (LIBS); Cavity Ringdown Spectroscopy (CRDS), Laser Ablation Inductively Coupled Plasma Atomic Emission Spectroscopy (LA-ICP-AES) and Laser Ablation Inductively Coupled Plasma Mass Spectrometry (LA-ICP-MS).
X-ray spectrometry, X-ray Optics and Microanalysis, including X-ray fluorescence spectrometry (XRF) and related techniques, in particular Total-reflection X-ray Fluorescence Spectrometry (TXRF), and Synchrotron Radiation-excited Total reflection XRF (SR-TXRF).
Manuscripts dealing with (i) fundamentals, (ii) methodology development, (iii)instrumentation, and (iv) applications, can be submitted for publication.