Chenyang Duan , Zhuomin Huang , Yue Jin , Huaqiang Li , Haoyu Yang , Tianyang Sun , Chen Sun , Shu Liu , Jin Yu
{"title":"在 LIBS 光谱处理中结合 XGBoost 和神经网络,精确测定 620 块不同产地铁矿石样品中的关键元素","authors":"Chenyang Duan , Zhuomin Huang , Yue Jin , Huaqiang Li , Haoyu Yang , Tianyang Sun , Chen Sun , Shu Liu , Jin Yu","doi":"10.1016/j.sab.2024.107056","DOIUrl":null,"url":null,"abstract":"<div><div>China is the worldwide largest importer of iron ores for supplying its iron and steel industries. The necessary inspections of iron ores not only concern total iron content but also involve minor elements, such as silicium, aluminum, magnesium, and calcium. Laser-induced breakdown spectroscopy (LIBS) promises <em>in situ</em> and on-site detection and analysis capabilities, which is required for efficient, reliable, and large-scale impartation inspections. Such opportunity represents at the same time, a challenge for the technique in terms of the quantitative analysis ability. A fundamental issue corresponds to the matrix effect in LIBS analysis due to a large diversity of iron ores in terms of mineralogical and chemical compositions. Demonstrated as effective to deal with the matrix effect in LIBS analyses of geological materials, the machine learning approach needs to be implemented within an optimized data processing pipeline, combining algorithms carefully chosen and tested. In this work, we developed a sequentially connected combination of an extreme gradient boosting (XGBoost) model and a back propagation neural network (BPNN) model, to effectively fit the functional relation between the concentrations and the spectra. The specificity of the approach consists in fitting the linear, low-order nonlinear, and high-order nonlinear terms in a sequential way to substantially improve the model prediction performances. The method has been developed and tested with a significant set of 620 collected iron ore samples of eight production countries and 35 different commercial brands, with a wide range of matrices in terms of mineralogical and chemical compositions. The samples underwent prior characterization using conventional laboratory analysis methods to establish the reference elemental concentrations. The results from this necessary step of laboratory investigation, with root mean square error for prediction (<span><math><mi>RMSEP</mi></math></span>) of 0.407 wt%, 0.372 wt%, 0.085 wt%, 0.131 wt%, and 0.114 wt%, respectively for total iron (TFe), SiO<sub>2</sub>, Al<sub>2</sub>O<sub>3</sub>, MgO, and CaO, show the potential for on-site inspections.</div></div>","PeriodicalId":21890,"journal":{"name":"Spectrochimica Acta Part B: Atomic Spectroscopy","volume":"221 ","pages":"Article 107056"},"PeriodicalIF":3.2000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A combination of XGBoost and neural network in LIBS spectrum processing for precise determination of critical elements in 620 iron ore samples of various origins\",\"authors\":\"Chenyang Duan , Zhuomin Huang , Yue Jin , Huaqiang Li , Haoyu Yang , Tianyang Sun , Chen Sun , Shu Liu , Jin Yu\",\"doi\":\"10.1016/j.sab.2024.107056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>China is the worldwide largest importer of iron ores for supplying its iron and steel industries. The necessary inspections of iron ores not only concern total iron content but also involve minor elements, such as silicium, aluminum, magnesium, and calcium. Laser-induced breakdown spectroscopy (LIBS) promises <em>in situ</em> and on-site detection and analysis capabilities, which is required for efficient, reliable, and large-scale impartation inspections. Such opportunity represents at the same time, a challenge for the technique in terms of the quantitative analysis ability. A fundamental issue corresponds to the matrix effect in LIBS analysis due to a large diversity of iron ores in terms of mineralogical and chemical compositions. Demonstrated as effective to deal with the matrix effect in LIBS analyses of geological materials, the machine learning approach needs to be implemented within an optimized data processing pipeline, combining algorithms carefully chosen and tested. In this work, we developed a sequentially connected combination of an extreme gradient boosting (XGBoost) model and a back propagation neural network (BPNN) model, to effectively fit the functional relation between the concentrations and the spectra. The specificity of the approach consists in fitting the linear, low-order nonlinear, and high-order nonlinear terms in a sequential way to substantially improve the model prediction performances. The method has been developed and tested with a significant set of 620 collected iron ore samples of eight production countries and 35 different commercial brands, with a wide range of matrices in terms of mineralogical and chemical compositions. The samples underwent prior characterization using conventional laboratory analysis methods to establish the reference elemental concentrations. The results from this necessary step of laboratory investigation, with root mean square error for prediction (<span><math><mi>RMSEP</mi></math></span>) of 0.407 wt%, 0.372 wt%, 0.085 wt%, 0.131 wt%, and 0.114 wt%, respectively for total iron (TFe), SiO<sub>2</sub>, Al<sub>2</sub>O<sub>3</sub>, MgO, and CaO, show the potential for on-site inspections.</div></div>\",\"PeriodicalId\":21890,\"journal\":{\"name\":\"Spectrochimica Acta Part B: Atomic Spectroscopy\",\"volume\":\"221 \",\"pages\":\"Article 107056\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spectrochimica Acta Part B: Atomic Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0584854724002015\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part B: Atomic Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0584854724002015","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
A combination of XGBoost and neural network in LIBS spectrum processing for precise determination of critical elements in 620 iron ore samples of various origins
China is the worldwide largest importer of iron ores for supplying its iron and steel industries. The necessary inspections of iron ores not only concern total iron content but also involve minor elements, such as silicium, aluminum, magnesium, and calcium. Laser-induced breakdown spectroscopy (LIBS) promises in situ and on-site detection and analysis capabilities, which is required for efficient, reliable, and large-scale impartation inspections. Such opportunity represents at the same time, a challenge for the technique in terms of the quantitative analysis ability. A fundamental issue corresponds to the matrix effect in LIBS analysis due to a large diversity of iron ores in terms of mineralogical and chemical compositions. Demonstrated as effective to deal with the matrix effect in LIBS analyses of geological materials, the machine learning approach needs to be implemented within an optimized data processing pipeline, combining algorithms carefully chosen and tested. In this work, we developed a sequentially connected combination of an extreme gradient boosting (XGBoost) model and a back propagation neural network (BPNN) model, to effectively fit the functional relation between the concentrations and the spectra. The specificity of the approach consists in fitting the linear, low-order nonlinear, and high-order nonlinear terms in a sequential way to substantially improve the model prediction performances. The method has been developed and tested with a significant set of 620 collected iron ore samples of eight production countries and 35 different commercial brands, with a wide range of matrices in terms of mineralogical and chemical compositions. The samples underwent prior characterization using conventional laboratory analysis methods to establish the reference elemental concentrations. The results from this necessary step of laboratory investigation, with root mean square error for prediction () of 0.407 wt%, 0.372 wt%, 0.085 wt%, 0.131 wt%, and 0.114 wt%, respectively for total iron (TFe), SiO2, Al2O3, MgO, and CaO, show the potential for on-site inspections.
期刊介绍:
Spectrochimica Acta Part B: Atomic Spectroscopy, is intended for the rapid publication of both original work and reviews in the following fields:
Atomic Emission (AES), Atomic Absorption (AAS) and Atomic Fluorescence (AFS) spectroscopy;
Mass Spectrometry (MS) for inorganic analysis covering Spark Source (SS-MS), Inductively Coupled Plasma (ICP-MS), Glow Discharge (GD-MS), and Secondary Ion Mass Spectrometry (SIMS).
Laser induced atomic spectroscopy for inorganic analysis, including non-linear optical laser spectroscopy, covering Laser Enhanced Ionization (LEI), Laser Induced Fluorescence (LIF), Resonance Ionization Spectroscopy (RIS) and Resonance Ionization Mass Spectrometry (RIMS); Laser Induced Breakdown Spectroscopy (LIBS); Cavity Ringdown Spectroscopy (CRDS), Laser Ablation Inductively Coupled Plasma Atomic Emission Spectroscopy (LA-ICP-AES) and Laser Ablation Inductively Coupled Plasma Mass Spectrometry (LA-ICP-MS).
X-ray spectrometry, X-ray Optics and Microanalysis, including X-ray fluorescence spectrometry (XRF) and related techniques, in particular Total-reflection X-ray Fluorescence Spectrometry (TXRF), and Synchrotron Radiation-excited Total reflection XRF (SR-TXRF).
Manuscripts dealing with (i) fundamentals, (ii) methodology development, (iii)instrumentation, and (iv) applications, can be submitted for publication.