在 LIBS 光谱处理中结合 XGBoost 和神经网络,精确测定 620 块不同产地铁矿石样品中的关键元素

IF 3.2 2区 化学 Q1 SPECTROSCOPY
Chenyang Duan , Zhuomin Huang , Yue Jin , Huaqiang Li , Haoyu Yang , Tianyang Sun , Chen Sun , Shu Liu , Jin Yu
{"title":"在 LIBS 光谱处理中结合 XGBoost 和神经网络,精确测定 620 块不同产地铁矿石样品中的关键元素","authors":"Chenyang Duan ,&nbsp;Zhuomin Huang ,&nbsp;Yue Jin ,&nbsp;Huaqiang Li ,&nbsp;Haoyu Yang ,&nbsp;Tianyang Sun ,&nbsp;Chen Sun ,&nbsp;Shu Liu ,&nbsp;Jin Yu","doi":"10.1016/j.sab.2024.107056","DOIUrl":null,"url":null,"abstract":"<div><div>China is the worldwide largest importer of iron ores for supplying its iron and steel industries. The necessary inspections of iron ores not only concern total iron content but also involve minor elements, such as silicium, aluminum, magnesium, and calcium. Laser-induced breakdown spectroscopy (LIBS) promises <em>in situ</em> and on-site detection and analysis capabilities, which is required for efficient, reliable, and large-scale impartation inspections. Such opportunity represents at the same time, a challenge for the technique in terms of the quantitative analysis ability. A fundamental issue corresponds to the matrix effect in LIBS analysis due to a large diversity of iron ores in terms of mineralogical and chemical compositions. Demonstrated as effective to deal with the matrix effect in LIBS analyses of geological materials, the machine learning approach needs to be implemented within an optimized data processing pipeline, combining algorithms carefully chosen and tested. In this work, we developed a sequentially connected combination of an extreme gradient boosting (XGBoost) model and a back propagation neural network (BPNN) model, to effectively fit the functional relation between the concentrations and the spectra. The specificity of the approach consists in fitting the linear, low-order nonlinear, and high-order nonlinear terms in a sequential way to substantially improve the model prediction performances. The method has been developed and tested with a significant set of 620 collected iron ore samples of eight production countries and 35 different commercial brands, with a wide range of matrices in terms of mineralogical and chemical compositions. The samples underwent prior characterization using conventional laboratory analysis methods to establish the reference elemental concentrations. The results from this necessary step of laboratory investigation, with root mean square error for prediction (<span><math><mi>RMSEP</mi></math></span>) of 0.407 wt%, 0.372 wt%, 0.085 wt%, 0.131 wt%, and 0.114 wt%, respectively for total iron (TFe), SiO<sub>2</sub>, Al<sub>2</sub>O<sub>3</sub>, MgO, and CaO, show the potential for on-site inspections.</div></div>","PeriodicalId":21890,"journal":{"name":"Spectrochimica Acta Part B: Atomic Spectroscopy","volume":"221 ","pages":"Article 107056"},"PeriodicalIF":3.2000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A combination of XGBoost and neural network in LIBS spectrum processing for precise determination of critical elements in 620 iron ore samples of various origins\",\"authors\":\"Chenyang Duan ,&nbsp;Zhuomin Huang ,&nbsp;Yue Jin ,&nbsp;Huaqiang Li ,&nbsp;Haoyu Yang ,&nbsp;Tianyang Sun ,&nbsp;Chen Sun ,&nbsp;Shu Liu ,&nbsp;Jin Yu\",\"doi\":\"10.1016/j.sab.2024.107056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>China is the worldwide largest importer of iron ores for supplying its iron and steel industries. The necessary inspections of iron ores not only concern total iron content but also involve minor elements, such as silicium, aluminum, magnesium, and calcium. Laser-induced breakdown spectroscopy (LIBS) promises <em>in situ</em> and on-site detection and analysis capabilities, which is required for efficient, reliable, and large-scale impartation inspections. Such opportunity represents at the same time, a challenge for the technique in terms of the quantitative analysis ability. A fundamental issue corresponds to the matrix effect in LIBS analysis due to a large diversity of iron ores in terms of mineralogical and chemical compositions. Demonstrated as effective to deal with the matrix effect in LIBS analyses of geological materials, the machine learning approach needs to be implemented within an optimized data processing pipeline, combining algorithms carefully chosen and tested. In this work, we developed a sequentially connected combination of an extreme gradient boosting (XGBoost) model and a back propagation neural network (BPNN) model, to effectively fit the functional relation between the concentrations and the spectra. The specificity of the approach consists in fitting the linear, low-order nonlinear, and high-order nonlinear terms in a sequential way to substantially improve the model prediction performances. The method has been developed and tested with a significant set of 620 collected iron ore samples of eight production countries and 35 different commercial brands, with a wide range of matrices in terms of mineralogical and chemical compositions. The samples underwent prior characterization using conventional laboratory analysis methods to establish the reference elemental concentrations. The results from this necessary step of laboratory investigation, with root mean square error for prediction (<span><math><mi>RMSEP</mi></math></span>) of 0.407 wt%, 0.372 wt%, 0.085 wt%, 0.131 wt%, and 0.114 wt%, respectively for total iron (TFe), SiO<sub>2</sub>, Al<sub>2</sub>O<sub>3</sub>, MgO, and CaO, show the potential for on-site inspections.</div></div>\",\"PeriodicalId\":21890,\"journal\":{\"name\":\"Spectrochimica Acta Part B: Atomic Spectroscopy\",\"volume\":\"221 \",\"pages\":\"Article 107056\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spectrochimica Acta Part B: Atomic Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0584854724002015\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part B: Atomic Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0584854724002015","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0

摘要

中国是世界上最大的铁矿石进口国,为其钢铁工业提供原料。对铁矿石的必要检测不仅涉及总铁含量,还涉及硅、铝、镁和钙等次要元素。激光诱导击穿光谱(LIBS)具有原位和现场检测与分析能力,是高效、可靠和大规模传授检查所必需的。这种机遇同时也对该技术的定量分析能力提出了挑战。由于铁矿石在矿物学和化学成分方面的巨大差异,LIBS 分析中的基质效应是一个基本问题。在地质材料的 LIBS 分析中,机器学习方法被证明可以有效地处理基质效应,但需要在优化的数据处理管道中实施,并结合精心选择和测试的算法。在这项工作中,我们开发了极梯度提升(XGBoost)模型和反向传播神经网络(BPNN)模型的顺序连接组合,以有效拟合浓度和光谱之间的函数关系。该方法的特殊之处在于以顺序的方式拟合线性、低阶非线性和高阶非线性项,从而大幅提高模型的预测性能。该方法的开发和测试使用了大量收集的铁矿石样本,这些样本来自 8 个生产国和 35 个不同的商业品牌,在矿物学和化学成分方面具有广泛的基质。这些样品事先经过了传统实验室分析方法的特征描述,以确定参考元素浓度。实验室调查这一必要步骤的结果显示,总铁 (TFe)、二氧化硅 (SiO2)、氧化铝 (Al2O3)、氧化镁 (MgO) 和氧化钙 (CaO) 的预测均方根误差 (RMSEP) 分别为 0.407 wt%、0.372 wt%、0.085 wt%、0.131 wt% 和 0.114 wt%,显示了现场检测的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

A combination of XGBoost and neural network in LIBS spectrum processing for precise determination of critical elements in 620 iron ore samples of various origins

A combination of XGBoost and neural network in LIBS spectrum processing for precise determination of critical elements in 620 iron ore samples of various origins
China is the worldwide largest importer of iron ores for supplying its iron and steel industries. The necessary inspections of iron ores not only concern total iron content but also involve minor elements, such as silicium, aluminum, magnesium, and calcium. Laser-induced breakdown spectroscopy (LIBS) promises in situ and on-site detection and analysis capabilities, which is required for efficient, reliable, and large-scale impartation inspections. Such opportunity represents at the same time, a challenge for the technique in terms of the quantitative analysis ability. A fundamental issue corresponds to the matrix effect in LIBS analysis due to a large diversity of iron ores in terms of mineralogical and chemical compositions. Demonstrated as effective to deal with the matrix effect in LIBS analyses of geological materials, the machine learning approach needs to be implemented within an optimized data processing pipeline, combining algorithms carefully chosen and tested. In this work, we developed a sequentially connected combination of an extreme gradient boosting (XGBoost) model and a back propagation neural network (BPNN) model, to effectively fit the functional relation between the concentrations and the spectra. The specificity of the approach consists in fitting the linear, low-order nonlinear, and high-order nonlinear terms in a sequential way to substantially improve the model prediction performances. The method has been developed and tested with a significant set of 620 collected iron ore samples of eight production countries and 35 different commercial brands, with a wide range of matrices in terms of mineralogical and chemical compositions. The samples underwent prior characterization using conventional laboratory analysis methods to establish the reference elemental concentrations. The results from this necessary step of laboratory investigation, with root mean square error for prediction (RMSEP) of 0.407 wt%, 0.372 wt%, 0.085 wt%, 0.131 wt%, and 0.114 wt%, respectively for total iron (TFe), SiO2, Al2O3, MgO, and CaO, show the potential for on-site inspections.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
6.10
自引率
12.10%
发文量
173
审稿时长
81 days
期刊介绍: Spectrochimica Acta Part B: Atomic Spectroscopy, is intended for the rapid publication of both original work and reviews in the following fields: Atomic Emission (AES), Atomic Absorption (AAS) and Atomic Fluorescence (AFS) spectroscopy; Mass Spectrometry (MS) for inorganic analysis covering Spark Source (SS-MS), Inductively Coupled Plasma (ICP-MS), Glow Discharge (GD-MS), and Secondary Ion Mass Spectrometry (SIMS). Laser induced atomic spectroscopy for inorganic analysis, including non-linear optical laser spectroscopy, covering Laser Enhanced Ionization (LEI), Laser Induced Fluorescence (LIF), Resonance Ionization Spectroscopy (RIS) and Resonance Ionization Mass Spectrometry (RIMS); Laser Induced Breakdown Spectroscopy (LIBS); Cavity Ringdown Spectroscopy (CRDS), Laser Ablation Inductively Coupled Plasma Atomic Emission Spectroscopy (LA-ICP-AES) and Laser Ablation Inductively Coupled Plasma Mass Spectrometry (LA-ICP-MS). X-ray spectrometry, X-ray Optics and Microanalysis, including X-ray fluorescence spectrometry (XRF) and related techniques, in particular Total-reflection X-ray Fluorescence Spectrometry (TXRF), and Synchrotron Radiation-excited Total reflection XRF (SR-TXRF). Manuscripts dealing with (i) fundamentals, (ii) methodology development, (iii)instrumentation, and (iv) applications, can be submitted for publication.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信