Rapid detection and quantification of atmospheric heavy metal deposition on plant leaves using machine learning-enhanced NIR spectroscopy

IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION
Zhengwei Huang , Yulong Guo , Qianqian Sheng , Chun Li , Ling Jiang , Zunling Zhu
{"title":"Rapid detection and quantification of atmospheric heavy metal deposition on plant leaves using machine learning-enhanced NIR spectroscopy","authors":"Zhengwei Huang ,&nbsp;Yulong Guo ,&nbsp;Qianqian Sheng ,&nbsp;Chun Li ,&nbsp;Ling Jiang ,&nbsp;Zunling Zhu","doi":"10.1016/j.infrared.2025.106200","DOIUrl":null,"url":null,"abstract":"<div><div>Plant leaves serve as effective bioindicators for monitoring atmospheric heavy metal pollution because of their efficient adsorption and accumulation of airborne contaminants. This study presents an enhanced integration of near-infrared (NIR) spectroscopy with advanced machine learning algorithms for rapid, nondestructive detection of atmospheric heavy metals in <em>Ligustrum japonicum</em> ’Howardii’ leaves. We developed a comprehensive analytical framework incorporating 15 preprocessing methods, six advanced feature selection algorithms, and state-of-the-art machine learning models. The NIR spectra (780–2,500 nm) revealed characteristic absorption features in the 900–1,000, 1,400–1,500, and 1,900–2,000 nm regions corresponding to heavy metal-induced physiological changes in chlorophyll, water-protein structures, and metabolites, respectively. The proposed framework employed systematic optimization strategies to test 450 unique preprocessing–feature selection model combinations through rigorous validation protocols and systematic noise robustness testing. For qualitative classification, the optimal combination of baseline correction preprocessing, principal component analysis-based feature selection, and logistic regression achieved perfect discrimination accuracy (100.0 % Leave-One-Out Cross-Validation) across all three heavy metal types (lead, chromium, and nickel), maintaining exceptional noise tolerance with &gt;94 % accuracy under 20 % noise conditions. For quantitative analysis, metal-specific optimization strategies yielded superior performance, achieving R<sup>2</sup> values exceeding 0.83 across all three heavy metals under study, with chromium and nickel surpassing 0.88. The proposed enhanced methodology demonstrated substantial improvements over traditional single-method approaches while providing mechanistic insights into heavy metal-plant interactions suitable for regulatory applications and automated environmental monitoring systems.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"152 ","pages":"Article 106200"},"PeriodicalIF":3.4000,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525004931","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0

Abstract

Plant leaves serve as effective bioindicators for monitoring atmospheric heavy metal pollution because of their efficient adsorption and accumulation of airborne contaminants. This study presents an enhanced integration of near-infrared (NIR) spectroscopy with advanced machine learning algorithms for rapid, nondestructive detection of atmospheric heavy metals in Ligustrum japonicum ’Howardii’ leaves. We developed a comprehensive analytical framework incorporating 15 preprocessing methods, six advanced feature selection algorithms, and state-of-the-art machine learning models. The NIR spectra (780–2,500 nm) revealed characteristic absorption features in the 900–1,000, 1,400–1,500, and 1,900–2,000 nm regions corresponding to heavy metal-induced physiological changes in chlorophyll, water-protein structures, and metabolites, respectively. The proposed framework employed systematic optimization strategies to test 450 unique preprocessing–feature selection model combinations through rigorous validation protocols and systematic noise robustness testing. For qualitative classification, the optimal combination of baseline correction preprocessing, principal component analysis-based feature selection, and logistic regression achieved perfect discrimination accuracy (100.0 % Leave-One-Out Cross-Validation) across all three heavy metal types (lead, chromium, and nickel), maintaining exceptional noise tolerance with >94 % accuracy under 20 % noise conditions. For quantitative analysis, metal-specific optimization strategies yielded superior performance, achieving R2 values exceeding 0.83 across all three heavy metals under study, with chromium and nickel surpassing 0.88. The proposed enhanced methodology demonstrated substantial improvements over traditional single-method approaches while providing mechanistic insights into heavy metal-plant interactions suitable for regulatory applications and automated environmental monitoring systems.
利用机器学习增强近红外光谱快速检测和定量植物叶片上大气重金属沉积
植物叶片对大气中重金属的有效吸附和积累,是监测大气重金属污染的有效生物指标。本研究提出了一种增强的近红外(NIR)光谱与先进的机器学习算法的集成,用于快速、无损地检测日本紫荆‘ Howardii ’叶片中的大气重金属。我们开发了一个综合分析框架,包括15种预处理方法,6种先进的特征选择算法和最先进的机器学习模型。近红外光谱(780 ~ 2500 nm)显示了重金属诱导叶绿素、水蛋白结构和代谢物生理变化的900 ~ 1000、1400 ~ 1500和1900 ~ 2000 nm区域的特征吸收特征。该框架采用系统优化策略,通过严格的验证协议和系统的噪声鲁棒性测试,测试了450种独特的预处理特征选择模型组合。对于定性分类,基线校正预处理、基于主成分分析的特征选择和逻辑回归的最佳组合在所有三种重金属类型(铅、铬和镍)中实现了完美的识别精度(100.0%的留一交叉验证),在20%的噪声条件下保持了优异的噪声容忍度,准确度为94%。在定量分析中,针对金属的优化策略产生了优异的性能,在所研究的三种重金属中,R2值均超过0.83,其中铬和镍超过0.88。与传统的单一方法相比,提出的增强方法有了实质性的改进,同时提供了适用于监管应用和自动化环境监测系统的重金属与植物相互作用的机理见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.70
自引率
12.10%
发文量
400
审稿时长
67 days
期刊介绍: The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region. Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine. Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信