使用峰敏感逻辑回归的光学光谱机器学习解释

IF 15.8 1区 材料科学 Q1 CHEMISTRY, MULTIDISCIPLINARY
ACS Nano Pub Date : 2025-04-15 DOI:10.1021/acsnano.4c16037
Ziyang Wang, Jeewan C. Ranasinghe, Wenjing Wu, Dennis C. Y. Chan, Ashley Gomm, Rudolph E. Tanzi, Can Zhang, Nanyin Zhang, Genevera I. Allen, Shengxi Huang
{"title":"使用峰敏感逻辑回归的光学光谱机器学习解释","authors":"Ziyang Wang, Jeewan C. Ranasinghe, Wenjing Wu, Dennis C. Y. Chan, Ashley Gomm, Rudolph E. Tanzi, Can Zhang, Nanyin Zhang, Genevera I. Allen, Shengxi Huang","doi":"10.1021/acsnano.4c16037","DOIUrl":null,"url":null,"abstract":"Optical spectroscopy, a noninvasive molecular sensing technique, offers valuable insights into material characterization, molecule identification, and biosample analysis. Despite the informativeness of high-dimensional optical spectra, their interpretation remains a challenge. Machine learning methods have gained prominence in spectral analyses, efficiently unveiling analyte compositions. However, these methods still face challenges in interpretability, particularly in generating clear feature importance maps that highlight the spectral features specific to each class of data. These limitations arise from feature noise, model complexity, and the lack of optimization for spectroscopy. In this work, we introduce a machine learning algorithm─logistic regression with peak-sensitive elastic-net regularization (PSE-LR)─tailored for spectral analysis. PSE-LR enables classification and interpretability by producing a peak-sensitive feature importance map, achieving an F1-score of 0.93 and a feature sensitivity of 1.0. Its performance is compared with other methods, including k-nearest neighbors (KNN), elastic-net logistic regression (E-LR), support vector machine (SVM), principal component analysis followed by linear discriminant analysis (PCA-LDA), XGBoost, and neural network (NN). Applying PSE-LR to Raman and photoluminescence (PL) spectra, we detected the receptor-binding domain (RBD) of SARS-CoV-2 spike protein in ultralow concentrations, identified neuroprotective solution (NPS) in brain samples, recognized WS<sub>2</sub> monolayer and WSe<sub>2</sub>/WS<sub>2</sub> heterobilayer, analyzed Alzheimer’s disease (AD) brains, and suggested potential disease biomarkers. Our findings demonstrate PSE-LR’s utility in detecting subtle spectral features and generating interpretable feature importance maps. It is beneficial for the spectral characterization of materials, molecules, and biosamples and applicable to other spectroscopic methods. This work also facilitates the development of nanodevices such as nanosensors and miniaturized spectrometers based on nanomaterials.","PeriodicalId":21,"journal":{"name":"ACS Nano","volume":"4 1","pages":""},"PeriodicalIF":15.8000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Interpretation of Optical Spectroscopy Using Peak-Sensitive Logistic Regression\",\"authors\":\"Ziyang Wang, Jeewan C. Ranasinghe, Wenjing Wu, Dennis C. Y. Chan, Ashley Gomm, Rudolph E. Tanzi, Can Zhang, Nanyin Zhang, Genevera I. Allen, Shengxi Huang\",\"doi\":\"10.1021/acsnano.4c16037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Optical spectroscopy, a noninvasive molecular sensing technique, offers valuable insights into material characterization, molecule identification, and biosample analysis. Despite the informativeness of high-dimensional optical spectra, their interpretation remains a challenge. Machine learning methods have gained prominence in spectral analyses, efficiently unveiling analyte compositions. However, these methods still face challenges in interpretability, particularly in generating clear feature importance maps that highlight the spectral features specific to each class of data. These limitations arise from feature noise, model complexity, and the lack of optimization for spectroscopy. In this work, we introduce a machine learning algorithm─logistic regression with peak-sensitive elastic-net regularization (PSE-LR)─tailored for spectral analysis. PSE-LR enables classification and interpretability by producing a peak-sensitive feature importance map, achieving an F1-score of 0.93 and a feature sensitivity of 1.0. Its performance is compared with other methods, including k-nearest neighbors (KNN), elastic-net logistic regression (E-LR), support vector machine (SVM), principal component analysis followed by linear discriminant analysis (PCA-LDA), XGBoost, and neural network (NN). Applying PSE-LR to Raman and photoluminescence (PL) spectra, we detected the receptor-binding domain (RBD) of SARS-CoV-2 spike protein in ultralow concentrations, identified neuroprotective solution (NPS) in brain samples, recognized WS<sub>2</sub> monolayer and WSe<sub>2</sub>/WS<sub>2</sub> heterobilayer, analyzed Alzheimer’s disease (AD) brains, and suggested potential disease biomarkers. Our findings demonstrate PSE-LR’s utility in detecting subtle spectral features and generating interpretable feature importance maps. It is beneficial for the spectral characterization of materials, molecules, and biosamples and applicable to other spectroscopic methods. This work also facilitates the development of nanodevices such as nanosensors and miniaturized spectrometers based on nanomaterials.\",\"PeriodicalId\":21,\"journal\":{\"name\":\"ACS Nano\",\"volume\":\"4 1\",\"pages\":\"\"},\"PeriodicalIF\":15.8000,\"publicationDate\":\"2025-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Nano\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://doi.org/10.1021/acsnano.4c16037\",\"RegionNum\":1,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Nano","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1021/acsnano.4c16037","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

光谱学是一种非侵入性分子传感技术,为材料表征、分子鉴定和生物样品分析提供了有价值的见解。尽管高维光谱信息丰富,但其解释仍然是一个挑战。机器学习方法在光谱分析中获得了突出地位,有效地揭示了分析物的成分。然而,这些方法在可解释性方面仍然面临挑战,特别是在生成清晰的特征重要性图时,突出了每一类数据的光谱特征。这些限制来自于特征噪声、模型复杂性和缺乏对光谱的优化。在这项工作中,我们介绍了一种为光谱分析量身定制的机器学习算法──带有峰值敏感弹性网络正则化(PSE-LR)的逻辑回归。PSE-LR通过生成峰值敏感特征重要性图来实现分类和可解释性,其f1得分为0.93,特征灵敏度为1.0。比较了k近邻(KNN)、弹性网络逻辑回归(E-LR)、支持向量机(SVM)、主成分分析后线性判别分析(PCA-LDA)、XGBoost和神经网络(NN)等方法的性能。利用PSE-LR结合拉曼光谱和光致发光(PL)光谱,检测了SARS-CoV-2刺突蛋白的超低浓度受体结合域(RBD),鉴定了脑样品中的神经保护液(NPS),识别了WS2单分子层和WSe2/WS2异双分子层,分析了阿尔茨海默病(AD)脑,并提出了潜在的疾病生物标志物。我们的研究结果证明了PSE-LR在检测细微光谱特征和生成可解释特征重要性图方面的实用性。它有利于材料、分子和生物样品的光谱表征,并适用于其他光谱方法。这项工作也促进了纳米器件的发展,如纳米传感器和基于纳米材料的小型化光谱仪。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Machine Learning Interpretation of Optical Spectroscopy Using Peak-Sensitive Logistic Regression

Machine Learning Interpretation of Optical Spectroscopy Using Peak-Sensitive Logistic Regression
Optical spectroscopy, a noninvasive molecular sensing technique, offers valuable insights into material characterization, molecule identification, and biosample analysis. Despite the informativeness of high-dimensional optical spectra, their interpretation remains a challenge. Machine learning methods have gained prominence in spectral analyses, efficiently unveiling analyte compositions. However, these methods still face challenges in interpretability, particularly in generating clear feature importance maps that highlight the spectral features specific to each class of data. These limitations arise from feature noise, model complexity, and the lack of optimization for spectroscopy. In this work, we introduce a machine learning algorithm─logistic regression with peak-sensitive elastic-net regularization (PSE-LR)─tailored for spectral analysis. PSE-LR enables classification and interpretability by producing a peak-sensitive feature importance map, achieving an F1-score of 0.93 and a feature sensitivity of 1.0. Its performance is compared with other methods, including k-nearest neighbors (KNN), elastic-net logistic regression (E-LR), support vector machine (SVM), principal component analysis followed by linear discriminant analysis (PCA-LDA), XGBoost, and neural network (NN). Applying PSE-LR to Raman and photoluminescence (PL) spectra, we detected the receptor-binding domain (RBD) of SARS-CoV-2 spike protein in ultralow concentrations, identified neuroprotective solution (NPS) in brain samples, recognized WS2 monolayer and WSe2/WS2 heterobilayer, analyzed Alzheimer’s disease (AD) brains, and suggested potential disease biomarkers. Our findings demonstrate PSE-LR’s utility in detecting subtle spectral features and generating interpretable feature importance maps. It is beneficial for the spectral characterization of materials, molecules, and biosamples and applicable to other spectroscopic methods. This work also facilitates the development of nanodevices such as nanosensors and miniaturized spectrometers based on nanomaterials.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ACS Nano
ACS Nano 工程技术-材料科学:综合
CiteScore
26.00
自引率
4.10%
发文量
1627
审稿时长
1.7 months
期刊介绍: ACS Nano, published monthly, serves as an international forum for comprehensive articles on nanoscience and nanotechnology research at the intersections of chemistry, biology, materials science, physics, and engineering. The journal fosters communication among scientists in these communities, facilitating collaboration, new research opportunities, and advancements through discoveries. ACS Nano covers synthesis, assembly, characterization, theory, and simulation of nanostructures, nanobiotechnology, nanofabrication, methods and tools for nanoscience and nanotechnology, and self- and directed-assembly. Alongside original research articles, it offers thorough reviews, perspectives on cutting-edge research, and discussions envisioning the future of nanoscience and nanotechnology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信