从纯化合物拉曼光谱中分类新型芬太尼类似物的机器学习方法

IF 2.6 3区 医学 Q2 CHEMISTRY, ANALYTICAL
Phillip Koshute , N. Jordan Jameson , Nathan Hagan , David Lawrence , Adam Lanzarotta
{"title":"从纯化合物拉曼光谱中分类新型芬太尼类似物的机器学习方法","authors":"Phillip Koshute ,&nbsp;N. Jordan Jameson ,&nbsp;Nathan Hagan ,&nbsp;David Lawrence ,&nbsp;Adam Lanzarotta","doi":"10.1016/j.forc.2023.100506","DOIUrl":null,"url":null,"abstract":"<div><p>In previous research, we demonstrated the promise of detecting novel fentanyl analogs from mass spectra using machine learning models. This approach complements existing library matching methods and provides a key capability amid the recent sharp increase in abuse of fentanyl and its analogs. However, many applications rely upon portable devices such as Raman spectrometers, rather than mass spectrometers that are generally located in laboratories. In response, we adapted our models to Raman-based sensing, devising a machine learning approach for detecting novel fentanyl analogs from Raman spectra. Whereas mass spectra consist of well-defined discrete peaks, Raman spectra are continuous. To aid model development, we extracted features from each spectrum using smoothing, background subtraction, and principal component analysis (PCA). Additionally, we extracted features related to spectral peaks and similarity to spectra of known compounds; these features were guided by subject-matter expertise. We also used a third feature set that combined the features from PCA and from spectral peaks. With these three feature sets as inputs, we developed fentanyl analog classification models using various machine learning techniques. These techniques included multi-layer perceptron, neural network, partial least squares, penalized multinomial regression, random forest, regularized logistic regression, support vector machines, and extreme gradient boosting. We developed and tested our models using 320 Raman spectra of pure compounds, assessing performance via cross-validation. Models with PCA-based features performed better than those using expert-based features, achieving more than 90% probability of detection alongside less than 1% probability of false alarm. Although fentanyl compounds are often found with other components (e.g., cutting agents), some applications such as mail screening may encounter relatively pure fentanyl analogs. The results within this study suggest that our machine learning models are particularly promising for such applications.</p></div>","PeriodicalId":324,"journal":{"name":"Forensic Chemistry","volume":"34 ","pages":"Article 100506"},"PeriodicalIF":2.6000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Machine learning methods for classifying novel fentanyl analogs from Raman spectra of pure compounds\",\"authors\":\"Phillip Koshute ,&nbsp;N. Jordan Jameson ,&nbsp;Nathan Hagan ,&nbsp;David Lawrence ,&nbsp;Adam Lanzarotta\",\"doi\":\"10.1016/j.forc.2023.100506\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In previous research, we demonstrated the promise of detecting novel fentanyl analogs from mass spectra using machine learning models. This approach complements existing library matching methods and provides a key capability amid the recent sharp increase in abuse of fentanyl and its analogs. However, many applications rely upon portable devices such as Raman spectrometers, rather than mass spectrometers that are generally located in laboratories. In response, we adapted our models to Raman-based sensing, devising a machine learning approach for detecting novel fentanyl analogs from Raman spectra. Whereas mass spectra consist of well-defined discrete peaks, Raman spectra are continuous. To aid model development, we extracted features from each spectrum using smoothing, background subtraction, and principal component analysis (PCA). Additionally, we extracted features related to spectral peaks and similarity to spectra of known compounds; these features were guided by subject-matter expertise. We also used a third feature set that combined the features from PCA and from spectral peaks. With these three feature sets as inputs, we developed fentanyl analog classification models using various machine learning techniques. These techniques included multi-layer perceptron, neural network, partial least squares, penalized multinomial regression, random forest, regularized logistic regression, support vector machines, and extreme gradient boosting. We developed and tested our models using 320 Raman spectra of pure compounds, assessing performance via cross-validation. Models with PCA-based features performed better than those using expert-based features, achieving more than 90% probability of detection alongside less than 1% probability of false alarm. Although fentanyl compounds are often found with other components (e.g., cutting agents), some applications such as mail screening may encounter relatively pure fentanyl analogs. The results within this study suggest that our machine learning models are particularly promising for such applications.</p></div>\",\"PeriodicalId\":324,\"journal\":{\"name\":\"Forensic Chemistry\",\"volume\":\"34 \",\"pages\":\"Article 100506\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Forensic Chemistry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2468170923000425\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Forensic Chemistry","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468170923000425","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 1

摘要

在之前的研究中,我们展示了使用机器学习模型从质谱中检测新型芬太尼类似物的前景。这种方法补充了现有的库匹配方法,并在最近芬太尼及其类似物滥用急剧增加的情况下提供了关键能力。然而,许多应用依赖于便携式设备,如拉曼光谱仪,而不是通常位于实验室的质谱仪。作为回应,我们将我们的模型调整为基于拉曼的传感,设计了一种机器学习方法来从拉曼光谱中检测新型芬太尼类似物。质谱由定义明确的离散峰组成,而拉曼光谱是连续的。为了帮助模型开发,我们使用平滑、背景减除和主成分分析(PCA)从每个光谱中提取特征。此外,我们提取了与已知化合物的光谱峰和相似度相关的特征;这些特性是由主题专业知识指导的。我们还使用了第三个特征集,该特征集结合了PCA和光谱峰的特征。以这三个特征集为输入,我们使用各种机器学习技术开发了芬太尼类似物分类模型。这些技术包括多层感知器、神经网络、偏最小二乘、惩罚多项式回归、随机森林、正则化逻辑回归、支持向量机和极端梯度增强。我们使用纯化合物的320拉曼光谱开发和测试了我们的模型,并通过交叉验证评估了性能。使用基于pca的特征的模型比使用基于专家的特征的模型表现更好,实现了超过90%的检测概率和不到1%的误报概率。虽然芬太尼化合物经常与其他成分(例如,切割剂)一起发现,但某些应用,例如邮件筛选,可能会遇到相对纯的芬太尼类似物。这项研究的结果表明,我们的机器学习模型在这类应用中特别有前途。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Machine learning methods for classifying novel fentanyl analogs from Raman spectra of pure compounds

Machine learning methods for classifying novel fentanyl analogs from Raman spectra of pure compounds

In previous research, we demonstrated the promise of detecting novel fentanyl analogs from mass spectra using machine learning models. This approach complements existing library matching methods and provides a key capability amid the recent sharp increase in abuse of fentanyl and its analogs. However, many applications rely upon portable devices such as Raman spectrometers, rather than mass spectrometers that are generally located in laboratories. In response, we adapted our models to Raman-based sensing, devising a machine learning approach for detecting novel fentanyl analogs from Raman spectra. Whereas mass spectra consist of well-defined discrete peaks, Raman spectra are continuous. To aid model development, we extracted features from each spectrum using smoothing, background subtraction, and principal component analysis (PCA). Additionally, we extracted features related to spectral peaks and similarity to spectra of known compounds; these features were guided by subject-matter expertise. We also used a third feature set that combined the features from PCA and from spectral peaks. With these three feature sets as inputs, we developed fentanyl analog classification models using various machine learning techniques. These techniques included multi-layer perceptron, neural network, partial least squares, penalized multinomial regression, random forest, regularized logistic regression, support vector machines, and extreme gradient boosting. We developed and tested our models using 320 Raman spectra of pure compounds, assessing performance via cross-validation. Models with PCA-based features performed better than those using expert-based features, achieving more than 90% probability of detection alongside less than 1% probability of false alarm. Although fentanyl compounds are often found with other components (e.g., cutting agents), some applications such as mail screening may encounter relatively pure fentanyl analogs. The results within this study suggest that our machine learning models are particularly promising for such applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Forensic Chemistry
Forensic Chemistry CHEMISTRY, ANALYTICAL-
CiteScore
5.70
自引率
14.80%
发文量
65
审稿时长
46 days
期刊介绍: Forensic Chemistry publishes high quality manuscripts focusing on the theory, research and application of any chemical science to forensic analysis. The scope of the journal includes fundamental advancements that result in a better understanding of the evidentiary significance derived from the physical and chemical analysis of materials. The scope of Forensic Chemistry will also include the application and or development of any molecular and atomic spectrochemical technique, electrochemical techniques, sensors, surface characterization techniques, mass spectrometry, nuclear magnetic resonance, chemometrics and statistics, and separation sciences (e.g. chromatography) that provide insight into the forensic analysis of materials. Evidential topics of interest to the journal include, but are not limited to, fingerprint analysis, drug analysis, ignitable liquid residue analysis, explosives detection and analysis, the characterization and comparison of trace evidence (glass, fibers, paints and polymers, tapes, soils and other materials), ink and paper analysis, gunshot residue analysis, synthetic pathways for drugs, toxicology and the analysis and chemistry associated with the components of fingermarks. The journal is particularly interested in receiving manuscripts that report advances in the forensic interpretation of chemical evidence. Technology Readiness Level: When submitting an article to Forensic Chemistry, all authors will be asked to self-assign a Technology Readiness Level (TRL) to their article. The purpose of the TRL system is to help readers understand the level of maturity of an idea or method, to help track the evolution of readiness of a given technique or method, and to help filter published articles by the expected ease of implementation in an operation setting within a crime lab.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信