Optimized Machine Learning Approaches to Combining Surface-Enhanced Raman Scattering and Infrared Data for Trace Detection of Xylazine in Illicit Opioids
Rebecca Martens, Lea Gozdzialski, Ella Newman, Bruce Wallace, Chris Gill, Dennis Kumar Hore
{"title":"Optimized Machine Learning Approaches to Combining Surface-Enhanced Raman Scattering and Infrared Data for Trace Detection of Xylazine in Illicit Opioids","authors":"Rebecca Martens, Lea Gozdzialski, Ella Newman, Bruce Wallace, Chris Gill, Dennis Kumar Hore","doi":"10.1039/d4an01496k","DOIUrl":null,"url":null,"abstract":"Infrared absorption spectroscopy and surface-enhanced Raman spectroscopy were integrated into three data fusion strategies—hybrid (concatenated spectra), mid-level (extracted features from both datasets) and high-level (fusion of predictions from both models)—to enhance the predictive accuracy for xylazine detection in illicit opioid samples. Three chemometric approaches—random forest, support vector machine, and k-nearest neighbor algorithms—were employed and optimized using a 5-fold cross-validation grid search for all fusion strategies. Validation results identified the random forest classifier as the optimal model for all fusion strategies, achieving high sensitivity (88% for hybrid, 92% for mid-level, and 96% for high-level) and specificity (88% for hybrid, mid-level, and high-level). The enhanced performance of the high-level fusion approach (F1 Score of 92%) is demonstrated, effectively leveraging the surface-enhanced Raman data with a 90\\% voting weight, without compromising prediction accuracy (92%) when combined with infrared spectral data. This highlights the viability of a multi-instrument approach using data fusion and random forest classification to improve the detection of various components in complex opioid samples in a point-of-care setting.","PeriodicalId":63,"journal":{"name":"Analyst","volume":"30 1","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analyst","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1039/d4an01496k","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Infrared absorption spectroscopy and surface-enhanced Raman spectroscopy were integrated into three data fusion strategies—hybrid (concatenated spectra), mid-level (extracted features from both datasets) and high-level (fusion of predictions from both models)—to enhance the predictive accuracy for xylazine detection in illicit opioid samples. Three chemometric approaches—random forest, support vector machine, and k-nearest neighbor algorithms—were employed and optimized using a 5-fold cross-validation grid search for all fusion strategies. Validation results identified the random forest classifier as the optimal model for all fusion strategies, achieving high sensitivity (88% for hybrid, 92% for mid-level, and 96% for high-level) and specificity (88% for hybrid, mid-level, and high-level). The enhanced performance of the high-level fusion approach (F1 Score of 92%) is demonstrated, effectively leveraging the surface-enhanced Raman data with a 90\% voting weight, without compromising prediction accuracy (92%) when combined with infrared spectral data. This highlights the viability of a multi-instrument approach using data fusion and random forest classification to improve the detection of various components in complex opioid samples in a point-of-care setting.