A robust support vector machine approach for Raman data classification

Marco Piazza , Andrea Spinelli , Francesca Maggioni , Marzia Bedoni , Enza Messina
{"title":"A robust support vector machine approach for Raman data classification","authors":"Marco Piazza ,&nbsp;Andrea Spinelli ,&nbsp;Francesca Maggioni ,&nbsp;Marzia Bedoni ,&nbsp;Enza Messina","doi":"10.1016/j.dajour.2025.100595","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in healthcare technologies have led to the availability of large amounts of biological samples across several techniques and applications. In particular, in the last few years, <em>Raman spectroscopy</em> analysis of biological samples has been successfully applied for early-stage diagnosis. However, spectra’s inherent complexity and variability make the manual analysis challenging, even for domain experts. For the same reason, the use of traditional <em>Statistical Learning</em> and <em>Machine Learning</em> techniques could not guarantee for accurate and reliable results. Machine learning models, combined with robust optimization techniques, offer the possibility to improve the classification accuracy and enhance the resilience of predictive models under data uncertainty. In this paper, we investigate the performance of a novel robust formulation for <em>Support Vector Machine</em> (SVM) in classifying COVID-19 samples obtained from Raman spectroscopy. Given the noisy and perturbed nature of biological samples, we protect the classification process against uncertainty through the application of robust optimization techniques. Specifically, we consider the robust counterparts of deterministic SVM formulations using bounded-by-norm uncertainty sets. We explore the cases of both linear and kernel-induced classifiers, addressing binary and multiclass classification tasks. The effectiveness of our approach is evaluated on real-world COVID-19 Raman saliva samples provided by Italian hospitals. We assess the performance of the proposed method by comparing the results of our numerical experiments with those of a state-of-the-art classifier, showing the potential of robust classifiers in handling uncertain Raman data.</div></div>","PeriodicalId":100357,"journal":{"name":"Decision Analytics Journal","volume":"16 ","pages":"Article 100595"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Analytics Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772662225000517","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recent advances in healthcare technologies have led to the availability of large amounts of biological samples across several techniques and applications. In particular, in the last few years, Raman spectroscopy analysis of biological samples has been successfully applied for early-stage diagnosis. However, spectra’s inherent complexity and variability make the manual analysis challenging, even for domain experts. For the same reason, the use of traditional Statistical Learning and Machine Learning techniques could not guarantee for accurate and reliable results. Machine learning models, combined with robust optimization techniques, offer the possibility to improve the classification accuracy and enhance the resilience of predictive models under data uncertainty. In this paper, we investigate the performance of a novel robust formulation for Support Vector Machine (SVM) in classifying COVID-19 samples obtained from Raman spectroscopy. Given the noisy and perturbed nature of biological samples, we protect the classification process against uncertainty through the application of robust optimization techniques. Specifically, we consider the robust counterparts of deterministic SVM formulations using bounded-by-norm uncertainty sets. We explore the cases of both linear and kernel-induced classifiers, addressing binary and multiclass classification tasks. The effectiveness of our approach is evaluated on real-world COVID-19 Raman saliva samples provided by Italian hospitals. We assess the performance of the proposed method by comparing the results of our numerical experiments with those of a state-of-the-art classifier, showing the potential of robust classifiers in handling uncertain Raman data.
拉曼数据分类的鲁棒支持向量机方法
医疗保健技术的最新进展导致了多种技术和应用中大量生物样品的可用性。特别是近年来,生物样品的拉曼光谱分析已成功应用于早期诊断。然而,光谱固有的复杂性和可变性使得手工分析具有挑战性,即使对领域专家也是如此。出于同样的原因,使用传统的统计学习和机器学习技术也不能保证准确可靠的结果。机器学习模型与鲁棒优化技术相结合,为提高分类精度和增强预测模型在数据不确定性下的弹性提供了可能。在本文中,我们研究了一种新的鲁棒支持向量机(SVM)公式在分类从拉曼光谱获得的COVID-19样本中的性能。考虑到生物样本的噪声和扰动性质,我们通过应用鲁棒优化技术来保护分类过程免受不确定性的影响。具体来说,我们考虑了使用范数有界不确定性集的确定性支持向量机公式的鲁棒对应物。我们探讨了线性和核诱导分类器的情况,解决了二进制和多类分类任务。通过意大利医院提供的实际COVID-19拉曼唾液样本评估了我们方法的有效性。我们通过比较我们的数值实验结果与最先进的分类器的结果来评估所提出方法的性能,显示了鲁棒分类器在处理不确定拉曼数据方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.90
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信