Urine-based Raman markers for prostate cancer diagnosis: A machine learning approach using fingerprint and lipid spectral region

IF 4.6 2区 化学 Q1 SPECTROSCOPY
Przemysław Mitura , Wiesław Paja , Grzegorz Młynarczyk , Radosław Kowalski , Krzysztof Bar , Joanna Depciuch
{"title":"Urine-based Raman markers for prostate cancer diagnosis: A machine learning approach using fingerprint and lipid spectral region","authors":"Przemysław Mitura ,&nbsp;Wiesław Paja ,&nbsp;Grzegorz Młynarczyk ,&nbsp;Radosław Kowalski ,&nbsp;Krzysztof Bar ,&nbsp;Joanna Depciuch","doi":"10.1016/j.saa.2025.126661","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigates the potential of Raman spectroscopy in distinguishing between healthy individuals and prostate cancer patients using urine samples. The Boruta algorithm was applied to Raman spectral data in two distinct wavenumber regions: 800–1800 cm<sup>−1</sup> (fingerprint region) and 2800–3000 cm<sup>−1</sup> (lipid region). The algorithm identified important spectral features from both regions that were used to construct decision trees for classification. Key wavenumbers in the fingerprint region (1009 cm<sup>−1</sup>) and high-wavenumber region (2937 cm<sup>−1</sup>) were found to be significant markers for prostate cancer detection. Principal Component Analysis (PCA) revealed that the intensity of these markers effectively separated healthy and cancerous samples, with the 1009 cm<sup>−1</sup> marker showing higher discriminative power. Furthermore, four classification models: Decision Tree (DT), k-Nearest Neighbors (kNN), Random Forest (RF), and Support Vector Machine (SVM) were evaluated for their performance in classifying urine samples based on Raman spectral features. The RF and kNN models exhibited the best overall performance, with high accuracy and sensitivity, particularly in the 800–1800 cm<sup>−1</sup> region. The study also explored the correlation between Raman markers and clinical parameters, finding that the 2937 cm<sup>−1</sup> marker had strong correlations with critical clinical variables like Gleason scores and MRI PIRADS scores, suggesting its utility for prostate cancer diagnosis and staging. These findings highlight the potential of Raman spectroscopy as a non-invasive tool for prostate cancer detection and monitoring.</div></div>","PeriodicalId":433,"journal":{"name":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","volume":"344 ","pages":"Article 126661"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386142525009680","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0

Abstract

This study investigates the potential of Raman spectroscopy in distinguishing between healthy individuals and prostate cancer patients using urine samples. The Boruta algorithm was applied to Raman spectral data in two distinct wavenumber regions: 800–1800 cm−1 (fingerprint region) and 2800–3000 cm−1 (lipid region). The algorithm identified important spectral features from both regions that were used to construct decision trees for classification. Key wavenumbers in the fingerprint region (1009 cm−1) and high-wavenumber region (2937 cm−1) were found to be significant markers for prostate cancer detection. Principal Component Analysis (PCA) revealed that the intensity of these markers effectively separated healthy and cancerous samples, with the 1009 cm−1 marker showing higher discriminative power. Furthermore, four classification models: Decision Tree (DT), k-Nearest Neighbors (kNN), Random Forest (RF), and Support Vector Machine (SVM) were evaluated for their performance in classifying urine samples based on Raman spectral features. The RF and kNN models exhibited the best overall performance, with high accuracy and sensitivity, particularly in the 800–1800 cm−1 region. The study also explored the correlation between Raman markers and clinical parameters, finding that the 2937 cm−1 marker had strong correlations with critical clinical variables like Gleason scores and MRI PIRADS scores, suggesting its utility for prostate cancer diagnosis and staging. These findings highlight the potential of Raman spectroscopy as a non-invasive tool for prostate cancer detection and monitoring.

Abstract Image

基于尿液的前列腺癌拉曼标记物诊断:使用指纹和脂质光谱区域的机器学习方法
本研究探讨了拉曼光谱在使用尿液样本区分健康个体和前列腺癌患者方面的潜力。将Boruta算法应用于两个不同波数区域的拉曼光谱数据:800-1800 cm−1(指纹区)和2800-3000 cm−1(脂区)。该算法从两个区域中识别出重要的光谱特征,用于构建用于分类的决策树。指纹区域的关键波数(1009 cm−1)和高波数区域(2937 cm−1)是前列腺癌检测的重要标志。主成分分析(PCA)表明,这些标记的强度可以有效地区分健康和癌样,其中1009 cm−1标记具有更高的判别能力。此外,对决策树(DT)、k近邻(kNN)、随机森林(RF)和支持向量机(SVM)四种分类模型在基于拉曼光谱特征的尿液样本分类中的性能进行了评估。RF和kNN模型表现出最好的综合性能,具有较高的精度和灵敏度,特别是在800-1800 cm−1区域。该研究还探讨了拉曼标记物与临床参数之间的相关性,发现2937 cm−1标记物与Gleason评分和MRI PIRADS评分等关键临床变量具有很强的相关性,表明其在前列腺癌诊断和分期中的实用性。这些发现突出了拉曼光谱作为前列腺癌检测和监测的非侵入性工具的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.40
自引率
11.40%
发文量
1364
审稿时长
40 days
期刊介绍: Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy (SAA) is an interdisciplinary journal which spans from basic to applied aspects of optical spectroscopy in chemistry, medicine, biology, and materials science. The journal publishes original scientific papers that feature high-quality spectroscopic data and analysis. From the broad range of optical spectroscopies, the emphasis is on electronic, vibrational or rotational spectra of molecules, rather than on spectroscopy based on magnetic moments. Criteria for publication in SAA are novelty, uniqueness, and outstanding quality. Routine applications of spectroscopic techniques and computational methods are not appropriate. Topics of particular interest of Spectrochimica Acta Part A include, but are not limited to: Spectroscopy and dynamics of bioanalytical, biomedical, environmental, and atmospheric sciences, Novel experimental techniques or instrumentation for molecular spectroscopy, Novel theoretical and computational methods, Novel applications in photochemistry and photobiology, Novel interpretational approaches as well as advances in data analysis based on electronic or vibrational spectroscopy.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信