Przemysław Mitura , Wiesław Paja , Grzegorz Młynarczyk , Radosław Kowalski , Krzysztof Bar , Joanna Depciuch
{"title":"基于尿液的前列腺癌拉曼标记物诊断:使用指纹和脂质光谱区域的机器学习方法","authors":"Przemysław Mitura , Wiesław Paja , Grzegorz Młynarczyk , Radosław Kowalski , Krzysztof Bar , Joanna Depciuch","doi":"10.1016/j.saa.2025.126661","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigates the potential of Raman spectroscopy in distinguishing between healthy individuals and prostate cancer patients using urine samples. The Boruta algorithm was applied to Raman spectral data in two distinct wavenumber regions: 800–1800 cm<sup>−1</sup> (fingerprint region) and 2800–3000 cm<sup>−1</sup> (lipid region). The algorithm identified important spectral features from both regions that were used to construct decision trees for classification. Key wavenumbers in the fingerprint region (1009 cm<sup>−1</sup>) and high-wavenumber region (2937 cm<sup>−1</sup>) were found to be significant markers for prostate cancer detection. Principal Component Analysis (PCA) revealed that the intensity of these markers effectively separated healthy and cancerous samples, with the 1009 cm<sup>−1</sup> marker showing higher discriminative power. Furthermore, four classification models: Decision Tree (DT), k-Nearest Neighbors (kNN), Random Forest (RF), and Support Vector Machine (SVM) were evaluated for their performance in classifying urine samples based on Raman spectral features. The RF and kNN models exhibited the best overall performance, with high accuracy and sensitivity, particularly in the 800–1800 cm<sup>−1</sup> region. The study also explored the correlation between Raman markers and clinical parameters, finding that the 2937 cm<sup>−1</sup> marker had strong correlations with critical clinical variables like Gleason scores and MRI PIRADS scores, suggesting its utility for prostate cancer diagnosis and staging. These findings highlight the potential of Raman spectroscopy as a non-invasive tool for prostate cancer detection and monitoring.</div></div>","PeriodicalId":433,"journal":{"name":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","volume":"344 ","pages":"Article 126661"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Urine-based Raman markers for prostate cancer diagnosis: A machine learning approach using fingerprint and lipid spectral region\",\"authors\":\"Przemysław Mitura , Wiesław Paja , Grzegorz Młynarczyk , Radosław Kowalski , Krzysztof Bar , Joanna Depciuch\",\"doi\":\"10.1016/j.saa.2025.126661\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study investigates the potential of Raman spectroscopy in distinguishing between healthy individuals and prostate cancer patients using urine samples. The Boruta algorithm was applied to Raman spectral data in two distinct wavenumber regions: 800–1800 cm<sup>−1</sup> (fingerprint region) and 2800–3000 cm<sup>−1</sup> (lipid region). The algorithm identified important spectral features from both regions that were used to construct decision trees for classification. Key wavenumbers in the fingerprint region (1009 cm<sup>−1</sup>) and high-wavenumber region (2937 cm<sup>−1</sup>) were found to be significant markers for prostate cancer detection. Principal Component Analysis (PCA) revealed that the intensity of these markers effectively separated healthy and cancerous samples, with the 1009 cm<sup>−1</sup> marker showing higher discriminative power. Furthermore, four classification models: Decision Tree (DT), k-Nearest Neighbors (kNN), Random Forest (RF), and Support Vector Machine (SVM) were evaluated for their performance in classifying urine samples based on Raman spectral features. The RF and kNN models exhibited the best overall performance, with high accuracy and sensitivity, particularly in the 800–1800 cm<sup>−1</sup> region. The study also explored the correlation between Raman markers and clinical parameters, finding that the 2937 cm<sup>−1</sup> marker had strong correlations with critical clinical variables like Gleason scores and MRI PIRADS scores, suggesting its utility for prostate cancer diagnosis and staging. These findings highlight the potential of Raman spectroscopy as a non-invasive tool for prostate cancer detection and monitoring.</div></div>\",\"PeriodicalId\":433,\"journal\":{\"name\":\"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy\",\"volume\":\"344 \",\"pages\":\"Article 126661\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1386142525009680\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386142525009680","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
Urine-based Raman markers for prostate cancer diagnosis: A machine learning approach using fingerprint and lipid spectral region
This study investigates the potential of Raman spectroscopy in distinguishing between healthy individuals and prostate cancer patients using urine samples. The Boruta algorithm was applied to Raman spectral data in two distinct wavenumber regions: 800–1800 cm−1 (fingerprint region) and 2800–3000 cm−1 (lipid region). The algorithm identified important spectral features from both regions that were used to construct decision trees for classification. Key wavenumbers in the fingerprint region (1009 cm−1) and high-wavenumber region (2937 cm−1) were found to be significant markers for prostate cancer detection. Principal Component Analysis (PCA) revealed that the intensity of these markers effectively separated healthy and cancerous samples, with the 1009 cm−1 marker showing higher discriminative power. Furthermore, four classification models: Decision Tree (DT), k-Nearest Neighbors (kNN), Random Forest (RF), and Support Vector Machine (SVM) were evaluated for their performance in classifying urine samples based on Raman spectral features. The RF and kNN models exhibited the best overall performance, with high accuracy and sensitivity, particularly in the 800–1800 cm−1 region. The study also explored the correlation between Raman markers and clinical parameters, finding that the 2937 cm−1 marker had strong correlations with critical clinical variables like Gleason scores and MRI PIRADS scores, suggesting its utility for prostate cancer diagnosis and staging. These findings highlight the potential of Raman spectroscopy as a non-invasive tool for prostate cancer detection and monitoring.
期刊介绍:
Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy (SAA) is an interdisciplinary journal which spans from basic to applied aspects of optical spectroscopy in chemistry, medicine, biology, and materials science.
The journal publishes original scientific papers that feature high-quality spectroscopic data and analysis. From the broad range of optical spectroscopies, the emphasis is on electronic, vibrational or rotational spectra of molecules, rather than on spectroscopy based on magnetic moments.
Criteria for publication in SAA are novelty, uniqueness, and outstanding quality. Routine applications of spectroscopic techniques and computational methods are not appropriate.
Topics of particular interest of Spectrochimica Acta Part A include, but are not limited to:
Spectroscopy and dynamics of bioanalytical, biomedical, environmental, and atmospheric sciences,
Novel experimental techniques or instrumentation for molecular spectroscopy,
Novel theoretical and computational methods,
Novel applications in photochemistry and photobiology,
Novel interpretational approaches as well as advances in data analysis based on electronic or vibrational spectroscopy.