Jihan K. Zaki, , , Jakub Tomasik, , , Jade A. McCune, , , Sabine Bahn, , , Pietro Lió*, , and , Oren A. Scherman*,
{"title":"Explainable Deep Learning Framework for SERS Bioquantification","authors":"Jihan K. Zaki, , , Jakub Tomasik, , , Jade A. McCune, , , Sabine Bahn, , , Pietro Lió*, , and , Oren A. Scherman*, ","doi":"10.1021/acssensors.5c01058","DOIUrl":null,"url":null,"abstract":"<p >Surface-enhanced Raman spectroscopy (SERS) is rapidly gaining attention as a fast and inexpensive method of biomarker quantification, which can be combined with deep learning to elucidate complex biomarker-disease relationships. Current standard practices in SERS analysis are behind the state-of-the-art machine learning approaches; however, the present challenges of SERS analysis could be effectively addressed with a robust computational framework. Furthermore, there is a need for improved model explainability for SERS analysis, which at present is insufficient in assessing the contexts in which confounding factors affect prediction outcomes. This study presents a framework for SERS bioquantification rooted in a three-step process, including spectral processing, quantification, and explainability. A serotonin quantification task in urine was assessed as a model task, with 682 SERS spectra measured in a micromolar range using cucurbit[8]uril chemical spacers. A denoising autoencoder was utilized for spectral enhancement, while convolutional neural networks (CNNs) and vision transformers were utilized for biomarker quantification. In addition, a context representative interpretable model explanation (CRIME) method was developed to suit the current needs of SERS mixture analysis explainability. Serotonin quantification was most efficient in denoised spectra analyzed using a CNN with a three-parameter logistic output layer (mean absolute error = 0.15 μM, mean percentage error = 4.67%). Subsequently, the CRIME method revealed the CNN model to present six unique prediction contexts, of which three were associated with serotonin. The proposed framework could unlock a novel, untargeted hypothesis-generating method of biomarker discovery, considering the rapid and inexpensive nature of SERS measurements and the potential to identify biomarkers from CRIME contexts.</p>","PeriodicalId":24,"journal":{"name":"ACS Sensors","volume":"10 9","pages":"6597–6606"},"PeriodicalIF":9.1000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/pdf/10.1021/acssensors.5c01058","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Sensors","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acssensors.5c01058","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Surface-enhanced Raman spectroscopy (SERS) is rapidly gaining attention as a fast and inexpensive method of biomarker quantification, which can be combined with deep learning to elucidate complex biomarker-disease relationships. Current standard practices in SERS analysis are behind the state-of-the-art machine learning approaches; however, the present challenges of SERS analysis could be effectively addressed with a robust computational framework. Furthermore, there is a need for improved model explainability for SERS analysis, which at present is insufficient in assessing the contexts in which confounding factors affect prediction outcomes. This study presents a framework for SERS bioquantification rooted in a three-step process, including spectral processing, quantification, and explainability. A serotonin quantification task in urine was assessed as a model task, with 682 SERS spectra measured in a micromolar range using cucurbit[8]uril chemical spacers. A denoising autoencoder was utilized for spectral enhancement, while convolutional neural networks (CNNs) and vision transformers were utilized for biomarker quantification. In addition, a context representative interpretable model explanation (CRIME) method was developed to suit the current needs of SERS mixture analysis explainability. Serotonin quantification was most efficient in denoised spectra analyzed using a CNN with a three-parameter logistic output layer (mean absolute error = 0.15 μM, mean percentage error = 4.67%). Subsequently, the CRIME method revealed the CNN model to present six unique prediction contexts, of which three were associated with serotonin. The proposed framework could unlock a novel, untargeted hypothesis-generating method of biomarker discovery, considering the rapid and inexpensive nature of SERS measurements and the potential to identify biomarkers from CRIME contexts.
期刊介绍:
ACS Sensors is a peer-reviewed research journal that focuses on the dissemination of new and original knowledge in the field of sensor science, particularly those that selectively sense chemical or biological species or processes. The journal covers a broad range of topics, including but not limited to biosensors, chemical sensors, gas sensors, intracellular sensors, single molecule sensors, cell chips, and microfluidic devices. It aims to publish articles that address conceptual advances in sensing technology applicable to various types of analytes or application papers that report on the use of existing sensing concepts in new ways or for new analytes.