Jiaqi Hu, Gina Jinna Chen, Chenlong Xue, Pei Liang, Yanqun Xiang, Chuanlun Zhang, Xiaokeng Chi, Guoying Liu, Yanfang Ye, Dongyu Cui, De Zhang, Xiaojun Yu, Hong Dang, Wen Zhang, Junfan Chen, Quan Tang, Penglai Guo, Ho-Pui Ho, Yuchao Li, Longqing Cong, Perry Ping Shum
{"title":"RSPSSL: A novel high-fidelity Raman spectral preprocessing scheme to enhance biomedical applications and chemical resolution visualization.","authors":"Jiaqi Hu, Gina Jinna Chen, Chenlong Xue, Pei Liang, Yanqun Xiang, Chuanlun Zhang, Xiaokeng Chi, Guoying Liu, Yanfang Ye, Dongyu Cui, De Zhang, Xiaojun Yu, Hong Dang, Wen Zhang, Junfan Chen, Quan Tang, Penglai Guo, Ho-Pui Ho, Yuchao Li, Longqing Cong, Perry Ping Shum","doi":"10.1038/s41377-024-01394-5","DOIUrl":null,"url":null,"abstract":"<p><p>Raman spectroscopy has tremendous potential for material analysis with its molecular fingerprinting capability in many branches of science and technology. It is also an emerging omics technique for metabolic profiling to shape precision medicine. However, precisely attributing vibration peaks coupled with specific environmental, instrumental, and specimen noise is problematic. Intelligent Raman spectral preprocessing to remove statistical bias noise and sample-related errors should provide a powerful tool for valuable information extraction. Here, we propose a novel Raman spectral preprocessing scheme based on self-supervised learning (RSPSSL) with high capacity and spectral fidelity. It can preprocess arbitrary Raman spectra without further training at a speed of ~1 900 spectra per second without human interference. The experimental data preprocessing trial demonstrated its excellent capacity and signal fidelity with an 88% reduction in root mean square error and a 60% reduction in infinite norm ([Formula: see text]) compared to established techniques. With this advantage, it remarkably enhanced various biomedical applications with a 400% accuracy elevation (ΔAUC) in cancer diagnosis, an average 38% (few-shot) and 242% accuracy improvement in paraquat concentration prediction, and unsealed the chemical resolution of biomedical hyperspectral images, especially in the spectral fingerprint region. It precisely preprocessed various Raman spectra from different spectroscopy devices, laboratories, and diverse applications. This scheme will enable biomedical mechanism screening with the label-free volumetric molecular imaging tool on organism and disease metabolomics profiling with a scenario of high throughput, cross-device, various analyte complexity, and diverse applications.</p>","PeriodicalId":18093,"journal":{"name":"Light, science & applications","volume":"13 1","pages":"52"},"PeriodicalIF":19.4000,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10876988/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Light, science & applications","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1038/s41377-024-01394-5","RegionNum":1,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Physics and Astronomy","Score":null,"Total":0}
引用次数: 0
Abstract
Raman spectroscopy has tremendous potential for material analysis with its molecular fingerprinting capability in many branches of science and technology. It is also an emerging omics technique for metabolic profiling to shape precision medicine. However, precisely attributing vibration peaks coupled with specific environmental, instrumental, and specimen noise is problematic. Intelligent Raman spectral preprocessing to remove statistical bias noise and sample-related errors should provide a powerful tool for valuable information extraction. Here, we propose a novel Raman spectral preprocessing scheme based on self-supervised learning (RSPSSL) with high capacity and spectral fidelity. It can preprocess arbitrary Raman spectra without further training at a speed of ~1 900 spectra per second without human interference. The experimental data preprocessing trial demonstrated its excellent capacity and signal fidelity with an 88% reduction in root mean square error and a 60% reduction in infinite norm ([Formula: see text]) compared to established techniques. With this advantage, it remarkably enhanced various biomedical applications with a 400% accuracy elevation (ΔAUC) in cancer diagnosis, an average 38% (few-shot) and 242% accuracy improvement in paraquat concentration prediction, and unsealed the chemical resolution of biomedical hyperspectral images, especially in the spectral fingerprint region. It precisely preprocessed various Raman spectra from different spectroscopy devices, laboratories, and diverse applications. This scheme will enable biomedical mechanism screening with the label-free volumetric molecular imaging tool on organism and disease metabolomics profiling with a scenario of high throughput, cross-device, various analyte complexity, and diverse applications.
期刊介绍:
Light: Science & Applications is an open-access, fully peer-reviewed publication.It publishes high-quality optics and photonics research globally, covering fundamental research and important issues in engineering and applied sciences related to optics and photonics.