Title: Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosis.
Authors: Ao Song, Wanli Yang, Jun Wang, Yisa Cai, Lizheng Cai, Nan Pang, Ruihua Yu, Zhikun Liu, Chao Yang, Feng Jiang
Journal: SLAS Technology, article 100253 (Journal Article)
DOI: 10.1016/j.slast.2025.100253 (https://doi.org/10.1016/j.slast.2025.100253)
Publication date: 2025-02-01
Impact factor: 2.5 (JCR Q3, Biochemical Research Methods)
Citations: 0
Abstract
Lung cancer is one of the most prevalent and lethal malignant tumors worldwide. Clinical diagnosis currently relies primarily on chest X-ray examination, histopathological analysis, and the detection of tumor markers in blood, and each of these methods has inherent limitations. This study explores a novel diagnostic approach for lung cancer that combines attenuated total reflection-Fourier transform infrared (ATR-FTIR) spectroscopy with multiple machine learning models. FTIR spectroscopy can detect subtle differences in material structure between lung cancer tissues and normal tissues that reflect the carcinogenic process. Applying principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) to the infrared spectral data amplifies these differences. The analysis showed that the combination of the 3500-3000 cm⁻¹ and 1600-1500 cm⁻¹ spectral bands is particularly significant for differentiating the two groups. Three classification models, Support Vector Machine (SVM), k-Nearest Neighbor (kNN), and Linear Discriminant Analysis (LDA), were constructed for spectral analysis of various band combinations, and the combination of the 3500-3000 cm⁻¹ and 1600-1500 cm⁻¹ bands offered significant advantages in detecting lung cancer samples. Receiver operating characteristic (ROC) curve analysis showed that the area under the curve (AUC) exceeded 0.95 for all models, with the LDA model achieving an accuracy of 99.4% in distinguishing lung cancer patients from healthy individuals. The findings suggest that integrating ATR-FTIR spectroscopy with multiple machine learning models is a promising auxiliary method for clinical lung cancer diagnosis, enabling detection at the molecular level.
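The workflow summarized in the abstract (restrict each spectrum to a band combination, score it with a classifier, then judge the scores by ROC AUC) can be illustrated with a minimal pure-Python sketch. This is not the authors' pipeline: the function names, the default `k=3`, and the data layout are assumptions made for illustration, only kNN is shown out of the three models named, and a real analysis would more likely use an established machine learning library together with proper preprocessing and cross-validation.

```python
from typing import List, Sequence, Tuple

# Band windows named in the abstract, as (low, high) wavenumbers in cm^-1.
BANDS: List[Tuple[float, float]] = [(3000.0, 3500.0), (1500.0, 1600.0)]


def select_bands(
    wavenumbers: Sequence[float],
    absorbances: Sequence[float],
    bands: Sequence[Tuple[float, float]] = BANDS,
) -> List[float]:
    """Keep only absorbance values whose wavenumber lies in any band window."""
    return [
        a
        for w, a in zip(wavenumbers, absorbances)
        if any(lo <= w <= hi for lo, hi in bands)
    ]


def knn_score(
    train_x: Sequence[Sequence[float]],
    train_y: Sequence[int],
    x: Sequence[float],
    k: int = 3,  # hypothetical default, not taken from the paper
) -> float:
    """Fraction of the k nearest training spectra (Euclidean) with label 1."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(t, x)), y)
        for t, y in zip(train_x, train_y)
    )
    return sum(y for _, y in dists[:k]) / k


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the chance that a positive outscores a negative (ties = 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg
    )
    return wins / (len(pos) * len(neg))
```

In use, each measured spectrum would first pass through `select_bands`, the reduced spectra of held-out samples would be scored with `knn_score` against a labelled training set (1 for cancer, 0 for normal, by assumption), and `roc_auc` would summarize how well those scores separate the two groups.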
About the journal:
SLAS Technology emphasizes scientific and technical advances that enable and improve life sciences research and development; drug-delivery; diagnostics; biomedical and molecular imaging; and personalized and precision medicine. This includes high-throughput and other laboratory automation technologies; micro/nanotechnologies; analytical, separation and quantitative techniques; synthetic chemistry and biology; informatics (data analysis, statistics, bio, genomic and chemoinformatics); and more.