Xianhao Wu , Rui Tao , Zhiyan Sun , Tianyao Zhang , Shaowen Zheng , Yuan Yuan , Xingyue Li , Can Cao , Zhaohui Zhang , Dabiao Zhou , Xiaoyan Zhao , Pei Yang
{"title":"Glioma grade discrimination based on class imbalance learning with terahertz spectral data","authors":"Xianhao Wu , Rui Tao , Zhiyan Sun , Tianyao Zhang , Shaowen Zheng , Yuan Yuan , Xingyue Li , Can Cao , Zhaohui Zhang , Dabiao Zhou , Xiaoyan Zhao , Pei Yang","doi":"10.1016/j.infrared.2025.105809","DOIUrl":null,"url":null,"abstract":"<div><div>Grading gliomas is crucial for prognosis and survival prediction. Clinically, gliomas are categorized into low-grade gliomas (LGG, WHO II) and high-grade gliomas (HGG, WHO III and IV). Compared to traditional diagnostic methods, terahertz spectroscopy offers advantages such as time efficiency and label-free detection. In this study, we propose a machine learning model for discriminating glioma grades based on terahertz spectral data, incorporating, for the first time, the class imbalance of terahertz spectral data from different glioma grades. A total of 420 sample data from 21 cases were included. Three data processing methods—ROS, RUS, and SMOTE—were utilized, and three classifiers were employed to construct the model. We compared the effects of the three data processing methods and validated the proposed method on a test set. The best performance of the proposed algorithm, evaluated by the area under the curve (AUC), achieved a maximum value of 88.8%. This represents a significant advancement in the development of a rapid glioma grade diagnostic tool and provides more effective guidance for developing surgical protocols for gliomas.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"147 ","pages":"Article 105809"},"PeriodicalIF":3.1000,"publicationDate":"2025-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525001021","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0
Abstract
Grading gliomas is crucial for prognosis and survival prediction. Clinically, gliomas are categorized into low-grade gliomas (LGG, WHO II) and high-grade gliomas (HGG, WHO III and IV). Compared to traditional diagnostic methods, terahertz spectroscopy offers advantages such as time efficiency and label-free detection. In this study, we propose a machine learning model for discriminating glioma grades based on terahertz spectral data, incorporating, for the first time, the class imbalance of terahertz spectral data from different glioma grades. A total of 420 sample data from 21 cases were included. Three data processing methods—ROS, RUS, and SMOTE—were utilized, and three classifiers were employed to construct the model. We compared the effects of the three data processing methods and validated the proposed method on a test set. The best performance of the proposed algorithm, evaluated by the area under the curve (AUC), achieved a maximum value of 88.8%. This represents a significant advancement in the development of a rapid glioma grade diagnostic tool and provides more effective guidance for developing surgical protocols for gliomas.
胶质瘤分级对预后和生存预测至关重要。临床上胶质瘤分为低级别胶质瘤(LGG、WHO II)和高级别胶质瘤(HGG、WHO III和IV)。与传统的诊断方法相比,太赫兹光谱具有时间效率和无标记检测等优势。在这项研究中,我们提出了一个基于太赫兹光谱数据区分胶质瘤等级的机器学习模型,首次纳入了来自不同级别胶质瘤的太赫兹光谱数据的类别不平衡。共纳入21例420份样本资料。采用ros、RUS和smote三种数据处理方法,采用三种分类器构建模型。我们比较了三种数据处理方法的效果,并在一个测试集上验证了所提出的方法。以曲线下面积(area under The curve, AUC)评价该算法的最佳性能,达到最大值88.8%。这代表了快速胶质瘤分级诊断工具发展的重大进步,并为胶质瘤的手术方案提供了更有效的指导。
期刊介绍:
The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region.
Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine.
Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.