Empirical Mode Decomposition and Grassmann Manifold-Based Cervical Cancer Detection

IF 2 3区物理与天体物理 Q3 BIOCHEMICAL RESEARCH METHODS

Journal of Biophotonics Pub Date : 2025-05-28 DOI:10.1002/jbio.202400584

Sidharthenee Nayak, Bhaswati Singha Deo, Mayukha Pal, Prasanta K. Panigrahi, Asima Pradhan

{"title":"Empirical Mode Decomposition and Grassmann Manifold-Based Cervical Cancer Detection","authors":"Sidharthenee Nayak, Bhaswati Singha Deo, Mayukha Pal, Prasanta K. Panigrahi, Asima Pradhan","doi":"10.1002/jbio.202400584","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Cervical cancer is a prevalent malignancy affecting the female reproductive system and is recognized as a prominent factor to female mortality on a global scale. Timely and precise detection of various stages of cervical cancer plays a crucial role in enhancing the chances of successful treatment and extending patient survival. Fluorescence spectroscopy stands out as a highly sensitive method for identifying biochemical alterations associated with cancer and numerous other pathological conditions. In our study, empirical mode decomposition (EMD) and Grassmann manifold (GM) learning are explored for reliable cancer detection using fluorescence spectral signals collected from 110 subjects representing various categories of the human cervix. Initially, EMD is used to decompose the signal into several multi-feature intrinsic mode functions (IMFs) on a spectral scale. Each IMF demonstrates uniqueness by capturing the inherent frequency characteristics within the signal, thus facilitating the extraction of signal features. The GM representation of IMFs is employed for investigating the non-linear subspace structure within spectral signals, which is subsequently followed by a low-rank representation to transform and analyze the spectral signals. The GM allows for the extraction of relevant information, reduction of dimensionality, and exploration of complex relationships within data, ultimately contributing to improved diagnosis. Mutual information is further used for feature selection to reduce the number of features and hence the computational cost. When the selected features were employed for classification, the Random Forest (RF) classifier attained a high five-fold validation accuracy of 99% and exhibited a minimal standard deviation of 0.02. Other state-of-the-art machine learning classifiers were also used and compared with the RF model.</p>\n </div>","PeriodicalId":184,"journal":{"name":"Journal of Biophotonics","volume":"18 7","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biophotonics","FirstCategoryId":"101","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jbio.202400584","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Cervical cancer is a prevalent malignancy affecting the female reproductive system and is recognized as a prominent factor to female mortality on a global scale. Timely and precise detection of various stages of cervical cancer plays a crucial role in enhancing the chances of successful treatment and extending patient survival. Fluorescence spectroscopy stands out as a highly sensitive method for identifying biochemical alterations associated with cancer and numerous other pathological conditions. In our study, empirical mode decomposition (EMD) and Grassmann manifold (GM) learning are explored for reliable cancer detection using fluorescence spectral signals collected from 110 subjects representing various categories of the human cervix. Initially, EMD is used to decompose the signal into several multi-feature intrinsic mode functions (IMFs) on a spectral scale. Each IMF demonstrates uniqueness by capturing the inherent frequency characteristics within the signal, thus facilitating the extraction of signal features. The GM representation of IMFs is employed for investigating the non-linear subspace structure within spectral signals, which is subsequently followed by a low-rank representation to transform and analyze the spectral signals. The GM allows for the extraction of relevant information, reduction of dimensionality, and exploration of complex relationships within data, ultimately contributing to improved diagnosis. Mutual information is further used for feature selection to reduce the number of features and hence the computational cost. When the selected features were employed for classification, the Random Forest (RF) classifier attained a high five-fold validation accuracy of 99% and exhibited a minimal standard deviation of 0.02. Other state-of-the-art machine learning classifiers were also used and compared with the RF model.

查看原文本刊更多论文

基于经验模态分解和Grassmann流形的宫颈癌检测。

子宫颈癌是一种影响女性生殖系统的普遍恶性肿瘤，被认为是全球范围内女性死亡的一个重要因素。及时准确地发现子宫颈癌的各个阶段，对提高成功治疗的机会和延长患者的生存期至关重要。荧光光谱作为一种高度敏感的方法，用于识别与癌症和许多其他病理状况相关的生化变化。在我们的研究中，经验模式分解（EMD）和格拉斯曼流形（GM）学习探索了可靠的癌症检测，使用从110个代表不同类别的人类子宫颈受试者中收集的荧光光谱信号。首先，利用EMD将信号在谱尺度上分解为多个多特征的内禀模态函数（imf）。每个IMF通过捕捉信号内固有的频率特征来展示其独特性，从而便于提取信号特征。该方法首先利用imf的GM表示来研究谱信号的非线性子空间结构，然后利用低秩表示对谱信号进行变换和分析。GM允许提取相关信息、降维和探索数据中的复杂关系，最终有助于改进诊断。互信息进一步用于特征选择，以减少特征的数量，从而减少计算成本。当选择的特征被用于分类时，随机森林（RF）分类器获得了高达99%的五倍验证精度，并表现出最小的标准偏差0.02。其他最先进的机器学习分类器也被使用，并与RF模型进行比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Biophotonics 生物-生化研究方法

CiteScore

5.70

自引率

7.10%

发文量

248

审稿时长

1 months

期刊介绍： The first international journal dedicated to publishing reviews and original articles from this exciting field, the Journal of Biophotonics covers the broad range of research on interactions between light and biological material. The journal offers a platform where the physicist communicates with the biologist and where the clinical practitioner learns about the latest tools for the diagnosis of diseases. As such, the journal is highly interdisciplinary, publishing cutting edge research in the fields of life sciences, medicine, physics, chemistry, and engineering. The coverage extends from fundamental research to specific developments, while also including the latest applications.