Developing a gene expression classifier for breast cancer diagnosis.

IF 2.6 4区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Zahra Hosseinpour, Mostafa Rezaei-Tavirani, Mohammad-Esmaeil Akbari, Masoumeh Farahani
{"title":"Developing a gene expression classifier for breast cancer diagnosis.","authors":"Zahra Hosseinpour, Mostafa Rezaei-Tavirani, Mohammad-Esmaeil Akbari, Masoumeh Farahani","doi":"10.1007/s11517-025-03329-7","DOIUrl":null,"url":null,"abstract":"<p><p>Breast cancer (BC) is the most common type of cancer in women worldwide. Solid tumors are complex structures composed of many cell types and extracellular matrix components. Understanding solid tumors is crucial for developing effective treatments. This study aimed to develop a gene expression classifier to predict BC with high accuracy. The study first identified the most important genes for cancer through differential expression analysis (DEA) between breast cancer and adjacent normal breast samples. The R package STRINGdb was then used to create a protein-protein interaction network (PPI) to examine upregulated genes and find clusters. Enrichment analyses were performed to identify overrepresented biological functions and pathways. A logistic regression prediction model was developed using a breast cancer dataset from TCGA and evaluated using discrimination and calibration measures. BUB1 expression in breast cancer was also investigated using quantitative analysis. Two significant clusters were identified, with cell cycle checkpoints and M phase key pathways in one cluster and extracellular matrix organization in the other. A prediction model using the hub gene set (COMP, FN1, SDC1, BUB1, TTK, and NUSAP1) showed high sensitivity (97.2%) and specificity (96.1%), and an AUC of 0.994. Three hub genes (COMP, FN1, and SDC1) were identified through the PPI network, strongly linked to extracellular matrix organization (BUB1, TTK, and NUSAP1) as hub genes involved in M phase and cell cycle checkpoints. Overall, the study identified hub pathways and genes that accurately distinguish between cancer and normal samples, presenting promising new possibilities for early cancer detection and improved BC therapy.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical & Biological Engineering & Computing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11517-025-03329-7","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Breast cancer (BC) is the most common type of cancer in women worldwide. Solid tumors are complex structures composed of many cell types and extracellular matrix components. Understanding solid tumors is crucial for developing effective treatments. This study aimed to develop a gene expression classifier to predict BC with high accuracy. The study first identified the most important genes for cancer through differential expression analysis (DEA) between breast cancer and adjacent normal breast samples. The R package STRINGdb was then used to create a protein-protein interaction network (PPI) to examine upregulated genes and find clusters. Enrichment analyses were performed to identify overrepresented biological functions and pathways. A logistic regression prediction model was developed using a breast cancer dataset from TCGA and evaluated using discrimination and calibration measures. BUB1 expression in breast cancer was also investigated using quantitative analysis. Two significant clusters were identified, with cell cycle checkpoints and M phase key pathways in one cluster and extracellular matrix organization in the other. A prediction model using the hub gene set (COMP, FN1, SDC1, BUB1, TTK, and NUSAP1) showed high sensitivity (97.2%) and specificity (96.1%), and an AUC of 0.994. Three hub genes (COMP, FN1, and SDC1) were identified through the PPI network, strongly linked to extracellular matrix organization (BUB1, TTK, and NUSAP1) as hub genes involved in M phase and cell cycle checkpoints. Overall, the study identified hub pathways and genes that accurately distinguish between cancer and normal samples, presenting promising new possibilities for early cancer detection and improved BC therapy.

开发一种用于乳腺癌诊断的基因表达分类器。
乳腺癌(BC)是世界范围内女性最常见的癌症类型。实体瘤是由多种细胞类型和细胞外基质成分组成的复杂结构。了解实体瘤对于开发有效的治疗方法至关重要。本研究旨在建立一个基因表达分类器来预测BC的准确性。该研究首先通过乳腺癌和邻近正常乳腺样本之间的差异表达分析(DEA)确定了最重要的癌症基因。然后使用R包STRINGdb创建蛋白相互作用网络(protein-protein interaction network, PPI)来检查上调的基因并找到簇。进行富集分析以确定过度代表的生物学功能和途径。使用TCGA的乳腺癌数据集建立逻辑回归预测模型,并使用判别和校准措施进行评估。用定量分析方法研究了乳腺癌中BUB1的表达。鉴定出两个重要的集群,其中一个集群具有细胞周期检查点和M期关键通路,另一个集群具有细胞外基质组织。使用枢纽基因集(COMP、FN1、SDC1、BUB1、TTK和NUSAP1)的预测模型具有较高的敏感性(97.2%)和特异性(96.1%),AUC为0.994。通过PPI网络鉴定出三个中心基因(COMP, FN1和SDC1),它们与细胞外基质组织(BUB1, TTK和NUSAP1)密切相关,是参与M期和细胞周期检查点的中心基因。总体而言,该研究确定了准确区分癌症和正常样本的中枢通路和基因,为早期癌症检测和改进BC治疗提供了有希望的新可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Medical & Biological Engineering & Computing
Medical & Biological Engineering & Computing 医学-工程:生物医学
CiteScore
6.00
自引率
3.10%
发文量
249
审稿时长
3.5 months
期刊介绍: Founded in 1963, Medical & Biological Engineering & Computing (MBEC) continues to serve the biomedical engineering community, covering the entire spectrum of biomedical and clinical engineering. The journal presents exciting and vital experimental and theoretical developments in biomedical science and technology, and reports on advances in computer-based methodologies in these multidisciplinary subjects. The journal also incorporates new and evolving technologies including cellular engineering and molecular imaging. MBEC publishes original research articles as well as reviews and technical notes. Its Rapid Communications category focuses on material of immediate value to the readership, while the Controversies section provides a forum to exchange views on selected issues, stimulating a vigorous and informed debate in this exciting and high profile field. MBEC is an official journal of the International Federation of Medical and Biological Engineering (IFMBE).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信