{"title":"Developing a gene expression classifier for breast cancer diagnosis.","authors":"Zahra Hosseinpour, Mostafa Rezaei-Tavirani, Mohammad-Esmaeil Akbari, Masoumeh Farahani","doi":"10.1007/s11517-025-03329-7","DOIUrl":null,"url":null,"abstract":"<p><p>Breast cancer (BC) is the most common type of cancer in women worldwide. Solid tumors are complex structures composed of many cell types and extracellular matrix components. Understanding solid tumors is crucial for developing effective treatments. This study aimed to develop a gene expression classifier to predict BC with high accuracy. The study first identified the most important genes for cancer through differential expression analysis (DEA) between breast cancer and adjacent normal breast samples. The R package STRINGdb was then used to create a protein-protein interaction network (PPI) to examine upregulated genes and find clusters. Enrichment analyses were performed to identify overrepresented biological functions and pathways. A logistic regression prediction model was developed using a breast cancer dataset from TCGA and evaluated using discrimination and calibration measures. BUB1 expression in breast cancer was also investigated using quantitative analysis. Two significant clusters were identified, with cell cycle checkpoints and M phase key pathways in one cluster and extracellular matrix organization in the other. A prediction model using the hub gene set (COMP, FN1, SDC1, BUB1, TTK, and NUSAP1) showed high sensitivity (97.2%) and specificity (96.1%), and an AUC of 0.994. Three hub genes (COMP, FN1, and SDC1) were identified through the PPI network, strongly linked to extracellular matrix organization (BUB1, TTK, and NUSAP1) as hub genes involved in M phase and cell cycle checkpoints. Overall, the study identified hub pathways and genes that accurately distinguish between cancer and normal samples, presenting promising new possibilities for early cancer detection and improved BC therapy.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical & Biological Engineering & Computing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11517-025-03329-7","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Breast cancer (BC) is the most common type of cancer in women worldwide. Solid tumors are complex structures composed of many cell types and extracellular matrix components. Understanding solid tumors is crucial for developing effective treatments. This study aimed to develop a gene expression classifier to predict BC with high accuracy. The study first identified the most important genes for cancer through differential expression analysis (DEA) between breast cancer and adjacent normal breast samples. The R package STRINGdb was then used to create a protein-protein interaction network (PPI) to examine upregulated genes and find clusters. Enrichment analyses were performed to identify overrepresented biological functions and pathways. A logistic regression prediction model was developed using a breast cancer dataset from TCGA and evaluated using discrimination and calibration measures. BUB1 expression in breast cancer was also investigated using quantitative analysis. Two significant clusters were identified, with cell cycle checkpoints and M phase key pathways in one cluster and extracellular matrix organization in the other. A prediction model using the hub gene set (COMP, FN1, SDC1, BUB1, TTK, and NUSAP1) showed high sensitivity (97.2%) and specificity (96.1%), and an AUC of 0.994. Three hub genes (COMP, FN1, and SDC1) were identified through the PPI network, strongly linked to extracellular matrix organization (BUB1, TTK, and NUSAP1) as hub genes involved in M phase and cell cycle checkpoints. Overall, the study identified hub pathways and genes that accurately distinguish between cancer and normal samples, presenting promising new possibilities for early cancer detection and improved BC therapy.
期刊介绍:
Founded in 1963, Medical & Biological Engineering & Computing (MBEC) continues to serve the biomedical engineering community, covering the entire spectrum of biomedical and clinical engineering. The journal presents exciting and vital experimental and theoretical developments in biomedical science and technology, and reports on advances in computer-based methodologies in these multidisciplinary subjects. The journal also incorporates new and evolving technologies including cellular engineering and molecular imaging.
MBEC publishes original research articles as well as reviews and technical notes. Its Rapid Communications category focuses on material of immediate value to the readership, while the Controversies section provides a forum to exchange views on selected issues, stimulating a vigorous and informed debate in this exciting and high profile field.
MBEC is an official journal of the International Federation of Medical and Biological Engineering (IFMBE).