Jingze Song , Chang Cai , Kedong Zhang , Tongshun Liu
{"title":"基于最小分类错误率标准的特征基因筛查与乳腺癌诊断","authors":"Jingze Song , Chang Cai , Kedong Zhang , Tongshun Liu","doi":"10.1016/j.measurement.2025.117787","DOIUrl":null,"url":null,"abstract":"<div><div>Breast cancer is currently the most common type of malignant cancer in women worldwide, accounting for 31 % of cancers in women, and has been on the rise in terms of both morbidity and mortality. Feature gene screening is essential for diagnosing, prognosis, and timely treatment. This study proposed a breast cancer feature genes screening method based on the minimum classification error rate criterion for accurate breast cancer diagnosis. Firstly, the overlapping area between the two distribution curves of cancer and normal gene expression data, namely, the statistically minimum classification error rate was calculated, and the breast cancer feature genes were then pre-screened from The Cancer Genome Atlas (TCGA) data with the minimum classification error rate criterion. Secondly, the feature genes were further screened based on the Weighted Gene Co-expression Network Analysis (WGCNA) and Protein-Protein Interaction (PPI) network analysis, and the Bayesian network for diagnosing breast cancer was constructed based on the screened genes. Finally, the effectiveness of the genetic screening method was validated using TCGA data within the Bayesian network diagnostic model. Experimental results showed that the method proposed in this paper had an accuracy of 96.67%, precision of 100%, recall of 93.1%, and F1 score of 0.9643, which were improved by 5%, 7.14%, 3.44%, and 5.7% compared to the conventional cancer gene screening methods with differential expression analysis.</div></div>","PeriodicalId":18349,"journal":{"name":"Measurement","volume":"253 ","pages":"Article 117787"},"PeriodicalIF":5.2000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature gene screening and diagnosis of breast cancer based on the minimum classification error rate criterion\",\"authors\":\"Jingze Song , Chang Cai , Kedong Zhang , Tongshun Liu\",\"doi\":\"10.1016/j.measurement.2025.117787\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Breast cancer is currently the most common type of malignant cancer in women worldwide, accounting for 31 % of cancers in women, and has been on the rise in terms of both morbidity and mortality. Feature gene screening is essential for diagnosing, prognosis, and timely treatment. This study proposed a breast cancer feature genes screening method based on the minimum classification error rate criterion for accurate breast cancer diagnosis. Firstly, the overlapping area between the two distribution curves of cancer and normal gene expression data, namely, the statistically minimum classification error rate was calculated, and the breast cancer feature genes were then pre-screened from The Cancer Genome Atlas (TCGA) data with the minimum classification error rate criterion. Secondly, the feature genes were further screened based on the Weighted Gene Co-expression Network Analysis (WGCNA) and Protein-Protein Interaction (PPI) network analysis, and the Bayesian network for diagnosing breast cancer was constructed based on the screened genes. Finally, the effectiveness of the genetic screening method was validated using TCGA data within the Bayesian network diagnostic model. Experimental results showed that the method proposed in this paper had an accuracy of 96.67%, precision of 100%, recall of 93.1%, and F1 score of 0.9643, which were improved by 5%, 7.14%, 3.44%, and 5.7% compared to the conventional cancer gene screening methods with differential expression analysis.</div></div>\",\"PeriodicalId\":18349,\"journal\":{\"name\":\"Measurement\",\"volume\":\"253 \",\"pages\":\"Article 117787\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2025-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0263224125011467\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0263224125011467","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Feature gene screening and diagnosis of breast cancer based on the minimum classification error rate criterion
Breast cancer is currently the most common type of malignant cancer in women worldwide, accounting for 31 % of cancers in women, and has been on the rise in terms of both morbidity and mortality. Feature gene screening is essential for diagnosing, prognosis, and timely treatment. This study proposed a breast cancer feature genes screening method based on the minimum classification error rate criterion for accurate breast cancer diagnosis. Firstly, the overlapping area between the two distribution curves of cancer and normal gene expression data, namely, the statistically minimum classification error rate was calculated, and the breast cancer feature genes were then pre-screened from The Cancer Genome Atlas (TCGA) data with the minimum classification error rate criterion. Secondly, the feature genes were further screened based on the Weighted Gene Co-expression Network Analysis (WGCNA) and Protein-Protein Interaction (PPI) network analysis, and the Bayesian network for diagnosing breast cancer was constructed based on the screened genes. Finally, the effectiveness of the genetic screening method was validated using TCGA data within the Bayesian network diagnostic model. Experimental results showed that the method proposed in this paper had an accuracy of 96.67%, precision of 100%, recall of 93.1%, and F1 score of 0.9643, which were improved by 5%, 7.14%, 3.44%, and 5.7% compared to the conventional cancer gene screening methods with differential expression analysis.
期刊介绍:
Contributions are invited on novel achievements in all fields of measurement and instrumentation science and technology. Authors are encouraged to submit novel material, whose ultimate goal is an advancement in the state of the art of: measurement and metrology fundamentals, sensors, measurement instruments, measurement and estimation techniques, measurement data processing and fusion algorithms, evaluation procedures and methodologies for plants and industrial processes, performance analysis of systems, processes and algorithms, mathematical models for measurement-oriented purposes, distributed measurement systems in a connected world.