Seyed Morteza Naghib , Mohammad Ali Khorasani , Fariborz Sharifianjazi , Ketevan Tavamaishvili
{"title":"机器学习方法在早期乳腺癌诊断生物标志物发现中的分析策略:承诺、进展和展望","authors":"Seyed Morteza Naghib , Mohammad Ali Khorasani , Fariborz Sharifianjazi , Ketevan Tavamaishvili","doi":"10.1016/j.trac.2025.118412","DOIUrl":null,"url":null,"abstract":"<div><div>Breast Cancer (BC) remains one of the leading causes of cancer-related mortality worldwide, with early detection playing a pivotal role in improving patient survival and treatment outcomes. Biomarkers serve as critical molecular indicators that facilitate the early diagnosis of breast cancer, allowing for timely intervention before the disease progresses to more advanced stages. Traditional methods for biomarker discovery, including immunohistochemistry, polymerase chain reaction (PCR), and enzyme-linked immunosorbent assays (ELISA), have been instrumental in identifying breast cancer markers. However, these approaches often require extensive validation, are time-consuming, and may lack the ability to effectively analyze high-dimensional datasets. The rapid advancements in machine learning (ML) have transformed biomarker discovery by enabling the analysis of complex multi-omics data, integrating genomic, proteomic, and imaging datasets to identify novel biomarkers with enhanced accuracy. This study focuses on the application of ML in detecting key biomarkers COL11A1, TOP2A, MMP1, and EZH2 which are associated with tumor invasiveness, proliferation, and metastatic potential in early-stage breast cancer. These biomarkers were identified through ML-based predictive models such as Random Forest (RF), Support Vector Machines (SVMs), XGBoost, and Deep Neural Networks, which have demonstrated superior performance in distinguishing malignant from benign cases. Our findings highlight the potential of ML-driven biomarker discovery in revolutionizing breast cancer diagnostics by improving risk stratification, enhancing predictive accuracy, and facilitating personalized treatment approaches. By leveraging AI-powered methodologies, clinicians can move toward a data-driven, precision medicine approach, ultimately reducing the burden of late-stage breast cancer diagnoses and mortality rates. However, integrating ML models into routine clinical practice requires addressing key challenges, such as data standardization, model interpretability, and validation through large-scale prospective studies. Future advancements in deep learning (DL), federated learning, and explainable AI (XAI) are expected to further refine these models, ensuring their reliability and applicability in clinical settings.</div></div>","PeriodicalId":439,"journal":{"name":"Trends in Analytical Chemistry","volume":"192 ","pages":"Article 118412"},"PeriodicalIF":12.0000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analytical strategies in early breast cancer diagnostic biomarker discovery by machine learning methods: Promises, advances and outlooks\",\"authors\":\"Seyed Morteza Naghib , Mohammad Ali Khorasani , Fariborz Sharifianjazi , Ketevan Tavamaishvili\",\"doi\":\"10.1016/j.trac.2025.118412\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Breast Cancer (BC) remains one of the leading causes of cancer-related mortality worldwide, with early detection playing a pivotal role in improving patient survival and treatment outcomes. Biomarkers serve as critical molecular indicators that facilitate the early diagnosis of breast cancer, allowing for timely intervention before the disease progresses to more advanced stages. Traditional methods for biomarker discovery, including immunohistochemistry, polymerase chain reaction (PCR), and enzyme-linked immunosorbent assays (ELISA), have been instrumental in identifying breast cancer markers. However, these approaches often require extensive validation, are time-consuming, and may lack the ability to effectively analyze high-dimensional datasets. The rapid advancements in machine learning (ML) have transformed biomarker discovery by enabling the analysis of complex multi-omics data, integrating genomic, proteomic, and imaging datasets to identify novel biomarkers with enhanced accuracy. This study focuses on the application of ML in detecting key biomarkers COL11A1, TOP2A, MMP1, and EZH2 which are associated with tumor invasiveness, proliferation, and metastatic potential in early-stage breast cancer. These biomarkers were identified through ML-based predictive models such as Random Forest (RF), Support Vector Machines (SVMs), XGBoost, and Deep Neural Networks, which have demonstrated superior performance in distinguishing malignant from benign cases. Our findings highlight the potential of ML-driven biomarker discovery in revolutionizing breast cancer diagnostics by improving risk stratification, enhancing predictive accuracy, and facilitating personalized treatment approaches. By leveraging AI-powered methodologies, clinicians can move toward a data-driven, precision medicine approach, ultimately reducing the burden of late-stage breast cancer diagnoses and mortality rates. However, integrating ML models into routine clinical practice requires addressing key challenges, such as data standardization, model interpretability, and validation through large-scale prospective studies. Future advancements in deep learning (DL), federated learning, and explainable AI (XAI) are expected to further refine these models, ensuring their reliability and applicability in clinical settings.</div></div>\",\"PeriodicalId\":439,\"journal\":{\"name\":\"Trends in Analytical Chemistry\",\"volume\":\"192 \",\"pages\":\"Article 118412\"},\"PeriodicalIF\":12.0000,\"publicationDate\":\"2025-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Trends in Analytical Chemistry\",\"FirstCategoryId\":\"1\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165993625002808\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Trends in Analytical Chemistry","FirstCategoryId":"1","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165993625002808","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
Analytical strategies in early breast cancer diagnostic biomarker discovery by machine learning methods: Promises, advances and outlooks
Breast Cancer (BC) remains one of the leading causes of cancer-related mortality worldwide, with early detection playing a pivotal role in improving patient survival and treatment outcomes. Biomarkers serve as critical molecular indicators that facilitate the early diagnosis of breast cancer, allowing for timely intervention before the disease progresses to more advanced stages. Traditional methods for biomarker discovery, including immunohistochemistry, polymerase chain reaction (PCR), and enzyme-linked immunosorbent assays (ELISA), have been instrumental in identifying breast cancer markers. However, these approaches often require extensive validation, are time-consuming, and may lack the ability to effectively analyze high-dimensional datasets. The rapid advancements in machine learning (ML) have transformed biomarker discovery by enabling the analysis of complex multi-omics data, integrating genomic, proteomic, and imaging datasets to identify novel biomarkers with enhanced accuracy. This study focuses on the application of ML in detecting key biomarkers COL11A1, TOP2A, MMP1, and EZH2 which are associated with tumor invasiveness, proliferation, and metastatic potential in early-stage breast cancer. These biomarkers were identified through ML-based predictive models such as Random Forest (RF), Support Vector Machines (SVMs), XGBoost, and Deep Neural Networks, which have demonstrated superior performance in distinguishing malignant from benign cases. Our findings highlight the potential of ML-driven biomarker discovery in revolutionizing breast cancer diagnostics by improving risk stratification, enhancing predictive accuracy, and facilitating personalized treatment approaches. By leveraging AI-powered methodologies, clinicians can move toward a data-driven, precision medicine approach, ultimately reducing the burden of late-stage breast cancer diagnoses and mortality rates. However, integrating ML models into routine clinical practice requires addressing key challenges, such as data standardization, model interpretability, and validation through large-scale prospective studies. Future advancements in deep learning (DL), federated learning, and explainable AI (XAI) are expected to further refine these models, ensuring their reliability and applicability in clinical settings.
期刊介绍:
TrAC publishes succinct and critical overviews of recent advancements in analytical chemistry, designed to assist analytical chemists and other users of analytical techniques. These reviews offer excellent, up-to-date, and timely coverage of various topics within analytical chemistry. Encompassing areas such as analytical instrumentation, biomedical analysis, biomolecular analysis, biosensors, chemical analysis, chemometrics, clinical chemistry, drug discovery, environmental analysis and monitoring, food analysis, forensic science, laboratory automation, materials science, metabolomics, pesticide-residue analysis, pharmaceutical analysis, proteomics, surface science, and water analysis and monitoring, these critical reviews provide comprehensive insights for practitioners in the field.