Jiaqi Mu , Aquib Nazar , Muhammad Asim Ali , Athar Hussain
{"title":"Integrating machine learning with OMICs data for early detection in breast cancer","authors":"Jiaqi Mu , Aquib Nazar , Muhammad Asim Ali , Athar Hussain","doi":"10.1016/j.genrep.2025.102325","DOIUrl":null,"url":null,"abstract":"<div><div>Breast cancer is one of the most common cancers that significantly affects a large population of women, emphasizing its importance in early detection for effective treatments. The advancement in technologies, especially in machine learning and its integration with multi-omics data, such as genomics, transcriptomics, proteomics, metabolomics, and imaging, is not only revolutionizing the diagnosis and prognosis of breast cancer at its early stages but also providing a door for personalized treatment plans to improve patient outcomes. However, achieving truly personalized treatment requires integration of causal inference methods into machine learning frameworks, as correlational models alone may not ensure effective or safe decision-making. The current study revisits the progress made in this research area, providing a comprehensive insight into the challenges of breast cancer early detection, machine learning (ML) in cancer detection, ML-Omics integration, clinical applications, case studies, and future directions and innovations.</div></div>","PeriodicalId":12673,"journal":{"name":"Gene Reports","volume":"41 ","pages":"Article 102325"},"PeriodicalIF":0.9000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gene Reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452014425001980","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Breast cancer is one of the most common cancers that significantly affects a large population of women, emphasizing its importance in early detection for effective treatments. The advancement in technologies, especially in machine learning and its integration with multi-omics data, such as genomics, transcriptomics, proteomics, metabolomics, and imaging, is not only revolutionizing the diagnosis and prognosis of breast cancer at its early stages but also providing a door for personalized treatment plans to improve patient outcomes. However, achieving truly personalized treatment requires integration of causal inference methods into machine learning frameworks, as correlational models alone may not ensure effective or safe decision-making. The current study revisits the progress made in this research area, providing a comprehensive insight into the challenges of breast cancer early detection, machine learning (ML) in cancer detection, ML-Omics integration, clinical applications, case studies, and future directions and innovations.
Gene ReportsBiochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.30
自引率
7.70%
发文量
246
审稿时长
49 days
期刊介绍:
Gene Reports publishes papers that focus on the regulation, expression, function and evolution of genes in all biological contexts, including all prokaryotic and eukaryotic organisms, as well as viruses. Gene Reports strives to be a very diverse journal and topics in all fields will be considered for publication. Although not limited to the following, some general topics include: DNA Organization, Replication & Evolution -Focus on genomic DNA (chromosomal organization, comparative genomics, DNA replication, DNA repair, mobile DNA, mitochondrial DNA, chloroplast DNA). Expression & Function - Focus on functional RNAs (microRNAs, tRNAs, rRNAs, mRNA splicing, alternative polyadenylation) Regulation - Focus on processes that mediate gene-read out (epigenetics, chromatin, histone code, transcription, translation, protein degradation). Cell Signaling - Focus on mechanisms that control information flow into the nucleus to control gene expression (kinase and phosphatase pathways controlled by extra-cellular ligands, Wnt, Notch, TGFbeta/BMPs, FGFs, IGFs etc.) Profiling of gene expression and genetic variation - Focus on high throughput approaches (e.g., DeepSeq, ChIP-Seq, Affymetrix microarrays, proteomics) that define gene regulatory circuitry, molecular pathways and protein/protein networks. Genetics - Focus on development in model organisms (e.g., mouse, frog, fruit fly, worm), human genetic variation, population genetics, as well as agricultural and veterinary genetics. Molecular Pathology & Regenerative Medicine - Focus on the deregulation of molecular processes in human diseases and mechanisms supporting regeneration of tissues through pluripotent or multipotent stem cells.