{"title":"人工智能驱动的癌症基因组学多组学语言模型的开发和验证:综述","authors":"Medha Jha, Yasha Hasija","doi":"10.1016/j.compbiolchem.2025.108662","DOIUrl":null,"url":null,"abstract":"<div><div>The pervasive challenges in cancer management, ranging from accurate early diagnosis to effective personalised therapies and precise patient stratification, represent significant clinical unmet needs. Artificial intelligence (AI) is transforming cancer research by offering unprecedented capabilities in analysing complex genomic datasets. AI has completely transformed omics research by simplifying the integration of multi-omics data, offering more profound insights into cancer heterogeneity, and enhancing predictive models for patient treatment responses. In the past decade, numerous studies have highlighted how AI has revolutionised omics research. AI models are instrumental in enhancing oncology by addressing these unmet needs through improved clinical trial matching, refined risk assessment, and precise treatment selection. They contribute to more personalised and effective cancer care by classifying cancer types and subtypes, identifying biomarkers, predicting drug responses, stratifying patients, and analysing tumour evolution and heterogeneity. This comprehensive review specifically focuses on the development and validation of AI-powered multi-omics language models for cancer genomics. We posit that the integration of diverse omics data types provides synergistic insights beyond single-omics approaches, which are critical for unravelling cancer heterogeneity and addressing the complex challenges within cancer genomics. The review highlights recent advances, current difficulties, and potential paths forward for these integrated AI approaches. We also detail the main elements of these models, such as their architectures, training plans, evaluation techniques, and data preprocessing. All things considered, multi-omics language models powered by AI hold immense promise for deriving biological insights from intricate cancer genomic data and converting them into actionable information for clinical settings.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"120 ","pages":"Article 108662"},"PeriodicalIF":3.1000,"publicationDate":"2025-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development and validation of AI-driven multi-omics language models for cancer genomics: A comprehensive review\",\"authors\":\"Medha Jha, Yasha Hasija\",\"doi\":\"10.1016/j.compbiolchem.2025.108662\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The pervasive challenges in cancer management, ranging from accurate early diagnosis to effective personalised therapies and precise patient stratification, represent significant clinical unmet needs. Artificial intelligence (AI) is transforming cancer research by offering unprecedented capabilities in analysing complex genomic datasets. AI has completely transformed omics research by simplifying the integration of multi-omics data, offering more profound insights into cancer heterogeneity, and enhancing predictive models for patient treatment responses. In the past decade, numerous studies have highlighted how AI has revolutionised omics research. AI models are instrumental in enhancing oncology by addressing these unmet needs through improved clinical trial matching, refined risk assessment, and precise treatment selection. They contribute to more personalised and effective cancer care by classifying cancer types and subtypes, identifying biomarkers, predicting drug responses, stratifying patients, and analysing tumour evolution and heterogeneity. This comprehensive review specifically focuses on the development and validation of AI-powered multi-omics language models for cancer genomics. We posit that the integration of diverse omics data types provides synergistic insights beyond single-omics approaches, which are critical for unravelling cancer heterogeneity and addressing the complex challenges within cancer genomics. The review highlights recent advances, current difficulties, and potential paths forward for these integrated AI approaches. We also detail the main elements of these models, such as their architectures, training plans, evaluation techniques, and data preprocessing. All things considered, multi-omics language models powered by AI hold immense promise for deriving biological insights from intricate cancer genomic data and converting them into actionable information for clinical settings.</div></div>\",\"PeriodicalId\":10616,\"journal\":{\"name\":\"Computational Biology and Chemistry\",\"volume\":\"120 \",\"pages\":\"Article 108662\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Biology and Chemistry\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1476927125003238\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Biology and Chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1476927125003238","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
Development and validation of AI-driven multi-omics language models for cancer genomics: A comprehensive review
The pervasive challenges in cancer management, ranging from accurate early diagnosis to effective personalised therapies and precise patient stratification, represent significant clinical unmet needs. Artificial intelligence (AI) is transforming cancer research by offering unprecedented capabilities in analysing complex genomic datasets. AI has completely transformed omics research by simplifying the integration of multi-omics data, offering more profound insights into cancer heterogeneity, and enhancing predictive models for patient treatment responses. In the past decade, numerous studies have highlighted how AI has revolutionised omics research. AI models are instrumental in enhancing oncology by addressing these unmet needs through improved clinical trial matching, refined risk assessment, and precise treatment selection. They contribute to more personalised and effective cancer care by classifying cancer types and subtypes, identifying biomarkers, predicting drug responses, stratifying patients, and analysing tumour evolution and heterogeneity. This comprehensive review specifically focuses on the development and validation of AI-powered multi-omics language models for cancer genomics. We posit that the integration of diverse omics data types provides synergistic insights beyond single-omics approaches, which are critical for unravelling cancer heterogeneity and addressing the complex challenges within cancer genomics. The review highlights recent advances, current difficulties, and potential paths forward for these integrated AI approaches. We also detail the main elements of these models, such as their architectures, training plans, evaluation techniques, and data preprocessing. All things considered, multi-omics language models powered by AI hold immense promise for deriving biological insights from intricate cancer genomic data and converting them into actionable information for clinical settings.
期刊介绍:
Computational Biology and Chemistry publishes original research papers and review articles in all areas of computational life sciences. High quality research contributions with a major computational component in the areas of nucleic acid and protein sequence research, molecular evolution, molecular genetics (functional genomics and proteomics), theory and practice of either biology-specific or chemical-biology-specific modeling, and structural biology of nucleic acids and proteins are particularly welcome. Exceptionally high quality research work in bioinformatics, systems biology, ecology, computational pharmacology, metabolism, biomedical engineering, epidemiology, and statistical genetics will also be considered.
Given their inherent uncertainty, protein modeling and molecular docking studies should be thoroughly validated. In the absence of experimental results for validation, the use of molecular dynamics simulations along with detailed free energy calculations, for example, should be used as complementary techniques to support the major conclusions. Submissions of premature modeling exercises without additional biological insights will not be considered.
Review articles will generally be commissioned by the editors and should not be submitted to the journal without explicit invitation. However prospective authors are welcome to send a brief (one to three pages) synopsis, which will be evaluated by the editors.