Muhammad Tahir, Mahboobeh Norouzi, Shehroz S Khan, James R Davie, Soichiro Yamanaka, Ahmed Ashraf
{"title":"Artificial intelligence and deep learning algorithms for epigenetic sequence analysis: A review for epigeneticists and AI experts.","authors":"Muhammad Tahir, Mahboobeh Norouzi, Shehroz S Khan, James R Davie, Soichiro Yamanaka, Ahmed Ashraf","doi":"10.1016/j.compbiomed.2024.109302","DOIUrl":null,"url":null,"abstract":"<p><p>Epigenetics encompasses mechanisms that can alter the expression of genes without changing the underlying genetic sequence. The epigenetic regulation of gene expression is initiated and sustained by several mechanisms such as DNA methylation, histone modifications, chromatin conformation, and non-coding RNA. The changes in gene regulation and expression can manifest in the form of various diseases and disorders such as cancer and congenital deformities. Over the last few decades, high-throughput experimental approaches have been used to identify and understand epigenetic changes, but these laboratory experimental approaches and biochemical processes are time-consuming and expensive. To overcome these challenges, machine learning and artificial intelligence (AI) approaches have been extensively used for mapping epigenetic modifications to their phenotypic manifestations. In this paper we provide a narrative review of published research on AI models trained on epigenomic data to address a variety of problems such as prediction of disease markers, gene expression, enhancer-promoter interaction, and chromatin states. The purpose of this review is twofold as it is addressed to both AI experts and epigeneticists. For AI researchers, we provided a taxonomy of epigenetics research problems that can benefit from an AI-based approach. For epigeneticists, given each of the above problems we provide a list of candidate AI solutions in the literature. We have also identified several gaps in the literature, research challenges, and recommendations to address these challenges.</p>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"183 ","pages":"109302"},"PeriodicalIF":7.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.compbiomed.2024.109302","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Epigenetics encompasses mechanisms that can alter the expression of genes without changing the underlying genetic sequence. The epigenetic regulation of gene expression is initiated and sustained by several mechanisms such as DNA methylation, histone modifications, chromatin conformation, and non-coding RNA. The changes in gene regulation and expression can manifest in the form of various diseases and disorders such as cancer and congenital deformities. Over the last few decades, high-throughput experimental approaches have been used to identify and understand epigenetic changes, but these laboratory experimental approaches and biochemical processes are time-consuming and expensive. To overcome these challenges, machine learning and artificial intelligence (AI) approaches have been extensively used for mapping epigenetic modifications to their phenotypic manifestations. In this paper we provide a narrative review of published research on AI models trained on epigenomic data to address a variety of problems such as prediction of disease markers, gene expression, enhancer-promoter interaction, and chromatin states. The purpose of this review is twofold as it is addressed to both AI experts and epigeneticists. For AI researchers, we provided a taxonomy of epigenetics research problems that can benefit from an AI-based approach. For epigeneticists, given each of the above problems we provide a list of candidate AI solutions in the literature. We have also identified several gaps in the literature, research challenges, and recommendations to address these challenges.
表观遗传学包括在不改变基本基因序列的情况下改变基因表达的机制。基因表达的表观遗传调控是由 DNA 甲基化、组蛋白修饰、染色质构象和非编码 RNA 等几种机制启动和维持的。基因调控和表达的变化可表现为各种疾病和失调,如癌症和先天性畸形。在过去的几十年里,高通量实验方法已被用于识别和了解表观遗传变化,但这些实验室实验方法和生化过程耗时且昂贵。为了克服这些挑战,机器学习和人工智能(AI)方法已被广泛用于映射表观遗传修饰及其表型表现。在本文中,我们对已发表的有关人工智能模型的研究进行了叙述性综述,这些模型是在表观基因组数据的基础上训练而成的,用于解决疾病标志物预测、基因表达、增强子-启动子相互作用和染色质状态等各种问题。本综述具有双重目的,既面向人工智能专家,也面向表观遗传学家。对于人工智能研究人员,我们提供了可从基于人工智能的方法中获益的表观遗传学研究问题分类法。对于表观遗传学家,我们针对上述每个问题提供了一份候选人工智能解决方案的文献列表。我们还指出了文献中的几个空白点、研究挑战以及应对这些挑战的建议。
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.