基于序列的机器学习揭示了倭黑猩猩和黑猩猩基因组的三维差异。

IF 3.2 2区 生物学 Q2 EVOLUTIONARY BIOLOGY
Colin M Brand, Shuzhen Kuang, Erin N Gilbertson, Evonne McArthur, Katherine S Pollard, Timothy H Webster, John A Capra
{"title":"基于序列的机器学习揭示了倭黑猩猩和黑猩猩基因组的三维差异。","authors":"Colin M Brand, Shuzhen Kuang, Erin N Gilbertson, Evonne McArthur, Katherine S Pollard, Timothy H Webster, John A Capra","doi":"10.1093/gbe/evae210","DOIUrl":null,"url":null,"abstract":"<p><p>The 3D structure of the genome is an important mediator of gene expression. As phenotypic divergence is largely driven by gene regulatory variation, comparing genome 3D contacts across species can further understanding of the molecular basis of species differences. However, while experimental data on genome 3D contacts in humans are increasingly abundant, only a handful of 3D genome contact maps exist for other species. Here, we demonstrate that human experimental data can be used to close this data gap. We apply a machine learning model that predicts 3D genome contacts from DNA sequence to the genomes from 56 bonobos and chimpanzees and identify species-specific patterns of genome folding. We estimated 3D divergence between individuals from the resulting contact maps in 4,420 1 Mb genomic windows, of which ∼17% were substantially divergent in predicted genome contacts. Bonobos and chimpanzees diverged at 89 windows, overlapping genes associated with multiple traits implicated in Pan phenotypic divergence. We discovered 51 bonobo-specific variants that individually produce the observed bonobo contact pattern in bonobo-chimpanzee divergent windows. Our results demonstrate that machine learning methods can leverage human data to fill in data gaps across species, offering the first look at population-level 3D genome variation in nonhuman primates. We also identify loci where changes in 3D folding may contribute to phenotypic differences in our closest living relatives.</p>","PeriodicalId":12779,"journal":{"name":"Genome Biology and Evolution","volume":" ","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11579661/pdf/","citationCount":"0","resultStr":"{\"title\":\"Sequence-Based Machine Learning Reveals 3D Genome Differences between Bonobos and Chimpanzees.\",\"authors\":\"Colin M Brand, Shuzhen Kuang, Erin N Gilbertson, Evonne McArthur, Katherine S Pollard, Timothy H Webster, John A Capra\",\"doi\":\"10.1093/gbe/evae210\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The 3D structure of the genome is an important mediator of gene expression. As phenotypic divergence is largely driven by gene regulatory variation, comparing genome 3D contacts across species can further understanding of the molecular basis of species differences. However, while experimental data on genome 3D contacts in humans are increasingly abundant, only a handful of 3D genome contact maps exist for other species. Here, we demonstrate that human experimental data can be used to close this data gap. We apply a machine learning model that predicts 3D genome contacts from DNA sequence to the genomes from 56 bonobos and chimpanzees and identify species-specific patterns of genome folding. We estimated 3D divergence between individuals from the resulting contact maps in 4,420 1 Mb genomic windows, of which ∼17% were substantially divergent in predicted genome contacts. Bonobos and chimpanzees diverged at 89 windows, overlapping genes associated with multiple traits implicated in Pan phenotypic divergence. We discovered 51 bonobo-specific variants that individually produce the observed bonobo contact pattern in bonobo-chimpanzee divergent windows. Our results demonstrate that machine learning methods can leverage human data to fill in data gaps across species, offering the first look at population-level 3D genome variation in nonhuman primates. We also identify loci where changes in 3D folding may contribute to phenotypic differences in our closest living relatives.</p>\",\"PeriodicalId\":12779,\"journal\":{\"name\":\"Genome Biology and Evolution\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11579661/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genome Biology and Evolution\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/gbe/evae210\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"EVOLUTIONARY BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Biology and Evolution","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/gbe/evae210","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"EVOLUTIONARY BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

基因组的三维结构是基因表达的重要媒介。由于表型差异主要是由基因调控变异驱动的,因此比较不同物种的基因组三维接触可以进一步了解物种差异的分子基础。然而,虽然人类基因组三维接触的实验数据越来越丰富,但其他物种的基因组三维接触图却寥寥无几。在这里,我们证明人类实验数据可用于填补这一数据空白。我们在 56 只倭黑猩猩和黑猩猩的基因组中应用了一种机器学习模型,该模型可通过 DNA 序列预测基因组的三维接触,并识别出基因组折叠的物种特异性模式。我们在 4,420 个 1 Mb 的基因组窗口中,根据所得到的接触图估计了个体间的三维差异,其中有 17% 的个体在预测的基因组接触中存在实质性差异。倭黑猩猩和黑猩猩在89个窗口存在差异,与泛型表型差异相关的多个性状的基因重叠。我们发现了 51 个倭黑猩猩特异变体,这些变体在倭黑猩猩与黑猩猩的分歧窗口中单独产生了观察到的倭黑猩猩接触模式。我们的研究结果表明,机器学习方法可以利用人类数据来填补物种间的数据空白,首次揭示了非人灵长类动物种群水平的三维基因组变异。我们还确定了三维折叠变化可能导致我们近亲的表型差异的位点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sequence-Based Machine Learning Reveals 3D Genome Differences between Bonobos and Chimpanzees.

The 3D structure of the genome is an important mediator of gene expression. As phenotypic divergence is largely driven by gene regulatory variation, comparing genome 3D contacts across species can further understanding of the molecular basis of species differences. However, while experimental data on genome 3D contacts in humans are increasingly abundant, only a handful of 3D genome contact maps exist for other species. Here, we demonstrate that human experimental data can be used to close this data gap. We apply a machine learning model that predicts 3D genome contacts from DNA sequence to the genomes from 56 bonobos and chimpanzees and identify species-specific patterns of genome folding. We estimated 3D divergence between individuals from the resulting contact maps in 4,420 1 Mb genomic windows, of which ∼17% were substantially divergent in predicted genome contacts. Bonobos and chimpanzees diverged at 89 windows, overlapping genes associated with multiple traits implicated in Pan phenotypic divergence. We discovered 51 bonobo-specific variants that individually produce the observed bonobo contact pattern in bonobo-chimpanzee divergent windows. Our results demonstrate that machine learning methods can leverage human data to fill in data gaps across species, offering the first look at population-level 3D genome variation in nonhuman primates. We also identify loci where changes in 3D folding may contribute to phenotypic differences in our closest living relatives.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Genome Biology and Evolution
Genome Biology and Evolution EVOLUTIONARY BIOLOGY-GENETICS & HEREDITY
CiteScore
5.80
自引率
6.10%
发文量
169
审稿时长
1 months
期刊介绍: About the journal Genome Biology and Evolution (GBE) publishes leading original research at the interface between evolutionary biology and genomics. Papers considered for publication report novel evolutionary findings that concern natural genome diversity, population genomics, the structure, function, organisation and expression of genomes, comparative genomics, proteomics, and environmental genomic interactions. Major evolutionary insights from the fields of computational biology, structural biology, developmental biology, and cell biology are also considered, as are theoretical advances in the field of genome evolution. GBE’s scope embraces genome-wide evolutionary investigations at all taxonomic levels and for all forms of life — within populations or across domains. Its aims are to further the understanding of genomes in their evolutionary context and further the understanding of evolution from a genome-wide perspective.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信