多基因组GWAS揭示了与人类LINE-1和Alu-拷贝数变异有关的基因位点

Juan I Bravo, Lucia Zhang, Bérénice Anath Benayoun
{"title":"多基因组GWAS揭示了与人类LINE-1和Alu-拷贝数变异有关的基因位点","authors":"Juan I Bravo, Lucia Zhang, Bérénice Anath Benayoun","doi":"10.1101/2024.09.10.612283","DOIUrl":null,"url":null,"abstract":"Long INterspersed Element-1 (LINE-1; L1) and Alu are two families of transposable elements (TEs) occupying ~17% and ~11% of the human genome, respectively. Though only a small fraction of L1 copies is able to produce the machinery to mobilize autonomously, Alu elements and degenerate L1 copies can hijack their functional machinery and mobilize in trans. The expression and subsequent copy number expansion of L1 and Alu can exert pathological effects on their hosts, promoting genome instability, inflammation, and cell cycle alterations. These features have made L1 and Alu promising focus subjects in studies of aging and aging diseases where they can become active. However, the mechanisms regulating variation in their expression and copy number remain incompletely characterized. Moreover, the relevance of known mechanisms to diverse human populations remains unclear, as mechanisms are often characterized in isogenic cell culture models. To address these gaps, we leveraged genomic data from the 1000 Genomes Project to carry out a trans-ethnic GWAS of L1 and Alu insertion global singletons. These singletons are rare insertions observed only once in a population, potentially reflecting recently acquired L1 and Alu integrants or structural variants, and which we used as proxies for L1/Alu-associated copy number variation. Our computational approach identified single nucleotide variants in genomic regions containing genes with potential and known TE regulatory properties, and it enriched for single nucleotide variants in regions containing known regulators of L1 expression. Moreover, we identified many reference TE copies and polymorphic structural variants that were associated with L1/Alu singletons, suggesting their potential contribution to TE copy number variation through transposition-dependent or transposition-independent mechanisms. Finally, a transcriptional analysis of lymphoblastoid cells highlighted potential cell cycle alterations in a subset of samples harboring L1/Alu singletons. Collectively, our results (i) suggest that known TE regulatory mechanisms may also play regulatory roles in diverse human populations, (ii) expand the list of genic and repetitive genomic loci implicated in TE copy number variation, and (iii) reinforce the links between TEs and disease.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"3 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-ancestry GWAS reveals loci linked to human variation in LINE-1- and Alu-copy numbers\",\"authors\":\"Juan I Bravo, Lucia Zhang, Bérénice Anath Benayoun\",\"doi\":\"10.1101/2024.09.10.612283\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Long INterspersed Element-1 (LINE-1; L1) and Alu are two families of transposable elements (TEs) occupying ~17% and ~11% of the human genome, respectively. Though only a small fraction of L1 copies is able to produce the machinery to mobilize autonomously, Alu elements and degenerate L1 copies can hijack their functional machinery and mobilize in trans. The expression and subsequent copy number expansion of L1 and Alu can exert pathological effects on their hosts, promoting genome instability, inflammation, and cell cycle alterations. These features have made L1 and Alu promising focus subjects in studies of aging and aging diseases where they can become active. However, the mechanisms regulating variation in their expression and copy number remain incompletely characterized. Moreover, the relevance of known mechanisms to diverse human populations remains unclear, as mechanisms are often characterized in isogenic cell culture models. To address these gaps, we leveraged genomic data from the 1000 Genomes Project to carry out a trans-ethnic GWAS of L1 and Alu insertion global singletons. These singletons are rare insertions observed only once in a population, potentially reflecting recently acquired L1 and Alu integrants or structural variants, and which we used as proxies for L1/Alu-associated copy number variation. Our computational approach identified single nucleotide variants in genomic regions containing genes with potential and known TE regulatory properties, and it enriched for single nucleotide variants in regions containing known regulators of L1 expression. Moreover, we identified many reference TE copies and polymorphic structural variants that were associated with L1/Alu singletons, suggesting their potential contribution to TE copy number variation through transposition-dependent or transposition-independent mechanisms. Finally, a transcriptional analysis of lymphoblastoid cells highlighted potential cell cycle alterations in a subset of samples harboring L1/Alu singletons. Collectively, our results (i) suggest that known TE regulatory mechanisms may also play regulatory roles in diverse human populations, (ii) expand the list of genic and repetitive genomic loci implicated in TE copy number variation, and (iii) reinforce the links between TEs and disease.\",\"PeriodicalId\":501161,\"journal\":{\"name\":\"bioRxiv - Genomics\",\"volume\":\"3 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"bioRxiv - Genomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.09.10.612283\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.10.612283","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

长插入元件-1(LINE-1;L1)和Alu是两个转座元件(TE)家族,分别占人类基因组的17%和11%。虽然只有一小部分L1拷贝能够产生自主调动的机制,但Alu元件和退化的L1拷贝可以劫持其功能机制并进行反式调动。L1 和 Alu 的表达及随后的拷贝数扩增可对宿主产生病理影响,促进基因组不稳定、炎症和细胞周期改变。这些特点使 L1 和 Alu 成为研究衰老和衰老性疾病的有希望的重点对象,因为它们在这些疾病中会变得活跃。然而,调节它们的表达和拷贝数变化的机制仍未完全定性。此外,已知机制与不同人群的相关性仍不清楚,因为这些机制通常是在同源细胞培养模型中表征的。为了填补这些空白,我们利用 "千人基因组计划"(1000 Genomes Project)的基因组数据,对 L1 和 Alu 插入全局单倍子进行了跨种族 GWAS 分析。这些单体是在人群中只观察到一次的罕见插入,可能反映了最近获得的 L1 和 Alu 整合体或结构变异,我们将其用作 L1/Alu 相关拷贝数变异的替代物。我们的计算方法识别了基因组区域中的单核苷酸变异,这些区域包含具有潜在和已知 TE 调控特性的基因,并且富集了包含已知 L1 表达调控因子的区域中的单核苷酸变异。此外,我们还发现了许多与 L1/Alu 单体相关的参考 TE 拷贝和多态结构变异,这表明它们可能通过转座依赖或转座非依赖机制对 TE 拷贝数变异做出了贡献。最后,淋巴母细胞的转录分析突显了携带 L1/Alu 单倍子的部分样本中潜在的细胞周期改变。总之,我们的研究结果(i)表明已知的TE调控机制也可能在不同的人类群体中发挥调控作用,(ii)扩大了与TE拷贝数变异有关的基因和重复基因组位点的清单,(iii)加强了TE与疾病之间的联系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-ancestry GWAS reveals loci linked to human variation in LINE-1- and Alu-copy numbers
Long INterspersed Element-1 (LINE-1; L1) and Alu are two families of transposable elements (TEs) occupying ~17% and ~11% of the human genome, respectively. Though only a small fraction of L1 copies is able to produce the machinery to mobilize autonomously, Alu elements and degenerate L1 copies can hijack their functional machinery and mobilize in trans. The expression and subsequent copy number expansion of L1 and Alu can exert pathological effects on their hosts, promoting genome instability, inflammation, and cell cycle alterations. These features have made L1 and Alu promising focus subjects in studies of aging and aging diseases where they can become active. However, the mechanisms regulating variation in their expression and copy number remain incompletely characterized. Moreover, the relevance of known mechanisms to diverse human populations remains unclear, as mechanisms are often characterized in isogenic cell culture models. To address these gaps, we leveraged genomic data from the 1000 Genomes Project to carry out a trans-ethnic GWAS of L1 and Alu insertion global singletons. These singletons are rare insertions observed only once in a population, potentially reflecting recently acquired L1 and Alu integrants or structural variants, and which we used as proxies for L1/Alu-associated copy number variation. Our computational approach identified single nucleotide variants in genomic regions containing genes with potential and known TE regulatory properties, and it enriched for single nucleotide variants in regions containing known regulators of L1 expression. Moreover, we identified many reference TE copies and polymorphic structural variants that were associated with L1/Alu singletons, suggesting their potential contribution to TE copy number variation through transposition-dependent or transposition-independent mechanisms. Finally, a transcriptional analysis of lymphoblastoid cells highlighted potential cell cycle alterations in a subset of samples harboring L1/Alu singletons. Collectively, our results (i) suggest that known TE regulatory mechanisms may also play regulatory roles in diverse human populations, (ii) expand the list of genic and repetitive genomic loci implicated in TE copy number variation, and (iii) reinforce the links between TEs and disease.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信