Characterization of nuclear mitochondrial insertions in canine genome assemblies

Peter Z Schall, Jennifer R S Meadows, Fabian Ramos-Almodovar, Jeffrey M Kidd
bioRxiv - Genomics, published 2024-09-18
DOI: 10.1101/2024.09.13.612826 (https://doi.org/10.1101/2024.09.13.612826)

Abstract

Background: The presence of mitochondrial sequences in the nuclear genome (Numts) confounds analyses of mitochondrial sequence variation and is a potential source of false positives in disease studies. To improve the analysis of mitochondrial variation in canines, we completed a systematic assessment of Numt content across genome assemblies, canine populations, and the carnivore lineage.

Results: Centering our analysis on the UU_Cfam_GSD_1.0/canFam4/Mischka assembly, a commonly used reference in dog genetic variation studies, we find a total of 321 Numts, located throughout the nuclear genome and encompassing the entire sequence of the mitochondrial genome. Comparison to 14 canine genome assemblies identified 63 Numts with presence-absence dimorphism among dogs, wolves, and a coyote. Further, a subset of Numts was maintained across carnivore evolutionary time (arctic fox, polar bear, cat), with 8 sequences shared with the domestic cat and likely more than 10 million years old. On a population level, using structural variant data from the Dog10K Consortium for 1,879 dogs and wolves, we identified 11 Numts that are absent in at least one sample as well as 53 Numts that are absent from the Mischka assembly.

Conclusions: We highlight scenarios where the presence of Numts is a potentially confounding factor and provide an annotation of these sequences in canine genome assemblies. This resource will aid the identification and interpretation of polymorphisms in both somatic and germline mitochondrial studies in canines.