J. Antonio Baeza, Jeremiah J. Minish, Todd P. Michael
{"title":"利用纳米孔长读取技术组装三种海鲢鱼(Teleostei: Kyphosidae)的线粒体基因组。","authors":"J. Antonio Baeza, Jeremiah J. Minish, Todd P. Michael","doi":"10.1111/1755-0998.14034","DOIUrl":null,"url":null,"abstract":"<p>Complete mitochondrial genomes have become markers of choice to explore phylogenetic relationships at multiple taxonomic levels and they are often assembled using whole genome short-read sequencing. Herein, using three species of sea chubs as an example, we explored the accuracy of mitochondrial chromosomes assembled using Oxford Nanopore Technology (ONT) Kit 14 R10.4.1 long reads at different sequencing depths (high, low and very low or genome skimming) by comparing them to ‘gold’ standard reference mitochondrial genomes assembled using Illumina NovaSeq short reads. In two species of sea chubs, <i>Girella nigricans</i> and <i>Kyphosus azureus</i>, ONT long-read assembled mitochondrial genomes at high sequencing depths (> 25× whole [nuclear] genome) were identical to their respective short-read assembled mitochondrial genomes. Not a single ‘homopolymer insertion’, ‘homopolymer deletion’, ‘simple substitution’, ‘single insertion’, ‘short insertion’, ‘single deletion’ or ‘short deletion’ were detected in the long-read assembled mitochondrial genomes after aligning each one of them to their short-read counterparts. In turn, in a third species, <i>Medialuna californiensis</i>, a 25× sequencing depth long-read assembled mitochondrial genome was 14 nucleotides longer than its short-read counterpart. The difference in total length between the latter two assemblies was due to the presence of a short motif 14 bp long that was repeated (twice) in the long read but not in the short-read assembly. Read subsampling at a sequencing depth of 1× resulted in the assembly of partial or complete mitochondrial genomes with numerous errors, including, among others, simple indels, and indels at homopolymer regions. At 3× and 5× subsampling, genomes were identical (perfect) or almost identical (quasiperfect, 99.5% over 16,500 bp) to their respective Illumina assemblies. The newly assembled mitochondrial genomes exhibit identical gene composition and organisation compared with cofamilial species and a phylomitogenomic analysis based on translated protein-coding genes suggested that the family Kyphosidae is not monophyletic. The same analysis detected possible cases of misidentification of mitochondrial genomes deposited in GenBank. This study demonstrates that perfect (complete and fully accurate) or quasiperfect (complete but with a single or a very few errors) mitochondrial genomes can be assembled at high (> 25×) and low (3–5×) but not very low (1×, genome skimming) sequencing depths using ONT long reads and the latest ONT chemistries (Kit 14 and R10.4.1 flowcells with SUP basecalling). The newly assembled and annotated mitochondrial genomes can be used as a reference in environmental DNA studies focusing on bioprospecting and biomonitoring of these and other coastal species experiencing environmental insult. Given the small size of the sequencing device and low cost, we argue that ONT technology has the potential to improve access to high-throughput sequencing technologies in low- and moderate-income countries.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 1","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.14034","citationCount":"0","resultStr":"{\"title\":\"Assembly of Mitochondrial Genomes Using Nanopore Long-Read Technology in Three Sea Chubs (Teleostei: Kyphosidae)\",\"authors\":\"J. Antonio Baeza, Jeremiah J. Minish, Todd P. Michael\",\"doi\":\"10.1111/1755-0998.14034\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Complete mitochondrial genomes have become markers of choice to explore phylogenetic relationships at multiple taxonomic levels and they are often assembled using whole genome short-read sequencing. Herein, using three species of sea chubs as an example, we explored the accuracy of mitochondrial chromosomes assembled using Oxford Nanopore Technology (ONT) Kit 14 R10.4.1 long reads at different sequencing depths (high, low and very low or genome skimming) by comparing them to ‘gold’ standard reference mitochondrial genomes assembled using Illumina NovaSeq short reads. In two species of sea chubs, <i>Girella nigricans</i> and <i>Kyphosus azureus</i>, ONT long-read assembled mitochondrial genomes at high sequencing depths (> 25× whole [nuclear] genome) were identical to their respective short-read assembled mitochondrial genomes. Not a single ‘homopolymer insertion’, ‘homopolymer deletion’, ‘simple substitution’, ‘single insertion’, ‘short insertion’, ‘single deletion’ or ‘short deletion’ were detected in the long-read assembled mitochondrial genomes after aligning each one of them to their short-read counterparts. In turn, in a third species, <i>Medialuna californiensis</i>, a 25× sequencing depth long-read assembled mitochondrial genome was 14 nucleotides longer than its short-read counterpart. The difference in total length between the latter two assemblies was due to the presence of a short motif 14 bp long that was repeated (twice) in the long read but not in the short-read assembly. Read subsampling at a sequencing depth of 1× resulted in the assembly of partial or complete mitochondrial genomes with numerous errors, including, among others, simple indels, and indels at homopolymer regions. At 3× and 5× subsampling, genomes were identical (perfect) or almost identical (quasiperfect, 99.5% over 16,500 bp) to their respective Illumina assemblies. The newly assembled mitochondrial genomes exhibit identical gene composition and organisation compared with cofamilial species and a phylomitogenomic analysis based on translated protein-coding genes suggested that the family Kyphosidae is not monophyletic. The same analysis detected possible cases of misidentification of mitochondrial genomes deposited in GenBank. This study demonstrates that perfect (complete and fully accurate) or quasiperfect (complete but with a single or a very few errors) mitochondrial genomes can be assembled at high (> 25×) and low (3–5×) but not very low (1×, genome skimming) sequencing depths using ONT long reads and the latest ONT chemistries (Kit 14 and R10.4.1 flowcells with SUP basecalling). The newly assembled and annotated mitochondrial genomes can be used as a reference in environmental DNA studies focusing on bioprospecting and biomonitoring of these and other coastal species experiencing environmental insult. Given the small size of the sequencing device and low cost, we argue that ONT technology has the potential to improve access to high-throughput sequencing technologies in low- and moderate-income countries.</p>\",\"PeriodicalId\":211,\"journal\":{\"name\":\"Molecular Ecology Resources\",\"volume\":\"25 1\",\"pages\":\"\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2024-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.14034\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Ecology Resources\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/1755-0998.14034\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Ecology Resources","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/1755-0998.14034","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
完整的线粒体基因组已成为探索多分类水平系统发育关系的首选标记,通常使用全基因组短读测序法组装线粒体基因组。在此,我们以三个物种的海鲦为例,通过与使用 Illumina NovaSeq 短读取组装的 "黄金 "标准参考线粒体基因组进行比较,探讨了使用牛津纳米孔技术(ONT)14 R10.4.1 套件在不同测序深度(高、低和极低或基因组撇取)下组装的线粒体染色体的准确性。在两种海鲦(Girella nigricans 和 Kyphosus azureus)中,高测序深度(> 25× 全[核]基因组)下 ONT 长读数组装的线粒体基因组与各自短读数组装的线粒体基因组完全相同。将长线程组装的线粒体基因组与短线程组装的线粒体基因组进行比对后,没有发现任何 "同源多聚物插入"、"同源多聚物缺失"、"简单替换"、"单插入"、"短插入"、"单缺失 "或 "短缺失"。而在第三个物种--加州麦地那龙(Medialuna californiensis)中,25 倍测序深度的长线粒体基因组比短线粒体基因组长 14 个核苷酸。后两种装配的总长度之所以不同,是因为存在一个 14 bp 长的短图案,该图案在长读数中重复(两次),但在短读数装配中没有。在测序深度为 1× 的情况下,读数子取样会导致部分或完整线粒体基因组的组装出现大量错误,其中包括简单嵌合和同源多聚物区域的嵌合。在 3 倍和 5 倍子取样时,基因组与各自的 Illumina 组装结果完全相同(完美)或几乎完全相同(准完美,在 16,500 bp 上达到 99.5%)。与同族物种相比,新组装的线粒体基因组显示出相同的基因组成和组织结构,而基于翻译蛋白编码基因的系统发生组分析表明,Kyphosidae科并非单系。同样的分析还发现了存放在 GenBank 中的线粒体基因组可能存在识别错误的情况。这项研究表明,使用 ONT 长读数和最新的 ONT 化学试剂(Kit 14 和 R10.4.1 flowcells,带 SUP basecalling),可以在高测序深度(> 25×)和低测序深度(3-5×)但不是极低测序深度(1×,基因组略读)下组装出完美(完整且完全准确)或准完美(完整但只有一个或极少数错误)的线粒体基因组。新组装和注释的线粒体基因组可作为环境 DNA 研究的参考,重点是这些物种和其他遭受环境污染的沿海物种的生物勘探和生物监测。鉴于测序装置体积小、成本低,我们认为 ONT 技术有可能改善中低收入国家对高通量测序技术的利用。
Assembly of Mitochondrial Genomes Using Nanopore Long-Read Technology in Three Sea Chubs (Teleostei: Kyphosidae)
Complete mitochondrial genomes have become markers of choice to explore phylogenetic relationships at multiple taxonomic levels and they are often assembled using whole genome short-read sequencing. Herein, using three species of sea chubs as an example, we explored the accuracy of mitochondrial chromosomes assembled using Oxford Nanopore Technology (ONT) Kit 14 R10.4.1 long reads at different sequencing depths (high, low and very low or genome skimming) by comparing them to ‘gold’ standard reference mitochondrial genomes assembled using Illumina NovaSeq short reads. In two species of sea chubs, Girella nigricans and Kyphosus azureus, ONT long-read assembled mitochondrial genomes at high sequencing depths (> 25× whole [nuclear] genome) were identical to their respective short-read assembled mitochondrial genomes. Not a single ‘homopolymer insertion’, ‘homopolymer deletion’, ‘simple substitution’, ‘single insertion’, ‘short insertion’, ‘single deletion’ or ‘short deletion’ were detected in the long-read assembled mitochondrial genomes after aligning each one of them to their short-read counterparts. In turn, in a third species, Medialuna californiensis, a 25× sequencing depth long-read assembled mitochondrial genome was 14 nucleotides longer than its short-read counterpart. The difference in total length between the latter two assemblies was due to the presence of a short motif 14 bp long that was repeated (twice) in the long read but not in the short-read assembly. Read subsampling at a sequencing depth of 1× resulted in the assembly of partial or complete mitochondrial genomes with numerous errors, including, among others, simple indels, and indels at homopolymer regions. At 3× and 5× subsampling, genomes were identical (perfect) or almost identical (quasiperfect, 99.5% over 16,500 bp) to their respective Illumina assemblies. The newly assembled mitochondrial genomes exhibit identical gene composition and organisation compared with cofamilial species and a phylomitogenomic analysis based on translated protein-coding genes suggested that the family Kyphosidae is not monophyletic. The same analysis detected possible cases of misidentification of mitochondrial genomes deposited in GenBank. This study demonstrates that perfect (complete and fully accurate) or quasiperfect (complete but with a single or a very few errors) mitochondrial genomes can be assembled at high (> 25×) and low (3–5×) but not very low (1×, genome skimming) sequencing depths using ONT long reads and the latest ONT chemistries (Kit 14 and R10.4.1 flowcells with SUP basecalling). The newly assembled and annotated mitochondrial genomes can be used as a reference in environmental DNA studies focusing on bioprospecting and biomonitoring of these and other coastal species experiencing environmental insult. Given the small size of the sequencing device and low cost, we argue that ONT technology has the potential to improve access to high-throughput sequencing technologies in low- and moderate-income countries.
期刊介绍:
Molecular Ecology Resources promotes the creation of comprehensive resources for the scientific community, encompassing computer programs, statistical and molecular advancements, and a diverse array of molecular tools. Serving as a conduit for disseminating these resources, the journal targets a broad audience of researchers in the fields of evolution, ecology, and conservation. Articles in Molecular Ecology Resources are crafted to support investigations tackling significant questions within these disciplines.
In addition to original resource articles, Molecular Ecology Resources features Reviews, Opinions, and Comments relevant to the field. The journal also periodically releases Special Issues focusing on resource development within specific areas.