The Utilization of Reference-Guided Assembly and In Silico Libraries Improves the Draft Genome of Clarias batrachus and Culter alburnus

IF 2.6 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY
Kai Liu, Nan Xie, Yuxi Wang, Xinyi Liu
{"title":"The Utilization of Reference-Guided Assembly and In Silico Libraries Improves the Draft Genome of Clarias batrachus and Culter alburnus","authors":"Kai Liu,&nbsp;Nan Xie,&nbsp;Yuxi Wang,&nbsp;Xinyi Liu","doi":"10.1007/s10126-023-10248-x","DOIUrl":null,"url":null,"abstract":"<div><p>Long-read sequencing technologies can generate highly contiguous genome assemblies compared to short-read methods. However, their higher cost often poses a significant barrier. To address this, we explore the utilization of mapping-based genome assembly and reference-guided assembly as cost-effective alternative approaches. We assess the efficacy of these approaches in improving the contiguity of <i>Clarias batrachus</i> and <i>Culter alburnus</i> draft genomes. Our findings demonstrate that employing an iterative mapping strategy leads to a reduction in assembly errors. Specifically, after three iterations, the Mismatches per 100 kbp value for the <i>C. batrachus</i> genome decreased from 2447.20 to 2432.67, reaching a minimum of 2422.67 after two iterations. Additionally, the N50 value for the <i>C. batrachus</i> genome increased from 362,143 to 1,315,126 bp, with a maximum of 1,315,403 bp after two iterations. Furthermore, we achieved Mismatches per 100 kbp values of 3.70 for the reference-guided assembly of <i>C. batrachus</i> and 0.34 for <i>C. alburnus</i>. Correspondingly, the N50 value for the <i>C. batrachus</i> and <i>C. alburnus</i> genomes increased from 362,143 bp and 3,686,385 bp to 2,026,888 bp and 43,735,735 bp, respectively. Finally, we successfully utilized the improved <i>C. batrachus</i> and <i>C. alburnus</i> genomes to compare genome studies using the combined approach of Ragout and Ragtag. Through a comprehensive comparative analysis of mapping-based and reference-guided genome assembly methods, we shed light on the specific contributions of reference-guided assembly in reducing assembly errors and improving assembly continuity and integrity. These advancements establish reference-guided assembly and the utilization of in silico libraries as a promising and suitable approach for comparative genomics studies.</p></div>","PeriodicalId":690,"journal":{"name":"Marine Biotechnology","volume":"25 6","pages":"907 - 917"},"PeriodicalIF":2.6000,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Marine Biotechnology","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10126-023-10248-x","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Long-read sequencing technologies can generate highly contiguous genome assemblies compared to short-read methods. However, their higher cost often poses a significant barrier. To address this, we explore the utilization of mapping-based genome assembly and reference-guided assembly as cost-effective alternative approaches. We assess the efficacy of these approaches in improving the contiguity of Clarias batrachus and Culter alburnus draft genomes. Our findings demonstrate that employing an iterative mapping strategy leads to a reduction in assembly errors. Specifically, after three iterations, the Mismatches per 100 kbp value for the C. batrachus genome decreased from 2447.20 to 2432.67, reaching a minimum of 2422.67 after two iterations. Additionally, the N50 value for the C. batrachus genome increased from 362,143 to 1,315,126 bp, with a maximum of 1,315,403 bp after two iterations. Furthermore, we achieved Mismatches per 100 kbp values of 3.70 for the reference-guided assembly of C. batrachus and 0.34 for C. alburnus. Correspondingly, the N50 value for the C. batrachus and C. alburnus genomes increased from 362,143 bp and 3,686,385 bp to 2,026,888 bp and 43,735,735 bp, respectively. Finally, we successfully utilized the improved C. batrachus and C. alburnus genomes to compare genome studies using the combined approach of Ragout and Ragtag. Through a comprehensive comparative analysis of mapping-based and reference-guided genome assembly methods, we shed light on the specific contributions of reference-guided assembly in reducing assembly errors and improving assembly continuity and integrity. These advancements establish reference-guided assembly and the utilization of in silico libraries as a promising and suitable approach for comparative genomics studies.

Abstract Image

利用参考文献指导的组装和硅库改进了蝠鲼和 Culter alburnus 的基因组草案。
与短线程方法相比,长线程测序技术可以生成高度连续的基因组集合。然而,其较高的成本往往构成重大障碍。为了解决这个问题,我们探索了基于制图的基因组组装和参考文献指导的组装,作为具有成本效益的替代方法。我们评估了这些方法在改善蝙蝠鲫(Clarias batrachus)和白鲑鱼(Culter alburnus)草图基因组连续性方面的功效。我们的研究结果表明,采用迭代映射策略可减少装配错误。具体来说,经过三次迭代后,蝠鲼基因组每 100 kbp 的错配值从 2447.20 降至 2432.67,两次迭代后达到 2422.67 的最低值。此外,蝙蝠鱼基因组的 N50 值从 362,143 bp 增加到 1,315,126 bp,两次迭代后达到最大值 1,315,403 bp。此外,在参考指导下进行的蝙蝠科动物基因组组装中,每 100 kbp 的错配值为 3.70,而在蝙蝠科动物基因组组装中,每 100 kbp 的错配值为 0.34。相应地,蝙蝠鲈和白鲑鱼基因组的 N50 值分别从 362,143 bp 和 3,686,385 bp 增加到 2,026,888 bp 和 43,735,735 bp。最后,我们成功地利用改良后的蝙蝠鲈和白鲑鱼基因组,采用 Ragout 和 Ragtag 的组合方法对基因组进行了比较研究。通过对基于绘图的基因组组装方法和参考文献指导的基因组组装方法进行全面的比较分析,我们阐明了参考文献指导的组装方法在减少组装错误、提高组装连续性和完整性方面的具体贡献。这些进展确立了以参考文献为指导的组装和利用硅库是比较基因组学研究的一种有前途的合适方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Marine Biotechnology
Marine Biotechnology 工程技术-海洋与淡水生物学
CiteScore
4.80
自引率
3.30%
发文量
95
审稿时长
2 months
期刊介绍: Marine Biotechnology welcomes high-quality research papers presenting novel data on the biotechnology of aquatic organisms. The journal publishes high quality papers in the areas of molecular biology, genomics, proteomics, cell biology, and biochemistry, and particularly encourages submissions of papers related to genome biology such as linkage mapping, large-scale gene discoveries, QTL analysis, physical mapping, and comparative and functional genome analysis. Papers on technological development and marine natural products should demonstrate innovation and novel applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信