Nearly complete genome assembly of a critically endangered pine illuminates evolution and conservation of conifers

Ren-Gang Zhang, Hui Liu, Heng Shu, De-Tuan Liu, Hong-Yun Shang, Kai-Hua Jia, Xiao-Quan Wang, Wei-Bang Sun, Wei Zhao, Yong-Peng Ma
bioRxiv - Genomics, published 2024-08-09
DOI: 10.1101/2024.08.07.607108 (https://doi.org/10.1101/2024.08.07.607108)

Abstract

Conifers dominate most temperate and boreal forest ecosystems and are the most widely distributed gymnosperms. Despite this, many conifer species are threatened with extinction, and the genetic mechanisms underlying their endangerment remain largely unknown. Pinus squamata, which has an extremely large diploid genome and is of conservation significance, is among the 100 most endangered species (plants and animals) globally and is listed as 'Critically Endangered' on the IUCN Red List. In this study, we report a nearly complete genome sequence for P. squamata generated with a suite of sequencing technologies, with an assembly size of 29.2 Gb, a scaffold N50 of 2.5 Gb, and a contig N50 of 915.4 Mb. This represents the largest and highest-quality gymnosperm genome sequenced to date. The genome has an ultra-low rate of heterozygosity, is dominated by transposable elements, and contains 55,413 protein-coding genes. Our study provides the first detailed examination of chromosome organization in P. squamata, revealing Rabl configurations and distinctive centromere signatures. This genomic milestone deepens our understanding of gymnosperm genetics and evolution and lays a foundation for developing effective conservation measures to help this rare species survive in the face of environmental challenges.
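For readers unfamiliar with the N50 metric quoted in the abstract, a minimal sketch follows. It assumes the common definition of N50 (the length L such that sequences of length at least L together cover at least half of the total assembly). The function name `n50` and the toy lengths are illustrative assumptions and are not taken from the paper or its assembly pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Assumes the common definition: sort lengths from longest to shortest,
    accumulate them, and report the length at which the running sum first
    reaches half of the total.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    # Only reached when no lengths were supplied.
    raise ValueError("lengths must be non-empty")


# Hypothetical toy lengths (arbitrary units), not data from the study.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))  # 80 + 70 = 150 >= 145, so this prints 70
```

Applied to contig lengths this gives the contig N50, and applied to scaffold lengths the scaffold N50; a larger value indicates a more contiguous assembly.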