Zheng-Feng Wang, En-Ping Yu, Lin Fu, Hua-Ge Deng, Wei-Guang Zhu, Feng-Xia Xu, Hong-Lin Cao
{"title":"三种红藓的染色体尺度组合:重复序列分布和结构重排。","authors":"Zheng-Feng Wang, En-Ping Yu, Lin Fu, Hua-Ge Deng, Wei-Guang Zhu, Feng-Xia Xu, Hong-Lin Cao","doi":"10.1093/gigascience/giaf047","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The genus Ormosia belongs to the Fabaceae family; almost all Ormosia species are endemic to China, which is considered one of the centers of this genus. Thus, genomic studies on the genus are needed to better understand species evolution and ensure the conservation and utilization of these species. We performed a chromosome-scale assembly of O. purpureiflora and updated the chromosome-scale assemblies of O. emarginata and O. semicastrata for comparative genomics.</p><p><strong>Findings: </strong>The genome assembly sizes of the 3 species ranged from 1.42 to 1.58 Gb, with O. purpureiflora being the largest. Repetitive sequences accounted for 74.0-76.3% of the genomes, and the predicted gene counts ranged from 50,517 to 55,061. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis indicated 97.0-98.4% genome completeness, whereas the long terminal repeat (LTR) assembly index values ranged from 13.66 to 17.56, meeting the \"reference genome\" quality standard. Gene completeness, assessed using BUSCO and OMArk, ranged from 95.1% to 96.3% and from 97.1% to 98.1%, respectively.Characterizing genome architectures further revealed that inversions were the main structural rearrangements in Ormosia. In numbers, density distributions of repetitive elements revealed the types of Helitron and terminal inverted repeat (TIR) elements and the types of Gypsy and unknown LTR retrotransposons (LTR-RTs) concentrated in different regions on the chromosomes, whereas Copia LTR-RTs were generally evenly distributed along the chromosomes in Ormosia.Compared with the sister species Lupinus albus, Ormosia species had lower numbers and percentages of resistance (R) genes and transcription factor genes. Genes related to alkaloid, terpene, and flavonoid biosynthesis were found to be duplicated through tandem or proximal duplications. Notably, some genes associated with growth and defense were absent in O. purpureiflora.By resequencing 153 genotypes (∼30 Gb of data per sample) from 6 O. purpureiflora (sub)populations, we identified 40,146 single nucleotide polymorphisms. Corresponding to its very small populations, O. purpureiflora exhibited low genetic diversity.</p><p><strong>Conclusions: </strong>The Ormosia genome assemblies provide valuable resources for studying the evolution, conservation, and potential utility of both Ormosia and Fabaceae species.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":"14 ","pages":""},"PeriodicalIF":11.8000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083454/pdf/","citationCount":"0","resultStr":"{\"title\":\"Chromosome-scale assemblies of three Ormosia species: repetitive sequences distribution and structural rearrangement.\",\"authors\":\"Zheng-Feng Wang, En-Ping Yu, Lin Fu, Hua-Ge Deng, Wei-Guang Zhu, Feng-Xia Xu, Hong-Lin Cao\",\"doi\":\"10.1093/gigascience/giaf047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The genus Ormosia belongs to the Fabaceae family; almost all Ormosia species are endemic to China, which is considered one of the centers of this genus. Thus, genomic studies on the genus are needed to better understand species evolution and ensure the conservation and utilization of these species. We performed a chromosome-scale assembly of O. purpureiflora and updated the chromosome-scale assemblies of O. emarginata and O. semicastrata for comparative genomics.</p><p><strong>Findings: </strong>The genome assembly sizes of the 3 species ranged from 1.42 to 1.58 Gb, with O. purpureiflora being the largest. Repetitive sequences accounted for 74.0-76.3% of the genomes, and the predicted gene counts ranged from 50,517 to 55,061. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis indicated 97.0-98.4% genome completeness, whereas the long terminal repeat (LTR) assembly index values ranged from 13.66 to 17.56, meeting the \\\"reference genome\\\" quality standard. Gene completeness, assessed using BUSCO and OMArk, ranged from 95.1% to 96.3% and from 97.1% to 98.1%, respectively.Characterizing genome architectures further revealed that inversions were the main structural rearrangements in Ormosia. In numbers, density distributions of repetitive elements revealed the types of Helitron and terminal inverted repeat (TIR) elements and the types of Gypsy and unknown LTR retrotransposons (LTR-RTs) concentrated in different regions on the chromosomes, whereas Copia LTR-RTs were generally evenly distributed along the chromosomes in Ormosia.Compared with the sister species Lupinus albus, Ormosia species had lower numbers and percentages of resistance (R) genes and transcription factor genes. Genes related to alkaloid, terpene, and flavonoid biosynthesis were found to be duplicated through tandem or proximal duplications. Notably, some genes associated with growth and defense were absent in O. purpureiflora.By resequencing 153 genotypes (∼30 Gb of data per sample) from 6 O. purpureiflora (sub)populations, we identified 40,146 single nucleotide polymorphisms. Corresponding to its very small populations, O. purpureiflora exhibited low genetic diversity.</p><p><strong>Conclusions: </strong>The Ormosia genome assemblies provide valuable resources for studying the evolution, conservation, and potential utility of both Ormosia and Fabaceae species.</p>\",\"PeriodicalId\":12581,\"journal\":{\"name\":\"GigaScience\",\"volume\":\"14 \",\"pages\":\"\"},\"PeriodicalIF\":11.8000,\"publicationDate\":\"2025-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083454/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GigaScience\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/gigascience/giaf047\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaScience","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/gigascience/giaf047","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Chromosome-scale assemblies of three Ormosia species: repetitive sequences distribution and structural rearrangement.
Background: The genus Ormosia belongs to the Fabaceae family; almost all Ormosia species are endemic to China, which is considered one of the centers of this genus. Thus, genomic studies on the genus are needed to better understand species evolution and ensure the conservation and utilization of these species. We performed a chromosome-scale assembly of O. purpureiflora and updated the chromosome-scale assemblies of O. emarginata and O. semicastrata for comparative genomics.
Findings: The genome assembly sizes of the 3 species ranged from 1.42 to 1.58 Gb, with O. purpureiflora being the largest. Repetitive sequences accounted for 74.0-76.3% of the genomes, and the predicted gene counts ranged from 50,517 to 55,061. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis indicated 97.0-98.4% genome completeness, whereas the long terminal repeat (LTR) assembly index values ranged from 13.66 to 17.56, meeting the "reference genome" quality standard. Gene completeness, assessed using BUSCO and OMArk, ranged from 95.1% to 96.3% and from 97.1% to 98.1%, respectively.Characterizing genome architectures further revealed that inversions were the main structural rearrangements in Ormosia. In numbers, density distributions of repetitive elements revealed the types of Helitron and terminal inverted repeat (TIR) elements and the types of Gypsy and unknown LTR retrotransposons (LTR-RTs) concentrated in different regions on the chromosomes, whereas Copia LTR-RTs were generally evenly distributed along the chromosomes in Ormosia.Compared with the sister species Lupinus albus, Ormosia species had lower numbers and percentages of resistance (R) genes and transcription factor genes. Genes related to alkaloid, terpene, and flavonoid biosynthesis were found to be duplicated through tandem or proximal duplications. Notably, some genes associated with growth and defense were absent in O. purpureiflora.By resequencing 153 genotypes (∼30 Gb of data per sample) from 6 O. purpureiflora (sub)populations, we identified 40,146 single nucleotide polymorphisms. Corresponding to its very small populations, O. purpureiflora exhibited low genetic diversity.
Conclusions: The Ormosia genome assemblies provide valuable resources for studying the evolution, conservation, and potential utility of both Ormosia and Fabaceae species.
期刊介绍:
GigaScience seeks to transform data dissemination and utilization in the life and biomedical sciences. As an online open-access open-data journal, it specializes in publishing "big-data" studies encompassing various fields. Its scope includes not only "omic" type data and the fields of high-throughput biology currently serviced by large public repositories, but also the growing range of more difficult-to-access data, such as imaging, neuroscience, ecology, cohort data, systems biology and other new types of large-scale shareable data.