Yan Wang, Xiaopeng Hao, Chunhai Chen, Haigang Wang, Peng Gao, Xukui Yang, Xue Dong, Huibin Qin, Meng Li, Sen Hou, Jianbo Jian, Jianwu Chang, Jing Wu, Zhixin Mu
{"title":"Telomere-to-telomere genome of common bean (Phaseolus vulgaris L., YP4).","authors":"Yan Wang, Xiaopeng Hao, Chunhai Chen, Haigang Wang, Peng Gao, Xukui Yang, Xue Dong, Huibin Qin, Meng Li, Sen Hou, Jianbo Jian, Jianwu Chang, Jing Wu, Zhixin Mu","doi":"10.1093/gigascience/giaf001","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Common bean is a significant grain legume in human diets. However, the lack of a complete reference genome for common beans has hindered efforts to improve agronomic cultivars.</p><p><strong>Findings: </strong>Herein, we present the first telomere-to-telomere (T2T) genome assembly of common bean (Phaseolus vulgaris L., YP4) using PacBio High-Fidelity reads, ONT ultra-long sequencing, and Hi-C technologies. The assembly resulted in a genome size of 560.30 Mb with an N50 of 55.11 Mb, exhibiting high completeness and accuracy (BUSCO score: 99.5%, quality value (QV): 54.86). The sequences were anchored into 11 chromosomes, with 20 of 22 telomeres identified, leading to the formation of 9 T2T pseudomolecules. Furthermore, we identified repetitive elements accounting for 61.20% of the genome and predicted 29,925 protein-coding genes. Phylogenetic analysis suggested an estimated divergence time of approximately 11.6 million years ago between P. vulgaris and Vigna angularis. Comparative genome analysis revealed the expanded gene families and variations between YP4 and G19833 associated with defense response.</p><p><strong>Conclusions: </strong>The T2T reference genome and genomic insights presented here are crucial for future genetic studies not only in common bean but also in other legumes.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":"14 ","pages":""},"PeriodicalIF":11.8000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12077395/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaScience","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/gigascience/giaf001","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Common bean is a significant grain legume in human diets. However, the lack of a complete reference genome for common beans has hindered efforts to improve agronomic cultivars.
Findings: Herein, we present the first telomere-to-telomere (T2T) genome assembly of common bean (Phaseolus vulgaris L., YP4) using PacBio High-Fidelity reads, ONT ultra-long sequencing, and Hi-C technologies. The assembly resulted in a genome size of 560.30 Mb with an N50 of 55.11 Mb, exhibiting high completeness and accuracy (BUSCO score: 99.5%, quality value (QV): 54.86). The sequences were anchored into 11 chromosomes, with 20 of 22 telomeres identified, leading to the formation of 9 T2T pseudomolecules. Furthermore, we identified repetitive elements accounting for 61.20% of the genome and predicted 29,925 protein-coding genes. Phylogenetic analysis suggested an estimated divergence time of approximately 11.6 million years ago between P. vulgaris and Vigna angularis. Comparative genome analysis revealed the expanded gene families and variations between YP4 and G19833 associated with defense response.
Conclusions: The T2T reference genome and genomic insights presented here are crucial for future genetic studies not only in common bean but also in other legumes.
期刊介绍:
GigaScience seeks to transform data dissemination and utilization in the life and biomedical sciences. As an online open-access open-data journal, it specializes in publishing "big-data" studies encompassing various fields. Its scope includes not only "omic" type data and the fields of high-throughput biology currently serviced by large public repositories, but also the growing range of more difficult-to-access data, such as imaging, neuroscience, ecology, cohort data, systems biology and other new types of large-scale shareable data.