{"title":"The telomere-to-telomere gapless genome of grass carp provides insights for genetic improvement.","authors":"Fei Liu, Yuan Li, Guishuang Wang, Dong Zhang, Xinlan Yang, Chaowei Zhou, Rongzhu Zhou, Haiping Liu","doi":"10.1093/gigascience/giaf059","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The grass carp (Ctenopharyngodon idella) is a large herbivorous freshwater fish belonging to the Cyprinidae family. It is widely cultivated as a food source in China and is renowned as one of the Four Great Domestic Fishes. Despite its economic importance, the published genome assemblies of grass carp remain incomplete due to gaps, thereby hindering molecular research and genetic improvement.</p><p><strong>Results: </strong>In this study, we report the assembly of a telomere-to-telomere (T2T) gap-free genome of the grass carp with total length of 890,918,310 bp for 24 chromosomes without gaps, representing the highest completeness and assembly quality to date. Our assembly contains 27,446 protein-coding genes, and 93.04% of all were annotated with multiple databases, with 48 telomeres and 24 centromeres characterized. Gap-free reference genomes enable us to study the structure of centromeres and identify conserved centromere-specific satellite motifs for grass carp. Furthermore, we identified 108 gene-related gaps across 12 chromosomes and 38 structural variations across 17 chromosomes in this T2T assembly.</p><p><strong>Conclusions: </strong>The validated gap-free genome provides invaluable resource for future genomic studies grass carp, offering new insights into its genetic architecture and evolutionary dynamics.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":"14 ","pages":""},"PeriodicalIF":11.8000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204074/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaScience","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/gigascience/giaf059","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The grass carp (Ctenopharyngodon idella) is a large herbivorous freshwater fish belonging to the Cyprinidae family. It is widely cultivated as a food source in China and is renowned as one of the Four Great Domestic Fishes. Despite its economic importance, the published genome assemblies of grass carp remain incomplete due to gaps, thereby hindering molecular research and genetic improvement.
Results: In this study, we report the assembly of a telomere-to-telomere (T2T) gap-free genome of the grass carp with total length of 890,918,310 bp for 24 chromosomes without gaps, representing the highest completeness and assembly quality to date. Our assembly contains 27,446 protein-coding genes, and 93.04% of all were annotated with multiple databases, with 48 telomeres and 24 centromeres characterized. Gap-free reference genomes enable us to study the structure of centromeres and identify conserved centromere-specific satellite motifs for grass carp. Furthermore, we identified 108 gene-related gaps across 12 chromosomes and 38 structural variations across 17 chromosomes in this T2T assembly.
Conclusions: The validated gap-free genome provides invaluable resource for future genomic studies grass carp, offering new insights into its genetic architecture and evolutionary dynamics.
期刊介绍:
GigaScience seeks to transform data dissemination and utilization in the life and biomedical sciences. As an online open-access open-data journal, it specializes in publishing "big-data" studies encompassing various fields. Its scope includes not only "omic" type data and the fields of high-throughput biology currently serviced by large public repositories, but also the growing range of more difficult-to-access data, such as imaging, neuroscience, ecology, cohort data, systems biology and other new types of large-scale shareable data.