{"title":"Chromosome-level genome assembly of a critically endangered species Leuciscus chuanchicus.","authors":"Qi Wang, Qi Zhou, Hongyan Liu, Jiongtang Li, Yanliang Jiang","doi":"10.1038/s41597-025-04787-2","DOIUrl":null,"url":null,"abstract":"<p><p>Leuciscus chuanchicus, a critically endangered cyprinid endemic in the Yellow River, represents an evolutionary significant lineage within Leuciscinae. However, conservation efforts for this species have been hindered by the lack of genetic and genomic resources. Here we reported a high-quality chromosome-level genome of L. chuanchicus by combining Illumina reads, PacBio HiFi long reads and Hi-C data. The assembled genome size was 1.16 Gb, with a contig N50 size of 31,116,631 bp and a scaffold N50 size of 43,855,677 bp. The resulting 130 scaffolds were further clustered and ordered into 25 chromosomes based on the Hi-C data, representing 97.84% of the assembled sequences. The genome contained 60.36% repetitive sequences and 35,014 noncoding RNAs. A total of 31,196 protein-coding genes were predicted, of which 28,323 (90.79%) were functionally annotated. The BUSCO and OMArk revealed 97.6% and 91.28% completion rates, respectively. This study assembled a high-quality genome of L. chuanchicus, and provided fundamental genomic resources for investigating the molecular mechanism and evolution of the Leuciscinae.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"441"},"PeriodicalIF":5.8000,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11910599/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-04787-2","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Leuciscus chuanchicus, a critically endangered cyprinid endemic in the Yellow River, represents an evolutionary significant lineage within Leuciscinae. However, conservation efforts for this species have been hindered by the lack of genetic and genomic resources. Here we reported a high-quality chromosome-level genome of L. chuanchicus by combining Illumina reads, PacBio HiFi long reads and Hi-C data. The assembled genome size was 1.16 Gb, with a contig N50 size of 31,116,631 bp and a scaffold N50 size of 43,855,677 bp. The resulting 130 scaffolds were further clustered and ordered into 25 chromosomes based on the Hi-C data, representing 97.84% of the assembled sequences. The genome contained 60.36% repetitive sequences and 35,014 noncoding RNAs. A total of 31,196 protein-coding genes were predicted, of which 28,323 (90.79%) were functionally annotated. The BUSCO and OMArk revealed 97.6% and 91.28% completion rates, respectively. This study assembled a high-quality genome of L. chuanchicus, and provided fundamental genomic resources for investigating the molecular mechanism and evolution of the Leuciscinae.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.