A haplotype-resolved chromosomal-level genome assembly of Oxalis articulata.
Authors: Wenjie Yang, Changci Jiang, Changwei Bi, Zhuang Zhao, Chenlong Fu, Fangping Li, Zhuangwei Hou, Quanjun Hu, Zefu Wang
Journal: Scientific Data, volume 12 (1), article 856
Publication date: 2025-05-23
DOI: https://doi.org/10.1038/s41597-025-05237-9
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12102200/pdf/
Citation count: 0
Abstract
Oxalis articulata is a widely distributed perennial herb known for its adaptability to diverse environmental conditions. Here, we report the first haplotype-resolved, chromosome-scale genome assembly for O. articulata (Oxalidaceae), generated through PacBio HiFi long reads and Hi-C technology. The resulting assembly comprises two haplotypes with sizes of 377.04 Mb (contig N50 = 31.13 Mb) and 342.70 Mb (contig N50 = 47.46 Mb), respectively. We predicted 36,063 protein-coding genes (94.33% functionally annotated) and 38,292 protein-coding genes (93.80% functionally annotated) in the two haplotypes, respectively. This high-quality genome assembly lays a solid groundwork for future initiatives in ecological and biochemical research for O. articulata.
About the journal:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than to test hypotheses or present new interpretations, methods, or in-depth analyses.