Hoyeol Jang, Ara Cho, Hyuk-Jin Kim, Haneul Kim, Seung-Hoon Jeong, Sun Mi Huh, Hee-Ju Yu, Dong-Kab Kim, Joo-Hwan Kim, Jeong-Hwan Mun
{"title":"韩国特有种双叶翘和连翘的染色体水平组合。","authors":"Hoyeol Jang, Ara Cho, Hyuk-Jin Kim, Haneul Kim, Seung-Hoon Jeong, Sun Mi Huh, Hee-Ju Yu, Dong-Kab Kim, Joo-Hwan Kim, Jeong-Hwan Mun","doi":"10.1038/s41597-024-04252-6","DOIUrl":null,"url":null,"abstract":"<p><p>Abeliophyllum distichum and Forsythia ovata are closely related species endemic to Korea and are highly valued as ornamental shrubs in the Oleaceae family. A combination of PacBio and Illumina sequencing with Hi-C scaffolding technologies was employed to develop chromosome-level genome assemblies of these species. The assembled genome sizes are 795.72 Mb for A. distichum and 1,108.53 Mb for F. ovata. The assemblies exhibit scaffold N50 lengths of 53.12 Mb and 68.97 Mb, with minimal gaps measuring 323.40 kb and 149.00 kb, and 97.71% and 98.82% BUSCO scores for Embryophyta single-copy orthologs, respectively, indicating high contiguity and completeness. The genomes contain 485.24 Mb and 691.68 Mb of repetitive sequences, 4,926 and 7,175 full-length long terminal repeat retrotransposons, and 49,414 and 57,587 protein-coding genes, respectively. The 14 pseudochromosomes encompass 93.80% of the A. distichum genome and 89.11% of the F. ovata genome, thereby demonstrating one-to-one chromosome-level collinearity. These high-quality genome assemblies serve as invaluable resources for genetic and breeding studies, facilitating a deeper understanding of the evolutionary history of these distinctive species.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1372"},"PeriodicalIF":6.9000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11655977/pdf/","citationCount":"0","resultStr":"{\"title\":\"Chromosome-level assemblies of the endemic Korean species Abeliophyllum distichum and Forsythia ovata.\",\"authors\":\"Hoyeol Jang, Ara Cho, Hyuk-Jin Kim, Haneul Kim, Seung-Hoon Jeong, Sun Mi Huh, Hee-Ju Yu, Dong-Kab Kim, Joo-Hwan Kim, Jeong-Hwan Mun\",\"doi\":\"10.1038/s41597-024-04252-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Abeliophyllum distichum and Forsythia ovata are closely related species endemic to Korea and are highly valued as ornamental shrubs in the Oleaceae family. A combination of PacBio and Illumina sequencing with Hi-C scaffolding technologies was employed to develop chromosome-level genome assemblies of these species. The assembled genome sizes are 795.72 Mb for A. distichum and 1,108.53 Mb for F. ovata. The assemblies exhibit scaffold N50 lengths of 53.12 Mb and 68.97 Mb, with minimal gaps measuring 323.40 kb and 149.00 kb, and 97.71% and 98.82% BUSCO scores for Embryophyta single-copy orthologs, respectively, indicating high contiguity and completeness. The genomes contain 485.24 Mb and 691.68 Mb of repetitive sequences, 4,926 and 7,175 full-length long terminal repeat retrotransposons, and 49,414 and 57,587 protein-coding genes, respectively. The 14 pseudochromosomes encompass 93.80% of the A. distichum genome and 89.11% of the F. ovata genome, thereby demonstrating one-to-one chromosome-level collinearity. These high-quality genome assemblies serve as invaluable resources for genetic and breeding studies, facilitating a deeper understanding of the evolutionary history of these distinctive species.</p>\",\"PeriodicalId\":21597,\"journal\":{\"name\":\"Scientific Data\",\"volume\":\"11 1\",\"pages\":\"1372\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2024-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11655977/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Data\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41597-024-04252-6\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-024-04252-6","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Chromosome-level assemblies of the endemic Korean species Abeliophyllum distichum and Forsythia ovata.
Abeliophyllum distichum and Forsythia ovata are closely related species endemic to Korea and are highly valued as ornamental shrubs in the Oleaceae family. A combination of PacBio and Illumina sequencing with Hi-C scaffolding technologies was employed to develop chromosome-level genome assemblies of these species. The assembled genome sizes are 795.72 Mb for A. distichum and 1,108.53 Mb for F. ovata. The assemblies exhibit scaffold N50 lengths of 53.12 Mb and 68.97 Mb, with minimal gaps measuring 323.40 kb and 149.00 kb, and 97.71% and 98.82% BUSCO scores for Embryophyta single-copy orthologs, respectively, indicating high contiguity and completeness. The genomes contain 485.24 Mb and 691.68 Mb of repetitive sequences, 4,926 and 7,175 full-length long terminal repeat retrotransposons, and 49,414 and 57,587 protein-coding genes, respectively. The 14 pseudochromosomes encompass 93.80% of the A. distichum genome and 89.11% of the F. ovata genome, thereby demonstrating one-to-one chromosome-level collinearity. These high-quality genome assemblies serve as invaluable resources for genetic and breeding studies, facilitating a deeper understanding of the evolutionary history of these distinctive species.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.