Ali Ali, Guangtu Gao, Rafet Al-Tobasei, Ramey C Youngblood, Geoffrey C Waldbieser, Brian E Scheffler, Yniv Palti, Mohamed Salem
{"title":"Chromosome level genome assembly and annotation of the Swanson rainbow trout homozygous line.","authors":"Ali Ali, Guangtu Gao, Rafet Al-Tobasei, Ramey C Youngblood, Geoffrey C Waldbieser, Brian E Scheffler, Yniv Palti, Mohamed Salem","doi":"10.1038/s41597-025-04693-7","DOIUrl":null,"url":null,"abstract":"<p><p>The genome of the Swanson doubled haploid (DH) YY male line of rainbow trout was de novo assembled using the Canu pipeline, high-coverage PacBio long-read sequence data, Bionano optical maps, and Hi-C proximity ligation sequence data, resulting in 29 major scaffolds aligning with the karyotype of the Swanson line (2 N = 58). This assembly, totaling 2.3 Gb with an N50 of 52.4 Mb, represents approximately 95% of the genome in 29 chromosome sequences with only 109 gaps between scaffolds. Notably, corrections to previous errors in the Swanson line genome assembly were made, including the identification of a double large inversion on the Omy05 chromosome (~57 Mb), the absence of the Omy20 inversion between the Arlee and Swanson assemblies, and the discovery of a ~6.7 Mb inversion on Omy26. This comprehensive assembly contributes to refining the rainbow trout reference genome and serves as a valuable resource for future genetic studies within this species.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"345"},"PeriodicalIF":6.9000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11865591/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-04693-7","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The genome of the Swanson doubled haploid (DH) YY male line of rainbow trout was de novo assembled using the Canu pipeline, high-coverage PacBio long-read sequence data, Bionano optical maps, and Hi-C proximity ligation sequence data, resulting in 29 major scaffolds aligning with the karyotype of the Swanson line (2 N = 58). This assembly, totaling 2.3 Gb with an N50 of 52.4 Mb, represents approximately 95% of the genome in 29 chromosome sequences with only 109 gaps between scaffolds. Notably, corrections to previous errors in the Swanson line genome assembly were made, including the identification of a double large inversion on the Omy05 chromosome (~57 Mb), the absence of the Omy20 inversion between the Arlee and Swanson assemblies, and the discovery of a ~6.7 Mb inversion on Omy26. This comprehensive assembly contributes to refining the rainbow trout reference genome and serves as a valuable resource for future genetic studies within this species.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.