Genome assembly at chromosome scale with telomere ends for Pearlspot, Etroplus suratensis.
Vinaya Kumar Katneni, Karthic Krishnan, Sudheesh K Prabhudas, Roja Jayaraman, Nida Quraishi, Kumaraguru Vasagam, Ashok Kumar Jangam, Jesudhas Raymond Jani Angel, Nimisha Kaikkolante, Kumaravel Jayaraman, S Shekhar Mudagandur
Scientific Data, volume 11, issue 1, article 1226. Published 2024-11-13. Publication type: Journal Article.
DOI: 10.1038/s41597-024-04096-0 (https://doi.org/10.1038/s41597-024-04096-0)
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560961/pdf/
Citations: 0
Abstract
The pearlspot, Etroplus suratensis, is a climate-resilient cichlid fish that exhibits unusual adaptation to salinity. It can complete its full life cycle in habitats of diverse salinity, ranging from fresh water to marine environments. High-quality primary and phased genome assemblies were generated for pearlspot for the first time, using PacBio HiFi and Arima Hi-C sequencing technologies. The primary assembly is highly contiguous, with a contig N50 length of 36 Mb. The final assembly is 1.247 Gb in size, with an N50 length of 51.57 Mb and 98% of the genome length anchored to 24 chromosomes. The genome was assessed to be 99.9% complete based on BUSCO evaluation and was predicted to contain 52.96% repeat elements. We predicted 27,192 protein-encoding genes, of which 21,580 were functionally annotated. The genome offers an invaluable resource for understanding the adaptation of pearlspot to diverse salinity habitats.
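The abstract reports contiguity as N50 values without defining the metric. As a minimal sketch, assuming the conventional definition (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length), N50 can be computed as below. The function name `n50` and the toy lengths are hypothetical illustrations, not the authors' pipeline or data; only the two gene counts come from the abstract.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp).

    Assumes the conventional definition: sort lengths from longest to
    shortest and return the length at which the running total first
    reaches half of the summed length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must not be empty")


# Hypothetical toy lengths, NOT data from the pearlspot assembly.
toy_lengths = [50, 40, 30, 20, 10]
toy_n50 = n50(toy_lengths)

# Arithmetic on figures reported in the abstract:
# 21,580 functionally annotated genes out of 27,192 predicted genes.
annotated_fraction = 21_580 / 27_192

print(f"toy N50: {toy_n50}")
print(f"functionally annotated: {annotated_fraction:.1%}")
```

In the toy case the total length is 150, so the running sum first reaches the halfway point of 75 at the second-longest sequence. The same function could be applied to contig or scaffold lengths read from an assembly FASTA index, though the abstract does not state which tool the authors used to compute their values.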
About the journal:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.