Shri Hari Prasad, Deepu Mathew, Ravisankar Valsalan
{"title":"An efficient repeat masking library for the genomic data of coconut and related trees","authors":"Shri Hari Prasad, Deepu Mathew, Ravisankar Valsalan","doi":"10.2478/sg-2023-0011","DOIUrl":null,"url":null,"abstract":"Abstract Even though repeat masking using custom designed libraries significantly improves the genome annotation and gene prediction, such libraries for palm trees are yet to be designed and made accessible to the researchers. In this study, a repeat library was designed and validated for use in coconut and related palm genomes. Coconut genome with chromosome-level assembly was used to design independent libraries for tall and dwarf ecotypes, which were subsequently merged. Efficiency of the combined de novo library in genome annotation and gene prediction was assessed in comparison with the conventional libraries (Dfam+RepBase), using RepeatMasker. De novo library had 76.3 % efficiency in coconut genomes compared to 3.51 % in custom libraries and number of genes predicted was reduced from an average of 193,099 to 31,022. In date palm, oil pam and sago palm also, combined library gave higher repeat masking and reduced the number of genes predicted. The de novo library can be accessed at http://www.kau.in/repeat-libraries.","PeriodicalId":21834,"journal":{"name":"Silvae Genetica","volume":"13 1","pages":"114 - 117"},"PeriodicalIF":1.1000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Silvae Genetica","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.2478/sg-2023-0011","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"FORESTRY","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Even though repeat masking using custom designed libraries significantly improves the genome annotation and gene prediction, such libraries for palm trees are yet to be designed and made accessible to the researchers. In this study, a repeat library was designed and validated for use in coconut and related palm genomes. Coconut genome with chromosome-level assembly was used to design independent libraries for tall and dwarf ecotypes, which were subsequently merged. Efficiency of the combined de novo library in genome annotation and gene prediction was assessed in comparison with the conventional libraries (Dfam+RepBase), using RepeatMasker. De novo library had 76.3 % efficiency in coconut genomes compared to 3.51 % in custom libraries and number of genes predicted was reduced from an average of 193,099 to 31,022. In date palm, oil pam and sago palm also, combined library gave higher repeat masking and reduced the number of genes predicted. The de novo library can be accessed at http://www.kau.in/repeat-libraries.
期刊介绍:
Silvae Genetica is an international peer reviewed journal with more than 65 year tradition and experience in all fields of theoretical and applied Forest Genetics and Tree breeding. It continues "Zeitschrift für Forstgenetik und Forstpflanzenzüchtung" (Journal of Forest Genetics and Forest Tree Breeding) founded by W. LANGNER in 1951.