Mathabatha F. Maleka , Tsholofelo J. Modise , Morné G. Du Plessis , Gesine M. Coetzer
{"title":"Identification and characterization of sequence variants from a de novo-assembled partial pan-genome of cactus pear (Opuntia L.)","authors":"Mathabatha F. Maleka , Tsholofelo J. Modise , Morné G. Du Plessis , Gesine M. Coetzer","doi":"10.1016/j.sajb.2024.10.023","DOIUrl":null,"url":null,"abstract":"<div><div>Members of the genus <em>Opuntia</em> exhibit extensive morphological variation that is seemingly linked to their propensity for occupying extreme environments all over the world. The plants are very important to the agriculture industry as they have many health-promoting nutritional and bioactive compounds. Also, as succulents, they are interesting because they develop peculiar stems – which, however, fulfil the role of leaves – that enable the storage of copious amounts of water. Yet, little is known about the genomes of <em>Opuntia</em> species although having such data can give insight into the genetic diversity that regulates the observed morphological variation as well as the molecular processes responsible for the development of succulent leaves. The current study aimed to sequence and <em>de novo</em> assemble a partial reference genome of <em>Opuntia</em> that would be used to find sequence variants for differentiating species and cultivars of this important agricultural crop. Illumina sequencing was performed on 10 cultivars that represented two species (<em>Opuntia ficus-indica</em> and <em>O. robusta</em>) within the South African <em>Opuntia</em> germplasm. Sequence assembly of 214 million filtered high-quality reads generated a 657 Mbp partial pan-genome that represented ∼30 % of a predicted <em>O. ficus-indica</em> genome. Functional annotation of the assembled sequences at different bioinformatic databases revealed many genes relating to diverse developmental pathways as well as water storage. Further, sequence variants including some 60,000 simple sequence repeats (SSRs) and 118,000 biallelic single nucleotide polymorphisms (SNPs) were discovered by mapping reads of each cultivar against the assembled partial reference pan-genome of <em>Opuntia</em>. While the identified number of SNPs is significant it, however, corresponded to moderate and higher frequency variants only (minor allele frequency [MAF] > 0.2). Also, a random selection of 1,000 SNPs genotyped across all specimens were able to differentiate the 10 cultivars and two species studied herein. Overall, the current study reports the first partial reference pan-genome of <em>Opuntia</em> specimens. The generated data should serve as a valuable genetic resource for future studies seeking to analyze diversity-related phenomena or elucidate the molecular development and growth of <em>Opuntia</em> specimens.</div></div>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0254629924006574","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
Abstract
Members of the genus Opuntia exhibit extensive morphological variation that is seemingly linked to their propensity for occupying extreme environments all over the world. The plants are very important to the agriculture industry as they have many health-promoting nutritional and bioactive compounds. Also, as succulents, they are interesting because they develop peculiar stems – which, however, fulfil the role of leaves – that enable the storage of copious amounts of water. Yet, little is known about the genomes of Opuntia species although having such data can give insight into the genetic diversity that regulates the observed morphological variation as well as the molecular processes responsible for the development of succulent leaves. The current study aimed to sequence and de novo assemble a partial reference genome of Opuntia that would be used to find sequence variants for differentiating species and cultivars of this important agricultural crop. Illumina sequencing was performed on 10 cultivars that represented two species (Opuntia ficus-indica and O. robusta) within the South African Opuntia germplasm. Sequence assembly of 214 million filtered high-quality reads generated a 657 Mbp partial pan-genome that represented ∼30 % of a predicted O. ficus-indica genome. Functional annotation of the assembled sequences at different bioinformatic databases revealed many genes relating to diverse developmental pathways as well as water storage. Further, sequence variants including some 60,000 simple sequence repeats (SSRs) and 118,000 biallelic single nucleotide polymorphisms (SNPs) were discovered by mapping reads of each cultivar against the assembled partial reference pan-genome of Opuntia. While the identified number of SNPs is significant it, however, corresponded to moderate and higher frequency variants only (minor allele frequency [MAF] > 0.2). Also, a random selection of 1,000 SNPs genotyped across all specimens were able to differentiate the 10 cultivars and two species studied herein. Overall, the current study reports the first partial reference pan-genome of Opuntia specimens. The generated data should serve as a valuable genetic resource for future studies seeking to analyze diversity-related phenomena or elucidate the molecular development and growth of Opuntia specimens.