Adam P Ryan, Sean Bergin, Jillian Scully, Evelyn Zuniga-Soto, Conor Hession, Amelia E Barber, Emilia Gómez-Molero, Oliver Bader, Florent Morio, Bing Zhai, Kenneth H Wolfe, Geraldine Butler
{"title":"假丝酵母的小泛基因组反映了整体的低种内多样性。","authors":"Adam P Ryan, Sean Bergin, Jillian Scully, Evelyn Zuniga-Soto, Conor Hession, Amelia E Barber, Emilia Gómez-Molero, Oliver Bader, Florent Morio, Bing Zhai, Kenneth H Wolfe, Geraldine Butler","doi":"10.1128/mbio.01320-25","DOIUrl":null,"url":null,"abstract":"<p><p><i>Candida parapsilosis</i> is an opportunistic yeast pathogen that can cause life-threatening infections in immunocompromised humans. Whole-genome sequencing studies of the species have demonstrated remarkably low diversity, with strains typically differing by about 1.5 single nucleotide polymorphisms (SNPs) per 10 kb. However, SNP calling alone does not capture the full extent of genetic variation. Here, we define the pangenome of 372 <i>C</i>. <i>parapsilosis</i> isolates to determine variation in gene content. The pangenome consists of 5,859 genes, of which 48 are not found in the genome of the reference strain. This includes 5,791 core genes (present in ≥99.5% of isolates). Four genes, including the allantoin permease gene <i>DAL4</i>, were present in all isolates but were truncated in some strains. The truncated <i>DAL4</i> was classified as a pseudogene in the reference strain CDC317. CRISPR-Cas9 gene editing showed that removing the early stop codon (producing the full-length Dal4 protein) is associated with improved use of allantoin as a sole nitrogen source. We find that the accessory genome of <i>C. parapsilosis</i> consists of 68 homologous clusters. This includes 38 previously annotated genes, 27 novel paralogs of previously annotated genes, and 3 uncharacterized open reading frames. Approximately one-third of the accessory genome (24/68 genes) is associated with gene fusions between tandem genes in the major facilitator superfamily. Additionally, we identified two highly divergent <i>C. parapsilosis</i> strains and found that, despite their increased phylogenetic distance (~30 SNPs per 10 kb), both strains have similar gene content to the other 372.IMPORTANCE<i>Candida parapsilosis</i> is a human fungal pathogen listed in the high-priority group by the World Health Organization. It is an increasing cause of hospital-acquired and drug-resistant infections. Here, we studied the genetic diversity of 372 <i>C</i>. <i>parapsilosis</i> isolates, the largest genomic surveillance of this species to date. We show that there is relatively little genetic variation. However, we identified two more distantly related isolates from Germany, suggesting that even more sampling may yield more diversity. We find that the pangenome (the cumulative gene content of all isolates) is surprisingly small, compared to other fungal species. Many of the non-core genes are involved in transport. We also find that variations in gene content are associated with nitrogen metabolism, which may contribute to the virulence characteristics of this species.</p>","PeriodicalId":18315,"journal":{"name":"mBio","volume":" ","pages":"e0132025"},"PeriodicalIF":4.7000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12505961/pdf/","citationCount":"0","resultStr":"{\"title\":\"Small pangenome of <i>Candida parapsilosis</i> reflects overall low intraspecific diversity.\",\"authors\":\"Adam P Ryan, Sean Bergin, Jillian Scully, Evelyn Zuniga-Soto, Conor Hession, Amelia E Barber, Emilia Gómez-Molero, Oliver Bader, Florent Morio, Bing Zhai, Kenneth H Wolfe, Geraldine Butler\",\"doi\":\"10.1128/mbio.01320-25\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><i>Candida parapsilosis</i> is an opportunistic yeast pathogen that can cause life-threatening infections in immunocompromised humans. Whole-genome sequencing studies of the species have demonstrated remarkably low diversity, with strains typically differing by about 1.5 single nucleotide polymorphisms (SNPs) per 10 kb. However, SNP calling alone does not capture the full extent of genetic variation. Here, we define the pangenome of 372 <i>C</i>. <i>parapsilosis</i> isolates to determine variation in gene content. The pangenome consists of 5,859 genes, of which 48 are not found in the genome of the reference strain. This includes 5,791 core genes (present in ≥99.5% of isolates). Four genes, including the allantoin permease gene <i>DAL4</i>, were present in all isolates but were truncated in some strains. The truncated <i>DAL4</i> was classified as a pseudogene in the reference strain CDC317. CRISPR-Cas9 gene editing showed that removing the early stop codon (producing the full-length Dal4 protein) is associated with improved use of allantoin as a sole nitrogen source. We find that the accessory genome of <i>C. parapsilosis</i> consists of 68 homologous clusters. This includes 38 previously annotated genes, 27 novel paralogs of previously annotated genes, and 3 uncharacterized open reading frames. Approximately one-third of the accessory genome (24/68 genes) is associated with gene fusions between tandem genes in the major facilitator superfamily. Additionally, we identified two highly divergent <i>C. parapsilosis</i> strains and found that, despite their increased phylogenetic distance (~30 SNPs per 10 kb), both strains have similar gene content to the other 372.IMPORTANCE<i>Candida parapsilosis</i> is a human fungal pathogen listed in the high-priority group by the World Health Organization. It is an increasing cause of hospital-acquired and drug-resistant infections. Here, we studied the genetic diversity of 372 <i>C</i>. <i>parapsilosis</i> isolates, the largest genomic surveillance of this species to date. We show that there is relatively little genetic variation. However, we identified two more distantly related isolates from Germany, suggesting that even more sampling may yield more diversity. We find that the pangenome (the cumulative gene content of all isolates) is surprisingly small, compared to other fungal species. Many of the non-core genes are involved in transport. We also find that variations in gene content are associated with nitrogen metabolism, which may contribute to the virulence characteristics of this species.</p>\",\"PeriodicalId\":18315,\"journal\":{\"name\":\"mBio\",\"volume\":\" \",\"pages\":\"e0132025\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12505961/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"mBio\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1128/mbio.01320-25\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/8/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"mBio","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/mbio.01320-25","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/25 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
Small pangenome of Candida parapsilosis reflects overall low intraspecific diversity.
Candida parapsilosis is an opportunistic yeast pathogen that can cause life-threatening infections in immunocompromised humans. Whole-genome sequencing studies of the species have demonstrated remarkably low diversity, with strains typically differing by about 1.5 single nucleotide polymorphisms (SNPs) per 10 kb. However, SNP calling alone does not capture the full extent of genetic variation. Here, we define the pangenome of 372 C. parapsilosis isolates to determine variation in gene content. The pangenome consists of 5,859 genes, of which 48 are not found in the genome of the reference strain. This includes 5,791 core genes (present in ≥99.5% of isolates). Four genes, including the allantoin permease gene DAL4, were present in all isolates but were truncated in some strains. The truncated DAL4 was classified as a pseudogene in the reference strain CDC317. CRISPR-Cas9 gene editing showed that removing the early stop codon (producing the full-length Dal4 protein) is associated with improved use of allantoin as a sole nitrogen source. We find that the accessory genome of C. parapsilosis consists of 68 homologous clusters. This includes 38 previously annotated genes, 27 novel paralogs of previously annotated genes, and 3 uncharacterized open reading frames. Approximately one-third of the accessory genome (24/68 genes) is associated with gene fusions between tandem genes in the major facilitator superfamily. Additionally, we identified two highly divergent C. parapsilosis strains and found that, despite their increased phylogenetic distance (~30 SNPs per 10 kb), both strains have similar gene content to the other 372.IMPORTANCECandida parapsilosis is a human fungal pathogen listed in the high-priority group by the World Health Organization. It is an increasing cause of hospital-acquired and drug-resistant infections. Here, we studied the genetic diversity of 372 C. parapsilosis isolates, the largest genomic surveillance of this species to date. We show that there is relatively little genetic variation. However, we identified two more distantly related isolates from Germany, suggesting that even more sampling may yield more diversity. We find that the pangenome (the cumulative gene content of all isolates) is surprisingly small, compared to other fungal species. Many of the non-core genes are involved in transport. We also find that variations in gene content are associated with nitrogen metabolism, which may contribute to the virulence characteristics of this species.
期刊介绍:
mBio® is ASM''s first broad-scope, online-only, open access journal. mBio offers streamlined review and publication of the best research in microbiology and allied fields.