Phylogenomic workflow for uncultivable microbial eukaryotes using single-cell RNA sequencing - A case study with planktonic ciliates (Ciliophora, Oligotrichea).
Shahed U A Shazib, Ragib Ahsan, Marie Leleu, George B McManus, Laura A Katz, Luciana F Santoferrara
{"title":"Phylogenomic workflow for uncultivable microbial eukaryotes using single-cell RNA sequencing - A case study with planktonic ciliates (Ciliophora, Oligotrichea).","authors":"Shahed U A Shazib, Ragib Ahsan, Marie Leleu, George B McManus, Laura A Katz, Luciana F Santoferrara","doi":"10.1016/j.ympev.2024.108239","DOIUrl":null,"url":null,"abstract":"<p><p>Phylogenetic analyses increasingly rely on genomic and transcriptomic data to produce better supported inferences on the evolutionary relationships among microbial eukaryotes. Such phylogenomic analyses, however, require robust workflows, bioinformatic expertise and computational power. Microbial eukaryotes pose additional challenges given the complexity of their genomes and the presence of non-target sequences (e.g., symbionts, prey) in data obtained from single cells of uncultivable lineages. To address these challenges, we developed a phylogenomic workflow based on single-cell RNA sequencing, integrating all essential steps from cell isolation to data curation and species tree inference. We assessed our workflow by using publicly available and newly generated transcriptomes (11 and 28, respectively) from the Oligotrichea, a diverse group of marine planktonic ciliates. This group's phylogenetic relationships have been relatively well-studied based on ribosomal RNA gene markers, which we reconstructed by read mapping of transcriptome sequences and compared to our phylogenomic inferences. We also compared phylogenomic analyses based on single-copy protein-coding genes (well-curated orthologs) and multi-copy genes (including paralogs) by sequence concatenation and a coalescence approach (Asteroid), respectively. Finally, using subsets of up to 1,014 gene families (GFs), we assessed the influence of missing data in our phylogenomic inferences. All our analyses yielded similar results, and most inferred relationships were consistent and well-supported. Overall, we found that Asteroid provides robust support for species tree inferences, while simplifying curation steps, minimizing the effects of missing data and maximizing the number of GFs represented in the analyses. Our workflow can be adapted for phylogenomic analyses based on single-cell RNA sequencing of other uncultivable microbial eukaryotes.</p>","PeriodicalId":56109,"journal":{"name":"Molecular Phylogenetics and Evolution","volume":" ","pages":"108239"},"PeriodicalIF":3.6000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Phylogenetics and Evolution","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.ympev.2024.108239","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Phylogenetic analyses increasingly rely on genomic and transcriptomic data to produce better supported inferences on the evolutionary relationships among microbial eukaryotes. Such phylogenomic analyses, however, require robust workflows, bioinformatic expertise and computational power. Microbial eukaryotes pose additional challenges given the complexity of their genomes and the presence of non-target sequences (e.g., symbionts, prey) in data obtained from single cells of uncultivable lineages. To address these challenges, we developed a phylogenomic workflow based on single-cell RNA sequencing, integrating all essential steps from cell isolation to data curation and species tree inference. We assessed our workflow by using publicly available and newly generated transcriptomes (11 and 28, respectively) from the Oligotrichea, a diverse group of marine planktonic ciliates. This group's phylogenetic relationships have been relatively well-studied based on ribosomal RNA gene markers, which we reconstructed by read mapping of transcriptome sequences and compared to our phylogenomic inferences. We also compared phylogenomic analyses based on single-copy protein-coding genes (well-curated orthologs) and multi-copy genes (including paralogs) by sequence concatenation and a coalescence approach (Asteroid), respectively. Finally, using subsets of up to 1,014 gene families (GFs), we assessed the influence of missing data in our phylogenomic inferences. All our analyses yielded similar results, and most inferred relationships were consistent and well-supported. Overall, we found that Asteroid provides robust support for species tree inferences, while simplifying curation steps, minimizing the effects of missing data and maximizing the number of GFs represented in the analyses. Our workflow can be adapted for phylogenomic analyses based on single-cell RNA sequencing of other uncultivable microbial eukaryotes.
期刊介绍:
Molecular Phylogenetics and Evolution is dedicated to bringing Darwin''s dream within grasp - to "have fairly true genealogical trees of each great kingdom of Nature." The journal provides a forum for molecular studies that advance our understanding of phylogeny and evolution, further the development of phylogenetically more accurate taxonomic classifications, and ultimately bring a unified classification for all the ramifying lines of life. Phylogeographic studies will be considered for publication if they offer EXCEPTIONAL theoretical or empirical advances.