Targeted sequencing analysis pipeline for species identification of human pathogenic fungi using long-read nanopore sequencing.

IF 5.2 1区 生物学 Q1 MYCOLOGY
Nattapong Langsiri, Navaporn Worasilchai, Laszlo Irinyi, Piroon Jenjaroenpun, Thidathip Wongsurawat, Janet Jennifer Luangsa-Ard, Wieland Meyer, Ariya Chindamporn
{"title":"Targeted sequencing analysis pipeline for species identification of human pathogenic fungi using long-read nanopore sequencing.","authors":"Nattapong Langsiri,&nbsp;Navaporn Worasilchai,&nbsp;Laszlo Irinyi,&nbsp;Piroon Jenjaroenpun,&nbsp;Thidathip Wongsurawat,&nbsp;Janet Jennifer Luangsa-Ard,&nbsp;Wieland Meyer,&nbsp;Ariya Chindamporn","doi":"10.1186/s43008-023-00125-6","DOIUrl":null,"url":null,"abstract":"<p><p>Among molecular-based techniques for fungal identification, Sanger sequencing of the primary universal fungal DNA barcode, the internal transcribed spacer (ITS) region (ITS1, 5.8S, ITS2), is commonly used in clinical routine laboratories due to its simplicity, universality, efficacy, and affordability for fungal species identification. However, Sanger sequencing fails to identify mixed ITS sequences in the case of mixed infections. To overcome this limitation, different high-throughput sequencing technologies have been explored. The nanopore-based technology is now one of the most promising long-read sequencing technologies on the market as it has the potential to sequence the full-length ITS region in a single read. In this study, we established a workflow for species identification using the sequences of the entire ITS region generated by nanopore sequencing of both pure yeast isolates and mocked mixed species reads generated with different scenarios. The species used in this study included Candida albicans (n = 2), Candida tropicalis (n = 1), Nakaseomyces glabratus (formerly Candida glabrata) (n = 1), Trichosporon asahii (n = 2), Pichia kudriavzevii (formerly Candida krusei) (n = 1), and Cryptococcus neoformans (n = 1). Comparing various methods to generate the consensus sequence for fungal species identification, the results from this study indicate that read clustering using a modified version of the NanoCLUST pipeline is more sensitive than Canu or VSEARCH, as it classified species accurately with a lower abundance cluster of reads (3% abundance compared to 10% with VSEARCH). The modified NanoCLUST also reduced the number of classified clusters compared to VSEARCH, making the subsequent BLAST+ analysis faster. Subsampling of the datasets, which reduces the size of the datasets by approximately tenfold, did not significantly affect the identification results in terms of the identified species name, percent identity, query coverage, percentage of reads in the classified cluster, and the number of clusters. The ability of the method to distinguish mixed species within sub-populations of large datasets has the potential to aid computer analysis by reducing the required processing power. The herein presented new sequence analysis pipeline will facilitate better interpretation of fungal sequence data for species identification.</p>","PeriodicalId":54345,"journal":{"name":"Ima Fungus","volume":"14 1","pages":"18"},"PeriodicalIF":5.2000,"publicationDate":"2023-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483712/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ima Fungus","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s43008-023-00125-6","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MYCOLOGY","Score":null,"Total":0}
引用次数: 1

Abstract

Among molecular-based techniques for fungal identification, Sanger sequencing of the primary universal fungal DNA barcode, the internal transcribed spacer (ITS) region (ITS1, 5.8S, ITS2), is commonly used in clinical routine laboratories due to its simplicity, universality, efficacy, and affordability for fungal species identification. However, Sanger sequencing fails to identify mixed ITS sequences in the case of mixed infections. To overcome this limitation, different high-throughput sequencing technologies have been explored. The nanopore-based technology is now one of the most promising long-read sequencing technologies on the market as it has the potential to sequence the full-length ITS region in a single read. In this study, we established a workflow for species identification using the sequences of the entire ITS region generated by nanopore sequencing of both pure yeast isolates and mocked mixed species reads generated with different scenarios. The species used in this study included Candida albicans (n = 2), Candida tropicalis (n = 1), Nakaseomyces glabratus (formerly Candida glabrata) (n = 1), Trichosporon asahii (n = 2), Pichia kudriavzevii (formerly Candida krusei) (n = 1), and Cryptococcus neoformans (n = 1). Comparing various methods to generate the consensus sequence for fungal species identification, the results from this study indicate that read clustering using a modified version of the NanoCLUST pipeline is more sensitive than Canu or VSEARCH, as it classified species accurately with a lower abundance cluster of reads (3% abundance compared to 10% with VSEARCH). The modified NanoCLUST also reduced the number of classified clusters compared to VSEARCH, making the subsequent BLAST+ analysis faster. Subsampling of the datasets, which reduces the size of the datasets by approximately tenfold, did not significantly affect the identification results in terms of the identified species name, percent identity, query coverage, percentage of reads in the classified cluster, and the number of clusters. The ability of the method to distinguish mixed species within sub-populations of large datasets has the potential to aid computer analysis by reducing the required processing power. The herein presented new sequence analysis pipeline will facilitate better interpretation of fungal sequence data for species identification.

Abstract Image

Abstract Image

Abstract Image

利用长读纳米孔测序进行人类病原真菌物种鉴定的靶向测序分析管道。
在基于分子的真菌鉴定技术中,主要通用真菌DNA条形码内部转录间隔区(ITS1,5.8S,ITS2)的Sanger测序因其简单、通用、有效和可负担性而在临床常规实验室中广泛使用。然而,在混合感染的情况下,Sanger测序无法识别混合ITS序列。为了克服这一限制,已经探索了不同的高通量测序技术。基于纳米孔的技术现在是市场上最有前途的长读测序技术之一,因为它有潜力在一次读取中对全长ITS区域进行测序。在这项研究中,我们使用纯酵母分离株和模拟混合物种读数的纳米孔测序产生的整个ITS区域的序列,建立了物种鉴定的工作流程。本研究中使用的菌种包括白色念珠菌(n = 2) ,热带假丝酵母(n = 1) ,光滑念珠菌(原光滑念珠菌)(n = 1) ,asahii毛孢子虫(n = 2) ,库氏毕赤酵母(原克鲁塞念珠菌)(n = 1) ,和新型隐球菌(n = 1) 。通过比较产生真菌物种鉴定一致序列的各种方法,本研究的结果表明,使用NanoCLUST管道的修改版本的读数聚类比Canu或VSEARCH更敏感,因为它用较低丰度的读数聚类准确地对物种进行了分类(丰度为3%,而VSEARCH为10%)。与VSEARCH相比,改进后的NanoCLUST还减少了分类簇的数量,使后续的BLAST+ 分析速度更快。数据集的二次采样将数据集的大小减少了大约十倍,在已识别的物种名称、身份百分比、查询覆盖率、分类聚类中的读取百分比和聚类数量方面,对识别结果没有显著影响。该方法在大型数据集的子种群中区分混合物种的能力有可能通过降低所需的处理能力来帮助计算机分析。本文提出的新序列分析管道将有助于更好地解释真菌序列数据,用于物种鉴定。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Ima Fungus
Ima Fungus Agricultural and Biological Sciences-Agricultural and Biological Sciences (miscellaneous)
CiteScore
11.00
自引率
3.70%
发文量
18
审稿时长
20 weeks
期刊介绍: The flagship journal of the International Mycological Association. IMA Fungus is an international, peer-reviewed, open-access, full colour, fast-track journal. Papers on any aspect of mycology are considered, and published on-line with final pagination after proofs have been corrected; they are then effectively published under the International Code of Nomenclature for algae, fungi, and plants. The journal strongly supports good practice policies, and requires voucher specimens or cultures to be deposited in a public collection with an online database, DNA sequences in GenBank, alignments in TreeBASE, and validating information on new scientific names, including typifications, to be lodged in MycoBank. News, meeting reports, personalia, research news, correspondence, book news, and information on forthcoming international meetings are included in each issue
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信