PGIP: a web server for the rapid taxonomic identification of parasite genomes.

Impact factor 3.5; Zone 2 (Medicine); JCR Q1 (Parasitology)
Jiayao Zhang, Feng Tang, Bixian Ni, Qiang Zhang, Xinyi Gong, Fanzhen Mao, Jun Cao, Yaobao Liu
Parasites & Vectors, volume 18(1), article 365. Published 2025-08-28. Journal Article.
DOI: 10.1186/s13071-025-07007-3 (https://doi.org/10.1186/s13071-025-07007-3)
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12392538/pdf/
Citations: 0

Abstract


PGIP: a web server for the rapid taxonomic identification of parasite genomes.

Background: Parasitic diseases remain a global health challenge, and traditional diagnostic methods are limited in sensitivity and scalability. Genome-based sequencing technologies have improved and are increasingly used to identify parasites; however, their clinical adoption remains hindered by the complexity of bioinformatics analysis, reliance on incomplete reference databases, and accessibility barriers for nonspecialists. Overcoming these challenges requires standardized analytical workflows and high-quality genomic resources tailored specifically to parasite identification.

Methods: We developed a user-friendly web server named the Parasite Genome Identification Platform (PGIP). The reference database was sourced from the National Center for Biotechnology Information (NCBI), WormBase, the European Nucleotide Archive (ENA), and VEuPathDB, rigorously filtered for quality, and deduplicated using Cluster Database at High Identity with Tolerance (CD-HIT) to ensure accuracy and nonredundancy. To streamline analysis, we integrated a standardized identification pipeline built on Nextflow, which encompasses host DNA depletion, quality control, parasite species identification via both read-mapping and assembly-based approaches, and automated report generation for comprehensive diagnostic insights.
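To make the deduplication step concrete, the following is a minimal sketch of greedy incremental clustering, the general strategy that CD-HIT is known for: sequences are processed longest first, and each one either joins the first existing representative it resembles closely enough or becomes a new representative. This is not CD-HIT's implementation and not the authors' code. The `identity` function uses Python's `difflib` as a crude stand-in for alignment-based identity, and the 0.95 threshold, the function names, and the toy sequences are all assumptions made for illustration; the abstract does not state the parameters used.

```python
from difflib import SequenceMatcher


def identity(a: str, b: str) -> float:
    """Crude similarity in [0, 1]; a stand-in for real alignment-based identity."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def greedy_dedup(seqs: dict[str, str], threshold: float = 0.95) -> dict[str, list[str]]:
    """Cluster sequences greedily and return {representative: [members]}.

    The threshold is a hypothetical value chosen for this sketch.
    """
    clusters: dict[str, list[str]] = {}
    # Longest sequences first; ties broken by name so the result is deterministic.
    for name in sorted(seqs, key=lambda n: (-len(seqs[n]), n)):
        for rep in clusters:
            if identity(seqs[rep], seqs[name]) >= threshold:
                clusters[rep].append(name)  # redundant with an existing representative
                break
        else:
            clusters[name] = [name]  # no match found: start a new cluster
    return clusters


# Toy input with hypothetical names: two identical entries and one distinct entry.
toy = {
    "genome_a": "ACGTACGTACGTACGTACGT",
    "genome_a_copy": "ACGTACGTACGTACGTACGT",
    "genome_b": "TTTTGGGGCCCCAAAATTTT",
}
clusters = greedy_dedup(toy)
representatives = sorted(clusters)
```

Keeping only `representatives` yields a nonredundant set. A real reference database would require an alignment-aware tool, since pairwise string comparison of this kind does not scale to whole genomes.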

Results: PGIP integrates a curated database of 280 parasite genomes, rigorously filtered for quality and taxonomic accuracy. Validation across diverse datasets demonstrated PGIP's precise species-level resolution and its compatibility with clinical samples. The platform features an intuitive graphical interface, and its one-click analysis significantly reduces reliance on bioinformatics expertise, thus enabling rapid diagnosis.

Conclusions: PGIP is an accurate, efficient, and user-friendly web server designed to simplify and accelerate the taxonomic identification of parasite genomes using data from metagenomic next-generation sequencing. Its automated framework reduces the need for specialized expertise, enabling rapid application in clinical and public health settings.

Source journal
Parasites & Vectors (Medicine: Parasitology)
CiteScore: 6.30
Self-citation rate: 9.40%
Articles published: 433
Review time: 1.4 months
About the journal: Parasites & Vectors is an open access, peer-reviewed online journal dealing with the biology of parasites, parasitic diseases, intermediate hosts, vectors and vector-borne pathogens. Manuscripts published in this journal will be available to all worldwide, with no barriers to access, immediately following acceptance. However, authors retain the copyright of their material and may use it, or distribute it, as they wish. Manuscripts on all aspects of the basic and applied biology of parasites, intermediate hosts, vectors and vector-borne pathogens will be considered. In addition to the traditional and well-established areas of science in these fields, we also aim to provide a vehicle for publication of the rapidly developing resources and technology in parasite, intermediate host and vector genomics and their impacts on biological research. We are able to publish large datasets and extensive results, frequently associated with genomic and post-genomic technologies, which are not readily accommodated in traditional journals. Manuscripts addressing broader issues, for example economics, social sciences and global climate change in relation to parasites, vectors and disease control, are also welcomed.