E.PathDash,对公开病原体基因表达数据进行通路激活分析。

IF 5 2区 生物学 Q1 MICROBIOLOGY
mSystems Pub Date : 2024-11-19 Epub Date: 2024-10-18 DOI:10.1128/msystems.01030-24
Lily Taub, Thomas H Hampton, Sharanya Sarkar, Georgia Doing, Samuel L Neff, Carson E Finger, Kiyoshi Ferreira Fukutani, Bruce A Stanton
{"title":"E.PathDash,对公开病原体基因表达数据进行通路激活分析。","authors":"Lily Taub, Thomas H Hampton, Sharanya Sarkar, Georgia Doing, Samuel L Neff, Carson E Finger, Kiyoshi Ferreira Fukutani, Bruce A Stanton","doi":"10.1128/msystems.01030-24","DOIUrl":null,"url":null,"abstract":"<p><p>E.PathDash facilitates re-analysis of gene expression data from pathogens clinically relevant to chronic respiratory diseases, including a total of 48 studies, 548 samples, and 404 unique treatment comparisons. The application enables users to assess broad biological stress responses at the KEGG pathway or gene ontology level and also provides data for individual genes. E.PathDash reduces the time required to gain access to data from multiple hours per data set to seconds. Users can download high-quality images such as volcano plots and boxplots, differential gene expression results, and raw count data, making it fully interoperable with other tools. Importantly, users can rapidly toggle between experimental comparisons and different studies of the same phenomenon, enabling them to judge the extent to which observed responses are reproducible. As a proof of principle, we invited two cystic fibrosis scientists to use the application to explore scientific questions relevant to their specific research areas. Reassuringly, pathway activation analysis recapitulated results reported in original publications, but it also yielded new insights into pathogen responses to changes in their environments, validating the utility of the application. All software and data are freely accessible, and the application is available at scangeo.dartmouth.edu/EPathDash.</p><p><strong>Importance: </strong>Chronic respiratory illnesses impose a high disease burden on our communities and people with respiratory diseases are susceptible to robust bacterial infections from pathogens, including <i>Pseudomonas aeruginosa</i> and <i>Staphylococcus aureus</i>, that contribute to morbidity and mortality. Public gene expression datasets generated from these and other pathogens are abundantly available and an important resource for synthesizing existing pathogenic research, leading to interventions that improve patient outcomes. However, it can take many hours or weeks to render publicly available datasets usable; significant time and skills are needed to clean, standardize, and apply reproducible and robust bioinformatic pipelines to the data. Through collaboration with two microbiologists, we have shown that E.PathDash addresses this problem, enabling them to elucidate pathogen responses to a variety of over 400 experimental conditions and generate mechanistic hypotheses for cell-level behavior in response to disease-relevant exposures, all in a fraction of the time.</p>","PeriodicalId":18819,"journal":{"name":"mSystems","volume":" ","pages":"e0103024"},"PeriodicalIF":5.0000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11575265/pdf/","citationCount":"0","resultStr":"{\"title\":\"E.PathDash, pathway activation analysis of publicly available pathogen gene expression data.\",\"authors\":\"Lily Taub, Thomas H Hampton, Sharanya Sarkar, Georgia Doing, Samuel L Neff, Carson E Finger, Kiyoshi Ferreira Fukutani, Bruce A Stanton\",\"doi\":\"10.1128/msystems.01030-24\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>E.PathDash facilitates re-analysis of gene expression data from pathogens clinically relevant to chronic respiratory diseases, including a total of 48 studies, 548 samples, and 404 unique treatment comparisons. The application enables users to assess broad biological stress responses at the KEGG pathway or gene ontology level and also provides data for individual genes. E.PathDash reduces the time required to gain access to data from multiple hours per data set to seconds. Users can download high-quality images such as volcano plots and boxplots, differential gene expression results, and raw count data, making it fully interoperable with other tools. Importantly, users can rapidly toggle between experimental comparisons and different studies of the same phenomenon, enabling them to judge the extent to which observed responses are reproducible. As a proof of principle, we invited two cystic fibrosis scientists to use the application to explore scientific questions relevant to their specific research areas. Reassuringly, pathway activation analysis recapitulated results reported in original publications, but it also yielded new insights into pathogen responses to changes in their environments, validating the utility of the application. All software and data are freely accessible, and the application is available at scangeo.dartmouth.edu/EPathDash.</p><p><strong>Importance: </strong>Chronic respiratory illnesses impose a high disease burden on our communities and people with respiratory diseases are susceptible to robust bacterial infections from pathogens, including <i>Pseudomonas aeruginosa</i> and <i>Staphylococcus aureus</i>, that contribute to morbidity and mortality. Public gene expression datasets generated from these and other pathogens are abundantly available and an important resource for synthesizing existing pathogenic research, leading to interventions that improve patient outcomes. However, it can take many hours or weeks to render publicly available datasets usable; significant time and skills are needed to clean, standardize, and apply reproducible and robust bioinformatic pipelines to the data. Through collaboration with two microbiologists, we have shown that E.PathDash addresses this problem, enabling them to elucidate pathogen responses to a variety of over 400 experimental conditions and generate mechanistic hypotheses for cell-level behavior in response to disease-relevant exposures, all in a fraction of the time.</p>\",\"PeriodicalId\":18819,\"journal\":{\"name\":\"mSystems\",\"volume\":\" \",\"pages\":\"e0103024\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2024-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11575265/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"mSystems\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1128/msystems.01030-24\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/10/18 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"mSystems","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/msystems.01030-24","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/18 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

E.PathDash 可帮助重新分析与慢性呼吸道疾病临床相关的病原体的基因表达数据,包括总共 48 项研究、548 个样本和 404 个独特的治疗比较。该应用程序使用户能够在 KEGG 通路或基因本体水平上评估广泛的生物应激反应,还能提供单个基因的数据。E.PathDash 可将访问数据所需的时间从每个数据集数小时缩短到数秒。用户可以下载火山图和箱形图等高质量图像、差异基因表达结果和原始计数数据,使其与其他工具完全互操作。重要的是,用户可以在实验比较和同一现象的不同研究之间快速切换,从而判断观察到的反应在多大程度上具有可重复性。作为原理验证,我们邀请了两位囊性纤维化科学家使用该应用程序来探索与其特定研究领域相关的科学问题。令人欣慰的是,通路激活分析再现了原始出版物中报告的结果,而且还对病原体对其环境变化的反应提出了新的见解,从而验证了该应用程序的实用性。所有软件和数据均可免费获取,应用程序可在 scangeo.dartmouth.edu/EPathDash.Importance 上查阅:慢性呼吸道疾病给我们的社区带来了沉重的疾病负担,呼吸道疾病患者很容易受到包括铜绿假单胞菌和金黄色葡萄球菌在内的病原体的细菌感染,从而导致发病和死亡。从这些病原体和其他病原体生成的公共基因表达数据集非常丰富,是综合现有病原体研究的重要资源,可用于改善患者预后的干预措施。然而,要使公开的数据集可用,可能需要数小时或数周的时间;需要大量的时间和技能对数据进行清理、标准化,并应用可重复的、强大的生物信息学管道。通过与两位微生物学家的合作,我们已经证明 E.PathDash 能够解决这个问题,使他们能够在极短的时间内阐明病原体对 400 多种实验条件的反应,并生成细胞级行为对疾病相关暴露的机理假设。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
E.PathDash, pathway activation analysis of publicly available pathogen gene expression data.

E.PathDash facilitates re-analysis of gene expression data from pathogens clinically relevant to chronic respiratory diseases, including a total of 48 studies, 548 samples, and 404 unique treatment comparisons. The application enables users to assess broad biological stress responses at the KEGG pathway or gene ontology level and also provides data for individual genes. E.PathDash reduces the time required to gain access to data from multiple hours per data set to seconds. Users can download high-quality images such as volcano plots and boxplots, differential gene expression results, and raw count data, making it fully interoperable with other tools. Importantly, users can rapidly toggle between experimental comparisons and different studies of the same phenomenon, enabling them to judge the extent to which observed responses are reproducible. As a proof of principle, we invited two cystic fibrosis scientists to use the application to explore scientific questions relevant to their specific research areas. Reassuringly, pathway activation analysis recapitulated results reported in original publications, but it also yielded new insights into pathogen responses to changes in their environments, validating the utility of the application. All software and data are freely accessible, and the application is available at scangeo.dartmouth.edu/EPathDash.

Importance: Chronic respiratory illnesses impose a high disease burden on our communities and people with respiratory diseases are susceptible to robust bacterial infections from pathogens, including Pseudomonas aeruginosa and Staphylococcus aureus, that contribute to morbidity and mortality. Public gene expression datasets generated from these and other pathogens are abundantly available and an important resource for synthesizing existing pathogenic research, leading to interventions that improve patient outcomes. However, it can take many hours or weeks to render publicly available datasets usable; significant time and skills are needed to clean, standardize, and apply reproducible and robust bioinformatic pipelines to the data. Through collaboration with two microbiologists, we have shown that E.PathDash addresses this problem, enabling them to elucidate pathogen responses to a variety of over 400 experimental conditions and generate mechanistic hypotheses for cell-level behavior in response to disease-relevant exposures, all in a fraction of the time.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
mSystems
mSystems Biochemistry, Genetics and Molecular Biology-Biochemistry
CiteScore
10.50
自引率
3.10%
发文量
308
审稿时长
13 weeks
期刊介绍: mSystems™ will publish preeminent work that stems from applying technologies for high-throughput analyses to achieve insights into the metabolic and regulatory systems at the scale of both the single cell and microbial communities. The scope of mSystems™ encompasses all important biological and biochemical findings drawn from analyses of large data sets, as well as new computational approaches for deriving these insights. mSystems™ will welcome submissions from researchers who focus on the microbiome, genomics, metagenomics, transcriptomics, metabolomics, proteomics, glycomics, bioinformatics, and computational microbiology. mSystems™ will provide streamlined decisions, while carrying on ASM''s tradition of rigorous peer review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信