William G. Ryan, Ali S. Imami, Hunter Eby, Robert Mccullumsmith, Rammohan Shukla
{"title":"PAVER: Pathway Analysis Visualization with Embedding Representations","authors":"William G. Ryan, Ali S. Imami, Hunter Eby, Robert Mccullumsmith, Rammohan Shukla","doi":"10.46570/utjms.vol11-2023-906","DOIUrl":null,"url":null,"abstract":"Interpreting pathway analysis often poses a significant challenge due to the extensive lists of gene ontology (GO) terms that require meticulous manual curation to identify underlying themes. We developed PAVER, a novel R software package, to address this issue by automating theme generation and clustering of GO terms. By utilizing embedding representations and advanced machine learning techniques, PAVER discerns patterns within the GO terms, creating an intuitive visual landscape of clusters for ease of functional interpretation. This method significantly minimizes the time and effort traditionally required for manual curation. We applied PAVER to a previously published dataset, where it demonstrated robustness by generating themes that closely mirrored those produced by manual curation. With PAVER, we present a powerful tool that not only enhances the efficiency of pathway analysis but also broadens its accessibility across various fields, including disease pathway modeling, drug target identification, and comparative genomics. Our work with PAVER marks a significant step towards simplifying the pathway analysis interpretation process in bioinformatics research.","PeriodicalId":220681,"journal":{"name":"Translation: The University of Toledo Journal of Medical Sciences","volume":"39 8","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translation: The University of Toledo Journal of Medical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46570/utjms.vol11-2023-906","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Interpreting pathway analysis often poses a significant challenge due to the extensive lists of gene ontology (GO) terms that require meticulous manual curation to identify underlying themes. We developed PAVER, a novel R software package, to address this issue by automating theme generation and clustering of GO terms. By utilizing embedding representations and advanced machine learning techniques, PAVER discerns patterns within the GO terms, creating an intuitive visual landscape of clusters for ease of functional interpretation. This method significantly minimizes the time and effort traditionally required for manual curation. We applied PAVER to a previously published dataset, where it demonstrated robustness by generating themes that closely mirrored those produced by manual curation. With PAVER, we present a powerful tool that not only enhances the efficiency of pathway analysis but also broadens its accessibility across various fields, including disease pathway modeling, drug target identification, and comparative genomics. Our work with PAVER marks a significant step towards simplifying the pathway analysis interpretation process in bioinformatics research.
由于基因本体(GO)术语列表庞大,需要细致的人工整理才能确定潜在的主题,因此解读通路分析往往是一项巨大的挑战。我们开发了一款新颖的 R 软件包 PAVER,通过自动生成主题和聚类 GO 术语来解决这一问题。通过利用嵌入表示法和先进的机器学习技术,PAVER 可以识别 GO 术语中的模式,创建直观的聚类视觉景观,便于功能解释。这种方法大大减少了传统手工整理所需的时间和精力。我们将 PAVER 应用于以前发表的一个数据集,它生成的主题与人工整理的主题非常相似,从而证明了它的鲁棒性。有了 PAVER,我们展示了一个强大的工具,它不仅提高了通路分析的效率,还拓宽了通路分析在疾病通路建模、药物靶点鉴定和比较基因组学等各个领域的应用。我们的 PAVER 工作标志着我们在简化生物信息学研究中的通路分析解释过程方面迈出了重要一步。