GOREA: A Framework for Systemic and Unbiased Interpretation of Gene Ontology Enrichment.

IF 6.5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
Hojin Lee, Young-In Park, Ina Jeon, Dawon Kang, Harim Chun, Jungmin Choi
{"title":"GOREA: A Framework for Systemic and Unbiased Interpretation of Gene Ontology Enrichment.","authors":"Hojin Lee, Young-In Park, Ina Jeon, Dawon Kang, Harim Chun, Jungmin Choi","doi":"10.1016/j.mocell.2025.100283","DOIUrl":null,"url":null,"abstract":"<p><p>Functional enrichment analysis is essential for extracting biological meaning from gene expression data. Gene Set Enrichment Analysis (GSEA) and Over-Representation Analysis (ORA) are widely used approaches for this purpose. However, interpreting the large number of enriched Gene Ontology Biological Process (GOBP) terms remains challenging. Existing tools such as simplifyEnrichment often yield overly general and fragmented keywords, and they do not effectively utilize quantitative metrics such as Normalized Enrichment Scores (NES) or gene overlap proportions, thereby limiting biological interpretation and prioritization. To address these issues, we developed GOREA, an improved tool for summarizing GOBP terms. GOREA improves upon simplifyEnrichment by integrating binary cut and hierarchical clustering, incorporating GOBP term hierarchy to define representative terms, and ranking clusters based on NES or gene overlap proportions. Using ComplexHeatmap package, GOREA visualizes results as a heatmap accompanied by a panel of broad GOBP terms and representative terms for each cluster, providing both general and specific biological insights. Compared to simplifyEnrichment, GOREA yields more specific and interpretable clusters while significantly reducing computational time. GOREA effectively identified distinct biological processes in immune-related data and revealed substantial overlap between GOBP terms and cancer hallmark gene sets, demonstrating its applicability across diverse biological contexts. These findings suggest that GOREA provides a substantial improvement over existing approaches and offers a scalable and efficient framework for gene set enrichment analysis across diverse biological contexts.</p>","PeriodicalId":18795,"journal":{"name":"Molecules and Cells","volume":" ","pages":"100283"},"PeriodicalIF":6.5000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecules and Cells","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.mocell.2025.100283","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Functional enrichment analysis is essential for extracting biological meaning from gene expression data. Gene Set Enrichment Analysis (GSEA) and Over-Representation Analysis (ORA) are widely used approaches for this purpose. However, interpreting the large number of enriched Gene Ontology Biological Process (GOBP) terms remains challenging. Existing tools such as simplifyEnrichment often yield overly general and fragmented keywords, and they do not effectively utilize quantitative metrics such as Normalized Enrichment Scores (NES) or gene overlap proportions, thereby limiting biological interpretation and prioritization. To address these issues, we developed GOREA, an improved tool for summarizing GOBP terms. GOREA improves upon simplifyEnrichment by integrating binary cut and hierarchical clustering, incorporating GOBP term hierarchy to define representative terms, and ranking clusters based on NES or gene overlap proportions. Using ComplexHeatmap package, GOREA visualizes results as a heatmap accompanied by a panel of broad GOBP terms and representative terms for each cluster, providing both general and specific biological insights. Compared to simplifyEnrichment, GOREA yields more specific and interpretable clusters while significantly reducing computational time. GOREA effectively identified distinct biological processes in immune-related data and revealed substantial overlap between GOBP terms and cancer hallmark gene sets, demonstrating its applicability across diverse biological contexts. These findings suggest that GOREA provides a substantial improvement over existing approaches and offers a scalable and efficient framework for gene set enrichment analysis across diverse biological contexts.

基因本体富集的系统和公正解释框架。
功能富集分析是从基因表达数据中提取生物学意义的关键。基因集富集分析(GSEA)和过度代表性分析(ORA)是广泛使用的方法。然而,解释大量丰富的基因本体生物过程(GOBP)术语仍然具有挑战性。现有的工具,如simplifyEnrichment,往往产生过于笼统和碎片化的关键词,它们不能有效地利用定量指标,如标准化浓缩分数(NES)或基因重叠比例,从而限制了生物学解释和优先级。为了解决这些问题,我们开发了GOREA,这是一个用于总结GOBP术语的改进工具。GOBP在简化浓缩的基础上进行了改进,整合了二元分割和层次聚类,结合GOBP术语层次来定义代表性术语,并基于NES或基因重叠比例对聚类进行排序。使用ComplexHeatmap包,GOREA将结果可视化为热图,并附有一组广泛的GOBP术语和每个集群的代表性术语,提供一般和特定的生物学见解。与simplifyEnrichment相比,GOREA产生更具体和可解释的聚类,同时显着减少了计算时间。GOREA有效地识别了免疫相关数据中不同的生物学过程,并揭示了GOBP术语和癌症标志基因集之间的大量重叠,证明了其在不同生物学背景下的适用性。这些发现表明,GOREA提供了对现有方法的实质性改进,并为跨不同生物背景的基因集富集分析提供了一个可扩展和有效的框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecules and Cells
Molecules and Cells 生物-生化与分子生物学
CiteScore
6.60
自引率
10.50%
发文量
83
审稿时长
2.3 months
期刊介绍: Molecules and Cells is an international on-line open-access journal devoted to the advancement and dissemination of fundamental knowledge in molecular and cellular biology. It was launched in 1990 and ISO abbreviation is "Mol. Cells". Reports on a broad range of topics of general interest to molecular and cell biologists are published. It is published on the last day of each month by the Korean Society for Molecular and Cellular Biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信