Hojin Lee, Young-In Park, Ina Jeon, Dawon Kang, Harim Chun, Jungmin Choi
{"title":"GOREA: A Framework for Systemic and Unbiased Interpretation of Gene Ontology Enrichment.","authors":"Hojin Lee, Young-In Park, Ina Jeon, Dawon Kang, Harim Chun, Jungmin Choi","doi":"10.1016/j.mocell.2025.100283","DOIUrl":null,"url":null,"abstract":"<p><p>Functional enrichment analysis is essential for extracting biological meaning from gene expression data. Gene Set Enrichment Analysis (GSEA) and Over-Representation Analysis (ORA) are widely used approaches for this purpose. However, interpreting the large number of enriched Gene Ontology Biological Process (GOBP) terms remains challenging. Existing tools such as simplifyEnrichment often yield overly general and fragmented keywords, and they do not effectively utilize quantitative metrics such as Normalized Enrichment Scores (NES) or gene overlap proportions, thereby limiting biological interpretation and prioritization. To address these issues, we developed GOREA, an improved tool for summarizing GOBP terms. GOREA improves upon simplifyEnrichment by integrating binary cut and hierarchical clustering, incorporating GOBP term hierarchy to define representative terms, and ranking clusters based on NES or gene overlap proportions. Using ComplexHeatmap package, GOREA visualizes results as a heatmap accompanied by a panel of broad GOBP terms and representative terms for each cluster, providing both general and specific biological insights. Compared to simplifyEnrichment, GOREA yields more specific and interpretable clusters while significantly reducing computational time. GOREA effectively identified distinct biological processes in immune-related data and revealed substantial overlap between GOBP terms and cancer hallmark gene sets, demonstrating its applicability across diverse biological contexts. These findings suggest that GOREA provides a substantial improvement over existing approaches and offers a scalable and efficient framework for gene set enrichment analysis across diverse biological contexts.</p>","PeriodicalId":18795,"journal":{"name":"Molecules and Cells","volume":" ","pages":"100283"},"PeriodicalIF":6.5000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecules and Cells","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.mocell.2025.100283","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Functional enrichment analysis is essential for extracting biological meaning from gene expression data. Gene Set Enrichment Analysis (GSEA) and Over-Representation Analysis (ORA) are widely used approaches for this purpose. However, interpreting the large number of enriched Gene Ontology Biological Process (GOBP) terms remains challenging. Existing tools such as simplifyEnrichment often yield overly general and fragmented keywords, and they do not effectively utilize quantitative metrics such as Normalized Enrichment Scores (NES) or gene overlap proportions, thereby limiting biological interpretation and prioritization. To address these issues, we developed GOREA, an improved tool for summarizing GOBP terms. GOREA improves upon simplifyEnrichment by integrating binary cut and hierarchical clustering, incorporating GOBP term hierarchy to define representative terms, and ranking clusters based on NES or gene overlap proportions. Using ComplexHeatmap package, GOREA visualizes results as a heatmap accompanied by a panel of broad GOBP terms and representative terms for each cluster, providing both general and specific biological insights. Compared to simplifyEnrichment, GOREA yields more specific and interpretable clusters while significantly reducing computational time. GOREA effectively identified distinct biological processes in immune-related data and revealed substantial overlap between GOBP terms and cancer hallmark gene sets, demonstrating its applicability across diverse biological contexts. These findings suggest that GOREA provides a substantial improvement over existing approaches and offers a scalable and efficient framework for gene set enrichment analysis across diverse biological contexts.
期刊介绍:
Molecules and Cells is an international on-line open-access journal devoted to the advancement and dissemination of fundamental knowledge in molecular and cellular biology. It was launched in 1990 and ISO abbreviation is "Mol. Cells". Reports on a broad range of topics of general interest to molecular and cell biologists are published. It is published on the last day of each month by the Korean Society for Molecular and Cellular Biology.