Elena Cristina Rusu , Helena Clavero-Mestres , Mario Sánchez-Álvarez , Marina Veciana-Molins , Laia Bertran , Pablo Monfort-Lanzas , Carmen Aguilar , Javier Camaron , Teresa Auguet
{"title":"Uncovering hepatic transcriptomic and circulating proteomic signatures in MASH: A meta-analysis and machine learning-based biomarker discovery","authors":"Elena Cristina Rusu , Helena Clavero-Mestres , Mario Sánchez-Álvarez , Marina Veciana-Molins , Laia Bertran , Pablo Monfort-Lanzas , Carmen Aguilar , Javier Camaron , Teresa Auguet","doi":"10.1016/j.compbiomed.2025.110170","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Metabolic-associated steatohepatitis (MASH), the progressive form of metabolic-associated steatotic liver disease (MASLD), poses significant risks for liver fibrosis and cardiovascular complications. Despite extensive research, reliable biomarkers for MASH diagnosis and progression remain elusive. This study aimed to identify hepatic transcriptomic and circulating proteomic signatures specific to MASH, and to develop a machine learning-based biomarker discovery model.</div></div><div><h3>Methods</h3><div>A systematic review of RNA-Seq and proteomic datasets was conducted, retrieving 7 hepatic transcriptomics and 3 circulating proteomics studies, encompassing 483 liver samples and 169 serum/plasma samples, respectively. Differential gene and protein expression analyses were performed, and pathways were enriched using gene set enrichment analysis. A machine learning (ML) model was developed to identify MASH-specific biomarkers, utilizing biologically significant protein ratios.</div></div><div><h3>Key findings</h3><div>Hepatic transcriptomic analysis identified 5017 differentially expressed genes (DEGs), with significant enrichment of extracellular matrix (ECM) pathways. Serum proteomics revealed six differentially expressed proteins (DEPs), including complement-related proteins. Integration of transcriptomic and proteomic data highlighted the complement cascade as a key pathway in MASH, with discordant regulation between the liver and circulation. The ML-based biomarker discovery model, utilizing protein ratios, achieved an F1 scores of 0.83 and 0.64 in the training sets and 0.67 in an external validation set.</div></div><div><h3>Conclusion</h3><div>Our findings indicate ECM deregulation and complement system involvement in MASH progression. The novel ML model incorporating protein ratios offers a potential tool for MASH diagnosis. However, further refinement and validation across larger and more diverse cohorts is needed to generalize these results.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"191 ","pages":"Article 110170"},"PeriodicalIF":7.0000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525005219","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Metabolic-associated steatohepatitis (MASH), the progressive form of metabolic-associated steatotic liver disease (MASLD), poses significant risks for liver fibrosis and cardiovascular complications. Despite extensive research, reliable biomarkers for MASH diagnosis and progression remain elusive. This study aimed to identify hepatic transcriptomic and circulating proteomic signatures specific to MASH, and to develop a machine learning-based biomarker discovery model.
Methods
A systematic review of RNA-Seq and proteomic datasets was conducted, retrieving 7 hepatic transcriptomics and 3 circulating proteomics studies, encompassing 483 liver samples and 169 serum/plasma samples, respectively. Differential gene and protein expression analyses were performed, and pathways were enriched using gene set enrichment analysis. A machine learning (ML) model was developed to identify MASH-specific biomarkers, utilizing biologically significant protein ratios.
Key findings
Hepatic transcriptomic analysis identified 5017 differentially expressed genes (DEGs), with significant enrichment of extracellular matrix (ECM) pathways. Serum proteomics revealed six differentially expressed proteins (DEPs), including complement-related proteins. Integration of transcriptomic and proteomic data highlighted the complement cascade as a key pathway in MASH, with discordant regulation between the liver and circulation. The ML-based biomarker discovery model, utilizing protein ratios, achieved an F1 scores of 0.83 and 0.64 in the training sets and 0.67 in an external validation set.
Conclusion
Our findings indicate ECM deregulation and complement system involvement in MASH progression. The novel ML model incorporating protein ratios offers a potential tool for MASH diagnosis. However, further refinement and validation across larger and more diverse cohorts is needed to generalize these results.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.