多组学分析和基于机器学习的肝脏疾病诊断分类分子特征识别沿着微生物-肠道-肝脏轴

IF 3.3 Q2 GASTROENTEROLOGY & HEPATOLOGY
Betul Comertpay, Esra Gov
{"title":"多组学分析和基于机器学习的肝脏疾病诊断分类分子特征识别沿着微生物-肠道-肝脏轴","authors":"Betul Comertpay,&nbsp;Esra Gov","doi":"10.1016/j.jceh.2025.102552","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Liver disease, responsible for around two million deaths annually, remains a pressing global health challenge. Microbial interactions within the microbiota–gut–liver axis play a substantial role in the pathogenesis of various liver conditions, including early chronic liver disease (eCLD), chronic liver disease (CLD), acute liver failure (ALF), acute-on-chronic liver failure (ACLF), non-alcoholic fatty liver disease (NAFLD), steatohepatitis, and cirrhosis. This study aimed to identify key molecular signatures involved in liver disease progression by analyzing transcriptomic and gut microbiome data, and to evaluate their diagnostic utility using machine learning models.</div></div><div><h3>Methods</h3><div>Transcriptomic analysis identified differentially expressed genes (DEGs) that, when integrated with regulatory elements microRNAs, transcription factors, receptors, and the gut microbiome highlight disease-specific molecular interactions. To assess the diagnostic potential of these molecular signatures, a two-step analysis involving principal component analysis (PCA) and Random Forest classification was conducted, achieving accuracies of 75% for ALF and 89% for NAFLD. Additionally, machine learning algorithms, including K-neighbors, multi-layer perceptron (MLP), decision tree, Random Forest, logistic regression, gradient boosting, CatBoost, Extreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LGBM), were applied to gene expression data for ALF and NAFLD.</div></div><div><h3>Results</h3><div>Key genes including CLDN14, EGFR, GSK3B, MYC, and TJP2, alongside regulatory miRNAs let-7a-5p, miR-124-3p, and miR-195-5p and transcription factors NFKB1 and SP1 may be suggested as critical to liver disease progression. Additionally, gut microbiota members, Dictyostelium discoideum and Eikenella might be novel candidates associated with liver disease, highlighting the importance of the gut-liver axis. The Random Forest model reached 75% accuracy and 83% area under the curve for ALF, while NAFLD classification achieved 100% accuracy, precision, recall, and area under the curve underscoring robust diagnostic potential.</div></div><div><h3>Conclusion</h3><div>This study establishes a solid foundation for further research and therapeutic advancement by identifying key biomolecules and pathways critical to liver disease. Additional experimental validation is needed to confirm clinical applicability.</div></div>","PeriodicalId":15479,"journal":{"name":"Journal of Clinical and Experimental Hepatology","volume":"15 5","pages":"Article 102552"},"PeriodicalIF":3.3000,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiomics Analysis and Machine Learning-based Identification of Molecular Signatures for Diagnostic Classification in Liver Disease Types Along the Microbiota-gut-liver Axis\",\"authors\":\"Betul Comertpay,&nbsp;Esra Gov\",\"doi\":\"10.1016/j.jceh.2025.102552\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Liver disease, responsible for around two million deaths annually, remains a pressing global health challenge. Microbial interactions within the microbiota–gut–liver axis play a substantial role in the pathogenesis of various liver conditions, including early chronic liver disease (eCLD), chronic liver disease (CLD), acute liver failure (ALF), acute-on-chronic liver failure (ACLF), non-alcoholic fatty liver disease (NAFLD), steatohepatitis, and cirrhosis. This study aimed to identify key molecular signatures involved in liver disease progression by analyzing transcriptomic and gut microbiome data, and to evaluate their diagnostic utility using machine learning models.</div></div><div><h3>Methods</h3><div>Transcriptomic analysis identified differentially expressed genes (DEGs) that, when integrated with regulatory elements microRNAs, transcription factors, receptors, and the gut microbiome highlight disease-specific molecular interactions. To assess the diagnostic potential of these molecular signatures, a two-step analysis involving principal component analysis (PCA) and Random Forest classification was conducted, achieving accuracies of 75% for ALF and 89% for NAFLD. Additionally, machine learning algorithms, including K-neighbors, multi-layer perceptron (MLP), decision tree, Random Forest, logistic regression, gradient boosting, CatBoost, Extreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LGBM), were applied to gene expression data for ALF and NAFLD.</div></div><div><h3>Results</h3><div>Key genes including CLDN14, EGFR, GSK3B, MYC, and TJP2, alongside regulatory miRNAs let-7a-5p, miR-124-3p, and miR-195-5p and transcription factors NFKB1 and SP1 may be suggested as critical to liver disease progression. Additionally, gut microbiota members, Dictyostelium discoideum and Eikenella might be novel candidates associated with liver disease, highlighting the importance of the gut-liver axis. The Random Forest model reached 75% accuracy and 83% area under the curve for ALF, while NAFLD classification achieved 100% accuracy, precision, recall, and area under the curve underscoring robust diagnostic potential.</div></div><div><h3>Conclusion</h3><div>This study establishes a solid foundation for further research and therapeutic advancement by identifying key biomolecules and pathways critical to liver disease. Additional experimental validation is needed to confirm clinical applicability.</div></div>\",\"PeriodicalId\":15479,\"journal\":{\"name\":\"Journal of Clinical and Experimental Hepatology\",\"volume\":\"15 5\",\"pages\":\"Article 102552\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Clinical and Experimental Hepatology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0973688325000520\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical and Experimental Hepatology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0973688325000520","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

肝病每年造成约200万人死亡,是一项紧迫的全球卫生挑战。微生物-肠-肝轴内的微生物相互作用在各种肝脏疾病的发病机制中起着重要作用,包括早期慢性肝病(eCLD)、慢性肝病(CLD)、急性肝衰竭(ALF)、急性对慢性肝衰竭(ACLF)、非酒精性脂肪性肝病(NAFLD)、脂肪性肝炎和肝硬化。本研究旨在通过分析转录组学和肠道微生物组数据来识别参与肝脏疾病进展的关键分子特征,并使用机器学习模型评估其诊断效用。方法转录组学分析发现,当与调控元件microrna、转录因子、受体和肠道微生物组结合时,差异表达基因(DEGs)会突出疾病特异性的分子相互作用。为了评估这些分子特征的诊断潜力,进行了包括主成分分析(PCA)和随机森林分类的两步分析,ALF的准确率为75%,NAFLD的准确率为89%。此外,将k近邻、多层感知器(MLP)、决策树、随机森林、逻辑回归、梯度增强、CatBoost、极端梯度增强(XGB)和光梯度增强机(LGBM)等机器学习算法应用于ALF和NAFLD的基因表达数据。结果关键基因包括CLDN14、EGFR、GSK3B、MYC和TJP2,以及调节mirna let-7a-5p、miR-124-3p和miR-195-5p以及转录因子NFKB1和SP1可能被认为是肝病进展的关键基因。此外,肠道微生物群成员,盘状盘基钢菌和艾肯菌可能是与肝脏疾病相关的新候选者,突出了肠-肝轴的重要性。随机森林模型对ALF的准确率达到75%,曲线下面积达到83%,而NAFLD分类的准确率、精密度、召回率和曲线下面积达到100%,强调了稳健的诊断潜力。结论本研究确定了肝脏疾病的关键生物分子和通路,为进一步研究和治疗奠定了坚实的基础。需要进一步的实验验证来确认临床适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Multiomics Analysis and Machine Learning-based Identification of Molecular Signatures for Diagnostic Classification in Liver Disease Types Along the Microbiota-gut-liver Axis

Multiomics Analysis and Machine Learning-based Identification of Molecular Signatures for Diagnostic Classification in Liver Disease Types Along the Microbiota-gut-liver Axis

Background

Liver disease, responsible for around two million deaths annually, remains a pressing global health challenge. Microbial interactions within the microbiota–gut–liver axis play a substantial role in the pathogenesis of various liver conditions, including early chronic liver disease (eCLD), chronic liver disease (CLD), acute liver failure (ALF), acute-on-chronic liver failure (ACLF), non-alcoholic fatty liver disease (NAFLD), steatohepatitis, and cirrhosis. This study aimed to identify key molecular signatures involved in liver disease progression by analyzing transcriptomic and gut microbiome data, and to evaluate their diagnostic utility using machine learning models.

Methods

Transcriptomic analysis identified differentially expressed genes (DEGs) that, when integrated with regulatory elements microRNAs, transcription factors, receptors, and the gut microbiome highlight disease-specific molecular interactions. To assess the diagnostic potential of these molecular signatures, a two-step analysis involving principal component analysis (PCA) and Random Forest classification was conducted, achieving accuracies of 75% for ALF and 89% for NAFLD. Additionally, machine learning algorithms, including K-neighbors, multi-layer perceptron (MLP), decision tree, Random Forest, logistic regression, gradient boosting, CatBoost, Extreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LGBM), were applied to gene expression data for ALF and NAFLD.

Results

Key genes including CLDN14, EGFR, GSK3B, MYC, and TJP2, alongside regulatory miRNAs let-7a-5p, miR-124-3p, and miR-195-5p and transcription factors NFKB1 and SP1 may be suggested as critical to liver disease progression. Additionally, gut microbiota members, Dictyostelium discoideum and Eikenella might be novel candidates associated with liver disease, highlighting the importance of the gut-liver axis. The Random Forest model reached 75% accuracy and 83% area under the curve for ALF, while NAFLD classification achieved 100% accuracy, precision, recall, and area under the curve underscoring robust diagnostic potential.

Conclusion

This study establishes a solid foundation for further research and therapeutic advancement by identifying key biomolecules and pathways critical to liver disease. Additional experimental validation is needed to confirm clinical applicability.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Clinical and Experimental Hepatology
Journal of Clinical and Experimental Hepatology GASTROENTEROLOGY & HEPATOLOGY-
CiteScore
4.90
自引率
16.70%
发文量
537
审稿时长
64 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信