Multiple machine learning algorithms identify 13 types of cell death-critical genes in large and multiple non-alcoholic steatohepatitis cohorts.

IF 3.9 2区 医学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
Renao Jiang, Longfei Dai, Xinjian Xu, Zhen Zhang
{"title":"Multiple machine learning algorithms identify 13 types of cell death-critical genes in large and multiple non-alcoholic steatohepatitis cohorts.","authors":"Renao Jiang, Longfei Dai, Xinjian Xu, Zhen Zhang","doi":"10.1186/s12944-025-02588-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Dysregulated programmed cell death pathways mechanistically contribute to hepatic inflammation and fibrogenesis in non-alcoholic steatohepatitis (NASH). Identification of cell death genes may offer insights into diagnostic and therapeutic strategies for NASH.</p><p><strong>Methods: </strong>Data from multiple NASH cohorts were integrated, and 12 machine learning algorithms were applied to identify key dysregulated cell death-related genes and develop a binary classification model for NASH. Spearman's rank correlation coefficients quantified associations between these genes and clinical markers, immune infiltration profiles, and signature genes encoding pro-inflammatory mediators, metabolic regulators, and fibrotic drivers. Gene set enrichment analysis (GSEA) was performed to delineate the mechanistic underpinnings of these key genes. Consensus clustering analysis was then used to stratify patients with NASH into distinct phenotypic subgroups based on expression levels of these genes.</p><p><strong>Results: </strong>A NASH prediction model, developed using the random forest (RF) algorithm, demonstrated high diagnostic accuracy across multiple cohorts. Four key genes, enriched in lipid metabolism and inflammation pathways, were identified. Their transcriptional levels were significantly correlated with the non-alcoholic fatty liver disease activity score (NAS), hepatic inflammatory infiltration, molecular signatures of metabolic dysregulation (lipid homeostasis regulators), and fibrosis progression. These genes also enabled accurate classification of patients with NASH into clusters reflecting varying disease severity.</p><p><strong>Conclusions: </strong>A binary classification model, developed using the RF algorithm, accurately identified patients with NASH. The four cell death genes, identified through 12 machine learning algorithms, represent potential biomarkers and therapeutic targets for NASH. These genes contribute to inflammation-related immune cell activation, lipid metabolism dysregulation, and liver fibrosis, highlighting the complex interplay between cell death and NASH progression.</p>","PeriodicalId":18073,"journal":{"name":"Lipids in Health and Disease","volume":"24 1","pages":"169"},"PeriodicalIF":3.9000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12060327/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lipids in Health and Disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12944-025-02588-5","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Dysregulated programmed cell death pathways mechanistically contribute to hepatic inflammation and fibrogenesis in non-alcoholic steatohepatitis (NASH). Identification of cell death genes may offer insights into diagnostic and therapeutic strategies for NASH.

Methods: Data from multiple NASH cohorts were integrated, and 12 machine learning algorithms were applied to identify key dysregulated cell death-related genes and develop a binary classification model for NASH. Spearman's rank correlation coefficients quantified associations between these genes and clinical markers, immune infiltration profiles, and signature genes encoding pro-inflammatory mediators, metabolic regulators, and fibrotic drivers. Gene set enrichment analysis (GSEA) was performed to delineate the mechanistic underpinnings of these key genes. Consensus clustering analysis was then used to stratify patients with NASH into distinct phenotypic subgroups based on expression levels of these genes.

Results: A NASH prediction model, developed using the random forest (RF) algorithm, demonstrated high diagnostic accuracy across multiple cohorts. Four key genes, enriched in lipid metabolism and inflammation pathways, were identified. Their transcriptional levels were significantly correlated with the non-alcoholic fatty liver disease activity score (NAS), hepatic inflammatory infiltration, molecular signatures of metabolic dysregulation (lipid homeostasis regulators), and fibrosis progression. These genes also enabled accurate classification of patients with NASH into clusters reflecting varying disease severity.

Conclusions: A binary classification model, developed using the RF algorithm, accurately identified patients with NASH. The four cell death genes, identified through 12 machine learning algorithms, represent potential biomarkers and therapeutic targets for NASH. These genes contribute to inflammation-related immune cell activation, lipid metabolism dysregulation, and liver fibrosis, highlighting the complex interplay between cell death and NASH progression.

多种机器学习算法在大型和多个非酒精性脂肪性肝炎队列中识别出13种细胞死亡关键基因。
背景:失调的程序性细胞死亡途径在机制上促进非酒精性脂肪性肝炎(NASH)的肝脏炎症和纤维化。细胞死亡基因的鉴定可能为NASH的诊断和治疗策略提供见解。方法:整合来自多个NASH队列的数据,应用12种机器学习算法识别关键的失调细胞死亡相关基因,并建立NASH的二元分类模型。Spearman等级相关系数量化了这些基因与临床标志物、免疫浸润谱、编码促炎介质、代谢调节因子和纤维化驱动因子的特征基因之间的关联。基因集富集分析(GSEA)被用来描述这些关键基因的机制基础。然后采用共识聚类分析,根据这些基因的表达水平将NASH患者分为不同的表型亚组。结果:使用随机森林(RF)算法开发的NASH预测模型在多个队列中显示出较高的诊断准确性。发现了四个关键基因,富集于脂质代谢和炎症途径。它们的转录水平与非酒精性脂肪肝疾病活动性评分(NAS)、肝脏炎症浸润、代谢失调的分子特征(脂质稳态调节因子)和纤维化进展显著相关。这些基因还能够准确地将NASH患者分类为反映不同疾病严重程度的群集。结论:使用RF算法建立的二元分类模型可以准确地识别NASH患者。通过12种机器学习算法鉴定的四种细胞死亡基因代表了NASH的潜在生物标志物和治疗靶点。这些基因有助于炎症相关的免疫细胞激活、脂质代谢失调和肝纤维化,突出了细胞死亡和NASH进展之间复杂的相互作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Lipids in Health and Disease
Lipids in Health and Disease 生物-生化与分子生物学
CiteScore
7.70
自引率
2.20%
发文量
122
审稿时长
3-8 weeks
期刊介绍: Lipids in Health and Disease is an open access, peer-reviewed, journal that publishes articles on all aspects of lipids: their biochemistry, pharmacology, toxicology, role in health and disease, and the synthesis of new lipid compounds. Lipids in Health and Disease is aimed at all scientists, health professionals and physicians interested in the area of lipids. Lipids are defined here in their broadest sense, to include: cholesterol, essential fatty acids, saturated fatty acids, phospholipids, inositol lipids, second messenger lipids, enzymes and synthetic machinery that is involved in the metabolism of various lipids in the cells and tissues, and also various aspects of lipid transport, etc. In addition, the journal also publishes research that investigates and defines the role of lipids in various physiological processes, pathology and disease. In particular, the journal aims to bridge the gap between the bench and the clinic by publishing articles that are particularly relevant to human diseases and the role of lipids in the management of various diseases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信