An interpretable machine learning model for predicting metabolic dysfunction-associated steatotic liver disease in patients with type 2 diabetes.

IF 5.7 2区 医学 Q1 ENDOCRINOLOGY & METABOLISM
Zhuolin Zhou, Nan Gao, Jiaojiao Liu, Xuerong Ma, Zhijuan Ge, Cheng Ji
{"title":"An interpretable machine learning model for predicting metabolic dysfunction-associated steatotic liver disease in patients with type 2 diabetes.","authors":"Zhuolin Zhou, Nan Gao, Jiaojiao Liu, Xuerong Ma, Zhijuan Ge, Cheng Ji","doi":"10.1111/dom.70168","DOIUrl":null,"url":null,"abstract":"<p><strong>Aim: </strong>Patients with type 2 diabetes mellitus (T2DM) exhibit an elevated prevalence of metabolic dysfunction-associated steatotic liver disease (MASLD) and are at greater risk of liver-related adverse events. Existing non-invasive tools show limited diagnostic performance in this population. This study aims to develop a predictive model that accurately identifies the risk of MASLD among T2DM patients.</p><p><strong>Materials and methods: </strong>Clinical data were collected from T2DM patients hospitalised at Nanjing Drum Tower Hospital between January 2018 and May 2025. Eight machine learning methods were developed to predict the risk of MASLD in T2DM patients. The discriminatory ability of the models was evaluated using area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), accuracy, recall, negative predictive value, positive predictive value, and F1 score. Calibration curves and decision analysis curves were employed to evaluate the calibration and clinical utility. The models were interpreted using the Shapley additive explanations method, and unsupervised clustering was performed to identify potential high-risk subgroups.</p><p><strong>Results: </strong>A total of 3836 T2DM patients constituted the complete dataset, with a MASLD incidence rate of 55.9%. Thirteen feature variables were selected for model construction, and the XGB model achieved optimal overall performance, with an AUROC of 0.873 and an AUPRC of 0.904. Unsupervised clustering identified several high-risk subgroups with distinct metabolic characteristics.</p><p><strong>Conclusion: </strong>The model developed enables reliable and interpretable MASLD risk prediction in T2DM patients based on selected commonly available clinical data, providing a practical tool for routine identification and stratified management.</p>","PeriodicalId":158,"journal":{"name":"Diabetes, Obesity & Metabolism","volume":" ","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diabetes, Obesity & Metabolism","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/dom.70168","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Aim: Patients with type 2 diabetes mellitus (T2DM) exhibit an elevated prevalence of metabolic dysfunction-associated steatotic liver disease (MASLD) and are at greater risk of liver-related adverse events. Existing non-invasive tools show limited diagnostic performance in this population. This study aims to develop a predictive model that accurately identifies the risk of MASLD among T2DM patients.

Materials and methods: Clinical data were collected from T2DM patients hospitalised at Nanjing Drum Tower Hospital between January 2018 and May 2025. Eight machine learning methods were developed to predict the risk of MASLD in T2DM patients. The discriminatory ability of the models was evaluated using area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), accuracy, recall, negative predictive value, positive predictive value, and F1 score. Calibration curves and decision analysis curves were employed to evaluate the calibration and clinical utility. The models were interpreted using the Shapley additive explanations method, and unsupervised clustering was performed to identify potential high-risk subgroups.

Results: A total of 3836 T2DM patients constituted the complete dataset, with a MASLD incidence rate of 55.9%. Thirteen feature variables were selected for model construction, and the XGB model achieved optimal overall performance, with an AUROC of 0.873 and an AUPRC of 0.904. Unsupervised clustering identified several high-risk subgroups with distinct metabolic characteristics.

Conclusion: The model developed enables reliable and interpretable MASLD risk prediction in T2DM patients based on selected commonly available clinical data, providing a practical tool for routine identification and stratified management.

预测2型糖尿病患者代谢功能障碍相关脂肪变性肝病的可解释机器学习模型
目的:2型糖尿病(T2DM)患者表现出代谢功能障碍相关脂肪变性肝病(MASLD)患病率升高,并且肝脏相关不良事件的风险更高。现有的非侵入性工具在这一人群中的诊断效果有限。本研究旨在建立一种准确识别T2DM患者MASLD风险的预测模型。材料与方法:收集2018年1月至2025年5月南京鼓楼医院住院的T2DM患者的临床资料。开发了8种机器学习方法来预测T2DM患者发生MASLD的风险。采用受试者工作特征曲线下面积(AUROC)、精确查全率曲线下面积(AUPRC)、准确率、查全率、负预测值、正预测值和F1评分评价模型的判别能力。采用校正曲线和决策分析曲线评价校正和临床应用。使用Shapley加性解释法对模型进行解释,并进行无监督聚类以识别潜在的高风险亚群。结果:完整数据集共3836例T2DM患者,MASLD发病率为55.9%。选择13个特征变量进行模型构建,XGB模型整体性能最优,AUROC为0.873,AUPRC为0.904。无监督聚类确定了几个具有不同代谢特征的高风险亚组。结论:建立的模型能够可靠且可解释地预测T2DM患者的MASLD风险,基于选定的常用临床数据,为常规识别和分层管理提供实用工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Diabetes, Obesity & Metabolism
Diabetes, Obesity & Metabolism 医学-内分泌学与代谢
CiteScore
10.90
自引率
6.90%
发文量
319
审稿时长
3-8 weeks
期刊介绍: Diabetes, Obesity and Metabolism is primarily a journal of clinical and experimental pharmacology and therapeutics covering the interrelated areas of diabetes, obesity and metabolism. The journal prioritises high-quality original research that reports on the effects of new or existing therapies, including dietary, exercise and lifestyle (non-pharmacological) interventions, in any aspect of metabolic and endocrine disease, either in humans or animal and cellular systems. ‘Metabolism’ may relate to lipids, bone and drug metabolism, or broader aspects of endocrine dysfunction. Preclinical pharmacology, pharmacokinetic studies, meta-analyses and those addressing drug safety and tolerability are also highly suitable for publication in this journal. Original research may be published as a main paper or as a research letter.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信