Zhuolin Zhou, Nan Gao, Jiaojiao Liu, Xuerong Ma, Zhijuan Ge, Cheng Ji
{"title":"预测2型糖尿病患者代谢功能障碍相关脂肪变性肝病的可解释机器学习模型","authors":"Zhuolin Zhou, Nan Gao, Jiaojiao Liu, Xuerong Ma, Zhijuan Ge, Cheng Ji","doi":"10.1111/dom.70168","DOIUrl":null,"url":null,"abstract":"<p><strong>Aim: </strong>Patients with type 2 diabetes mellitus (T2DM) exhibit an elevated prevalence of metabolic dysfunction-associated steatotic liver disease (MASLD) and are at greater risk of liver-related adverse events. Existing non-invasive tools show limited diagnostic performance in this population. This study aims to develop a predictive model that accurately identifies the risk of MASLD among T2DM patients.</p><p><strong>Materials and methods: </strong>Clinical data were collected from T2DM patients hospitalised at Nanjing Drum Tower Hospital between January 2018 and May 2025. Eight machine learning methods were developed to predict the risk of MASLD in T2DM patients. The discriminatory ability of the models was evaluated using area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), accuracy, recall, negative predictive value, positive predictive value, and F1 score. Calibration curves and decision analysis curves were employed to evaluate the calibration and clinical utility. The models were interpreted using the Shapley additive explanations method, and unsupervised clustering was performed to identify potential high-risk subgroups.</p><p><strong>Results: </strong>A total of 3836 T2DM patients constituted the complete dataset, with a MASLD incidence rate of 55.9%. Thirteen feature variables were selected for model construction, and the XGB model achieved optimal overall performance, with an AUROC of 0.873 and an AUPRC of 0.904. Unsupervised clustering identified several high-risk subgroups with distinct metabolic characteristics.</p><p><strong>Conclusion: </strong>The model developed enables reliable and interpretable MASLD risk prediction in T2DM patients based on selected commonly available clinical data, providing a practical tool for routine identification and stratified management.</p>","PeriodicalId":158,"journal":{"name":"Diabetes, Obesity & Metabolism","volume":" ","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An interpretable machine learning model for predicting metabolic dysfunction-associated steatotic liver disease in patients with type 2 diabetes.\",\"authors\":\"Zhuolin Zhou, Nan Gao, Jiaojiao Liu, Xuerong Ma, Zhijuan Ge, Cheng Ji\",\"doi\":\"10.1111/dom.70168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Aim: </strong>Patients with type 2 diabetes mellitus (T2DM) exhibit an elevated prevalence of metabolic dysfunction-associated steatotic liver disease (MASLD) and are at greater risk of liver-related adverse events. Existing non-invasive tools show limited diagnostic performance in this population. This study aims to develop a predictive model that accurately identifies the risk of MASLD among T2DM patients.</p><p><strong>Materials and methods: </strong>Clinical data were collected from T2DM patients hospitalised at Nanjing Drum Tower Hospital between January 2018 and May 2025. Eight machine learning methods were developed to predict the risk of MASLD in T2DM patients. The discriminatory ability of the models was evaluated using area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), accuracy, recall, negative predictive value, positive predictive value, and F1 score. Calibration curves and decision analysis curves were employed to evaluate the calibration and clinical utility. The models were interpreted using the Shapley additive explanations method, and unsupervised clustering was performed to identify potential high-risk subgroups.</p><p><strong>Results: </strong>A total of 3836 T2DM patients constituted the complete dataset, with a MASLD incidence rate of 55.9%. Thirteen feature variables were selected for model construction, and the XGB model achieved optimal overall performance, with an AUROC of 0.873 and an AUPRC of 0.904. Unsupervised clustering identified several high-risk subgroups with distinct metabolic characteristics.</p><p><strong>Conclusion: </strong>The model developed enables reliable and interpretable MASLD risk prediction in T2DM patients based on selected commonly available clinical data, providing a practical tool for routine identification and stratified management.</p>\",\"PeriodicalId\":158,\"journal\":{\"name\":\"Diabetes, Obesity & Metabolism\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Diabetes, Obesity & Metabolism\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/dom.70168\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENDOCRINOLOGY & METABOLISM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diabetes, Obesity & Metabolism","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/dom.70168","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
An interpretable machine learning model for predicting metabolic dysfunction-associated steatotic liver disease in patients with type 2 diabetes.
Aim: Patients with type 2 diabetes mellitus (T2DM) exhibit an elevated prevalence of metabolic dysfunction-associated steatotic liver disease (MASLD) and are at greater risk of liver-related adverse events. Existing non-invasive tools show limited diagnostic performance in this population. This study aims to develop a predictive model that accurately identifies the risk of MASLD among T2DM patients.
Materials and methods: Clinical data were collected from T2DM patients hospitalised at Nanjing Drum Tower Hospital between January 2018 and May 2025. Eight machine learning methods were developed to predict the risk of MASLD in T2DM patients. The discriminatory ability of the models was evaluated using area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), accuracy, recall, negative predictive value, positive predictive value, and F1 score. Calibration curves and decision analysis curves were employed to evaluate the calibration and clinical utility. The models were interpreted using the Shapley additive explanations method, and unsupervised clustering was performed to identify potential high-risk subgroups.
Results: A total of 3836 T2DM patients constituted the complete dataset, with a MASLD incidence rate of 55.9%. Thirteen feature variables were selected for model construction, and the XGB model achieved optimal overall performance, with an AUROC of 0.873 and an AUPRC of 0.904. Unsupervised clustering identified several high-risk subgroups with distinct metabolic characteristics.
Conclusion: The model developed enables reliable and interpretable MASLD risk prediction in T2DM patients based on selected commonly available clinical data, providing a practical tool for routine identification and stratified management.
期刊介绍:
Diabetes, Obesity and Metabolism is primarily a journal of clinical and experimental pharmacology and therapeutics covering the interrelated areas of diabetes, obesity and metabolism. The journal prioritises high-quality original research that reports on the effects of new or existing therapies, including dietary, exercise and lifestyle (non-pharmacological) interventions, in any aspect of metabolic and endocrine disease, either in humans or animal and cellular systems. ‘Metabolism’ may relate to lipids, bone and drug metabolism, or broader aspects of endocrine dysfunction. Preclinical pharmacology, pharmacokinetic studies, meta-analyses and those addressing drug safety and tolerability are also highly suitable for publication in this journal. Original research may be published as a main paper or as a research letter.