采用机器学习算法筛查代谢功能障碍相关性脂肪性肝病(MASLD)的晚期肝纤维化:深入的解释性分析。

IF 3 3区 医学 Q2 GASTROENTEROLOGY & HEPATOLOGY
Digestion Pub Date : 2024-10-25 DOI:10.1159/000542241
Shoham Dabbah, Itamar Mishani, Yana Davidov, Ziv Ben Ari
{"title":"采用机器学习算法筛查代谢功能障碍相关性脂肪性肝病(MASLD)的晚期肝纤维化:深入的解释性分析。","authors":"Shoham Dabbah, Itamar Mishani, Yana Davidov, Ziv Ben Ari","doi":"10.1159/000542241","DOIUrl":null,"url":null,"abstract":"<p><p>Background This study aimed to train machine learning algorithms(MLAs) to detect advanced fibrosis(AF) in MASLD patients at the level of primary care setting and to explain the predictions to ensure responsible use by clinicians. Methods Readily available features of 618 MASLD patients followed up at a tertiary center were used to train five MLAs. AF was defined as liver stiffness≥9.3 kPa, measured via 2-dimension shear wave elastography(n=495) or liver biopsy≥F3(n=123). MLAs were compared to Fibrosis-4 index(FIB-4) and NAFLD fibrosis score(NFS) on 540 MASLD patients from the primary care setting as validation. Feature importance, partial dependence, and shapely additive explanations(SHAP) were utilized for explanation. Results Extreme gradient boosting(XGBoost) achieved an AUC=0.91,outperforming FIB-4(AUC=0.78) and NFS(AUC=0.81, both p&lt;0.05) with specificity=76% vs. 59% and 48% for FIB-4≥1.3 and NFS≥-1.45, respectively(p&lt;0.05). Its sensitivity(91%) was superior to FIB-4(79%). XGBoost confidently excluded AF (negative predictive value=99%) with the highest positive predictive value (31%), superior to FIB-4 and NFS (all p&lt;0.05). The most important features were HbA1c and GGT with a steep increase in AF probability at HbA1c&gt;6.5%. The strongest interaction was between AST and age. XGBoost, but not logistic regression, extracted informative patterns from ALT, LDL-c,and ALP(p&lt;0.001). One quarter of the false positives (FP) were correctly reclassified with only one additional false negative based on the SHAP values of GGT, platelets, and ALT which were found to be associated with a FP classification. Conclusions: An explainable XGBoost algorithm was demonstrated superior to FIB-4 and NFS for screening of AF in MASLD patients at the primary care setting. The algorithm also proved safe for use as clinicians can understand the predictions and flag FP classifications.</p>","PeriodicalId":11315,"journal":{"name":"Digestion","volume":" ","pages":"1-20"},"PeriodicalIF":3.0000,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Implementation of machine learning algorithms to screen for advanced liver fibrosis in metabolic dysfunction-associated steatotic liver disease (MASLD): an in-depth explanatory analysis.\",\"authors\":\"Shoham Dabbah, Itamar Mishani, Yana Davidov, Ziv Ben Ari\",\"doi\":\"10.1159/000542241\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Background This study aimed to train machine learning algorithms(MLAs) to detect advanced fibrosis(AF) in MASLD patients at the level of primary care setting and to explain the predictions to ensure responsible use by clinicians. Methods Readily available features of 618 MASLD patients followed up at a tertiary center were used to train five MLAs. AF was defined as liver stiffness≥9.3 kPa, measured via 2-dimension shear wave elastography(n=495) or liver biopsy≥F3(n=123). MLAs were compared to Fibrosis-4 index(FIB-4) and NAFLD fibrosis score(NFS) on 540 MASLD patients from the primary care setting as validation. Feature importance, partial dependence, and shapely additive explanations(SHAP) were utilized for explanation. Results Extreme gradient boosting(XGBoost) achieved an AUC=0.91,outperforming FIB-4(AUC=0.78) and NFS(AUC=0.81, both p&lt;0.05) with specificity=76% vs. 59% and 48% for FIB-4≥1.3 and NFS≥-1.45, respectively(p&lt;0.05). Its sensitivity(91%) was superior to FIB-4(79%). XGBoost confidently excluded AF (negative predictive value=99%) with the highest positive predictive value (31%), superior to FIB-4 and NFS (all p&lt;0.05). The most important features were HbA1c and GGT with a steep increase in AF probability at HbA1c&gt;6.5%. The strongest interaction was between AST and age. XGBoost, but not logistic regression, extracted informative patterns from ALT, LDL-c,and ALP(p&lt;0.001). One quarter of the false positives (FP) were correctly reclassified with only one additional false negative based on the SHAP values of GGT, platelets, and ALT which were found to be associated with a FP classification. Conclusions: An explainable XGBoost algorithm was demonstrated superior to FIB-4 and NFS for screening of AF in MASLD patients at the primary care setting. The algorithm also proved safe for use as clinicians can understand the predictions and flag FP classifications.</p>\",\"PeriodicalId\":11315,\"journal\":{\"name\":\"Digestion\",\"volume\":\" \",\"pages\":\"1-20\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digestion\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1159/000542241\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digestion","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1159/000542241","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景 本研究旨在训练机器学习算法(MLAs),以便在初级医疗机构检测MASLD患者的晚期纤维化(AF),并解释预测结果,确保临床医生负责任地使用这些算法。方法 利用一家三级中心随访的 618 名 MASLD 患者的现成特征来训练五个 MLA。通过二维剪切波弹性成像(n=495)或肝脏活检≥F3(n=123)测量的肝脏硬度≥9.3 kPa定义为AF。将 MLA 与来自初级医疗机构的 540 名 MASLD 患者的纤维化-4 指数(FIB-4)和非酒精性脂肪肝纤维化评分(NFS)进行比较,作为验证。利用特征重要性、部分依赖性和形状相加解释(SHAP)进行解释。结果 极端梯度提升(XGBoost)的AUC=0.91,优于FIB-4(AUC=0.78)和NFS(AUC=0.81,均为p<0.05),特异性=76%,而FIB-4≥1.3和NFS≥-1.45的特异性分别为59%和48%(p<0.05)。其灵敏度(91%)优于 FIB-4(79%)。XGBoost 能可靠地排除房颤(阴性预测值=99%),阳性预测值最高(31%),优于 FIB-4 和 NFS(所有 p<0.05)。最重要的特征是 HbA1c 和 GGT,HbA1c>6.5% 时房颤概率陡增。AST 和年龄之间的交互作用最强。XGBoost能从谷丙转氨酶、低密度脂蛋白胆固醇和谷草转氨酶(p<0.001)中提取信息模式,而逻辑回归则不能。四分之一的假阳性(FP)得到了正确的重新分类,仅有一个假阴性是基于 GGT、血小板和 ALT 的 SHAP 值,发现这些值与 FP 分类相关。结论在初级医疗机构对 MASLD 患者进行房颤筛查时,一种可解释的 XGBoost 算法被证明优于 FIB-4 和 NFS。该算法还被证明可以安全使用,因为临床医生可以理解预测结果并标记 FP 分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Implementation of machine learning algorithms to screen for advanced liver fibrosis in metabolic dysfunction-associated steatotic liver disease (MASLD): an in-depth explanatory analysis.

Background This study aimed to train machine learning algorithms(MLAs) to detect advanced fibrosis(AF) in MASLD patients at the level of primary care setting and to explain the predictions to ensure responsible use by clinicians. Methods Readily available features of 618 MASLD patients followed up at a tertiary center were used to train five MLAs. AF was defined as liver stiffness≥9.3 kPa, measured via 2-dimension shear wave elastography(n=495) or liver biopsy≥F3(n=123). MLAs were compared to Fibrosis-4 index(FIB-4) and NAFLD fibrosis score(NFS) on 540 MASLD patients from the primary care setting as validation. Feature importance, partial dependence, and shapely additive explanations(SHAP) were utilized for explanation. Results Extreme gradient boosting(XGBoost) achieved an AUC=0.91,outperforming FIB-4(AUC=0.78) and NFS(AUC=0.81, both p<0.05) with specificity=76% vs. 59% and 48% for FIB-4≥1.3 and NFS≥-1.45, respectively(p<0.05). Its sensitivity(91%) was superior to FIB-4(79%). XGBoost confidently excluded AF (negative predictive value=99%) with the highest positive predictive value (31%), superior to FIB-4 and NFS (all p<0.05). The most important features were HbA1c and GGT with a steep increase in AF probability at HbA1c>6.5%. The strongest interaction was between AST and age. XGBoost, but not logistic regression, extracted informative patterns from ALT, LDL-c,and ALP(p<0.001). One quarter of the false positives (FP) were correctly reclassified with only one additional false negative based on the SHAP values of GGT, platelets, and ALT which were found to be associated with a FP classification. Conclusions: An explainable XGBoost algorithm was demonstrated superior to FIB-4 and NFS for screening of AF in MASLD patients at the primary care setting. The algorithm also proved safe for use as clinicians can understand the predictions and flag FP classifications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Digestion
Digestion 医学-胃肠肝病学
CiteScore
7.90
自引率
0.00%
发文量
39
审稿时长
6-12 weeks
期刊介绍: ''Digestion'' concentrates on clinical research reports: in addition to editorials and reviews, the journal features sections on Stomach/Esophagus, Bowel, Neuro-Gastroenterology, Liver/Bile, Pancreas, Metabolism/Nutrition and Gastrointestinal Oncology. Papers cover physiology in humans, metabolic studies and clinical work on the etiology, diagnosis, and therapy of human diseases. It is thus especially cut out for gastroenterologists employed in hospitals and outpatient units. Moreover, the journal''s coverage of studies on the metabolism and effects of therapeutic drugs carries considerable value for clinicians and investigators beyond the immediate field of gastroenterology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信