Shoham Dabbah, Itamar Mishani, Yana Davidov, Ziv Ben Ari
{"title":"采用机器学习算法筛查代谢功能障碍相关性脂肪性肝病(MASLD)的晚期肝纤维化:深入的解释性分析。","authors":"Shoham Dabbah, Itamar Mishani, Yana Davidov, Ziv Ben Ari","doi":"10.1159/000542241","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>This study aimed to train machine learning algorithms (MLAs) to detect advanced fibrosis (AF) in metabolic dysfunction-associated steatotic liver disease (MASLD) patients at the level of primary care setting and to explain the predictions to ensure responsible use by clinicians.</p><p><strong>Methods: </strong>Readily available features of 618 MASLD patients followed up at a tertiary center were used to train five MLAs. AF was defined as liver stiffness ≥9.3 kPa, measured via 2-dimension shear wave elastography (n = 495) or liver biopsy ≥F3 (n = 123). MLAs were compared to Fibrosis-4 index (FIB-4) and non-alcoholic fatty liver disease (NAFLD) fibrosis score (NFS) on 540 MASLD patients from the primary care setting as validation. Feature importance, partial dependence, and shapely additive explanations (SHAPs) were utilized for explanation.</p><p><strong>Results: </strong>Extreme gradient boosting (XGBoost) achieved an AUC = 0.91, outperforming FIB-4 (AUC = 0.78) and NFS (AUC = 0.81, both p < 0.05) with specificity = 76% versus 59% and 48% for FIB-4 ≥1.3 and NFS ≥-1.45, respectively (p < 0.05). Its sensitivity (91%) was superior to FIB-4 (79%). XGBoost confidently excluded AF (negative predictive value = 99%) with the highest positive predictive value (31%), superior to FIB-4 and NFS (all p < 0.05). The most important features were HbA1c and gamma glutamyl transpeptidase (GGT) with a steep increase in AF probability at HbA1c >6.5%. The strongest interaction was between AST and age. XGBoost, but not logistic regression, extracted informative patterns from ALT, low-density lipoprotein cholesterol, and alkaline phosphatase (p < 0.001). One-quarter of the false positives (FPs) were correctly reclassified with only one additional false negative based on the SHAP values of GGT, platelets, and ALT which were found to be associated with a FP classification.</p><p><strong>Conclusions: </strong>An explainable XGBoost algorithm was demonstrated superior to FIB-4 and NFS for screening of AF in MASLD patients at the primary care setting. The algorithm also proved safe for use as clinicians can understand the predictions and flag FP classifications.</p>","PeriodicalId":11315,"journal":{"name":"Digestion","volume":" ","pages":"1-14"},"PeriodicalIF":3.0000,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Implementation of Machine Learning Algorithms to Screen for Advanced Liver Fibrosis in Metabolic Dysfunction-Associated Steatotic Liver Disease: An In-Depth Explanatory Analysis.\",\"authors\":\"Shoham Dabbah, Itamar Mishani, Yana Davidov, Ziv Ben Ari\",\"doi\":\"10.1159/000542241\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>This study aimed to train machine learning algorithms (MLAs) to detect advanced fibrosis (AF) in metabolic dysfunction-associated steatotic liver disease (MASLD) patients at the level of primary care setting and to explain the predictions to ensure responsible use by clinicians.</p><p><strong>Methods: </strong>Readily available features of 618 MASLD patients followed up at a tertiary center were used to train five MLAs. AF was defined as liver stiffness ≥9.3 kPa, measured via 2-dimension shear wave elastography (n = 495) or liver biopsy ≥F3 (n = 123). MLAs were compared to Fibrosis-4 index (FIB-4) and non-alcoholic fatty liver disease (NAFLD) fibrosis score (NFS) on 540 MASLD patients from the primary care setting as validation. Feature importance, partial dependence, and shapely additive explanations (SHAPs) were utilized for explanation.</p><p><strong>Results: </strong>Extreme gradient boosting (XGBoost) achieved an AUC = 0.91, outperforming FIB-4 (AUC = 0.78) and NFS (AUC = 0.81, both p < 0.05) with specificity = 76% versus 59% and 48% for FIB-4 ≥1.3 and NFS ≥-1.45, respectively (p < 0.05). Its sensitivity (91%) was superior to FIB-4 (79%). XGBoost confidently excluded AF (negative predictive value = 99%) with the highest positive predictive value (31%), superior to FIB-4 and NFS (all p < 0.05). The most important features were HbA1c and gamma glutamyl transpeptidase (GGT) with a steep increase in AF probability at HbA1c >6.5%. The strongest interaction was between AST and age. XGBoost, but not logistic regression, extracted informative patterns from ALT, low-density lipoprotein cholesterol, and alkaline phosphatase (p < 0.001). One-quarter of the false positives (FPs) were correctly reclassified with only one additional false negative based on the SHAP values of GGT, platelets, and ALT which were found to be associated with a FP classification.</p><p><strong>Conclusions: </strong>An explainable XGBoost algorithm was demonstrated superior to FIB-4 and NFS for screening of AF in MASLD patients at the primary care setting. The algorithm also proved safe for use as clinicians can understand the predictions and flag FP classifications.</p>\",\"PeriodicalId\":11315,\"journal\":{\"name\":\"Digestion\",\"volume\":\" \",\"pages\":\"1-14\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digestion\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1159/000542241\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digestion","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1159/000542241","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
Implementation of Machine Learning Algorithms to Screen for Advanced Liver Fibrosis in Metabolic Dysfunction-Associated Steatotic Liver Disease: An In-Depth Explanatory Analysis.
Introduction: This study aimed to train machine learning algorithms (MLAs) to detect advanced fibrosis (AF) in metabolic dysfunction-associated steatotic liver disease (MASLD) patients at the level of primary care setting and to explain the predictions to ensure responsible use by clinicians.
Methods: Readily available features of 618 MASLD patients followed up at a tertiary center were used to train five MLAs. AF was defined as liver stiffness ≥9.3 kPa, measured via 2-dimension shear wave elastography (n = 495) or liver biopsy ≥F3 (n = 123). MLAs were compared to Fibrosis-4 index (FIB-4) and non-alcoholic fatty liver disease (NAFLD) fibrosis score (NFS) on 540 MASLD patients from the primary care setting as validation. Feature importance, partial dependence, and shapely additive explanations (SHAPs) were utilized for explanation.
Results: Extreme gradient boosting (XGBoost) achieved an AUC = 0.91, outperforming FIB-4 (AUC = 0.78) and NFS (AUC = 0.81, both p < 0.05) with specificity = 76% versus 59% and 48% for FIB-4 ≥1.3 and NFS ≥-1.45, respectively (p < 0.05). Its sensitivity (91%) was superior to FIB-4 (79%). XGBoost confidently excluded AF (negative predictive value = 99%) with the highest positive predictive value (31%), superior to FIB-4 and NFS (all p < 0.05). The most important features were HbA1c and gamma glutamyl transpeptidase (GGT) with a steep increase in AF probability at HbA1c >6.5%. The strongest interaction was between AST and age. XGBoost, but not logistic regression, extracted informative patterns from ALT, low-density lipoprotein cholesterol, and alkaline phosphatase (p < 0.001). One-quarter of the false positives (FPs) were correctly reclassified with only one additional false negative based on the SHAP values of GGT, platelets, and ALT which were found to be associated with a FP classification.
Conclusions: An explainable XGBoost algorithm was demonstrated superior to FIB-4 and NFS for screening of AF in MASLD patients at the primary care setting. The algorithm also proved safe for use as clinicians can understand the predictions and flag FP classifications.
期刊介绍:
''Digestion'' concentrates on clinical research reports: in addition to editorials and reviews, the journal features sections on Stomach/Esophagus, Bowel, Neuro-Gastroenterology, Liver/Bile, Pancreas, Metabolism/Nutrition and Gastrointestinal Oncology. Papers cover physiology in humans, metabolic studies and clinical work on the etiology, diagnosis, and therapy of human diseases. It is thus especially cut out for gastroenterologists employed in hospitals and outpatient units. Moreover, the journal''s coverage of studies on the metabolism and effects of therapeutic drugs carries considerable value for clinicians and investigators beyond the immediate field of gastroenterology.