Juan M. Lavista Ferres , Felipe Oviedo , Caleb Robinson , Linda Chu , Satomi Kawamoto , Elham Afghani , Jin He , Alison P. Klein , Mike Goggins , Christopher L. Wolfgang , Ammar A. Javed , Rahul Dodhia , Nick Papadopolous , Ken Kinzler , Ralph H. Hruban , William B. Weeks , Elliot K. Fishman , Anne Marie Lennon
{"title":"Performance of explainable artificial intelligence in guiding the management of patients with a pancreatic cyst","authors":"Juan M. Lavista Ferres , Felipe Oviedo , Caleb Robinson , Linda Chu , Satomi Kawamoto , Elham Afghani , Jin He , Alison P. Klein , Mike Goggins , Christopher L. Wolfgang , Ammar A. Javed , Rahul Dodhia , Nick Papadopolous , Ken Kinzler , Ralph H. Hruban , William B. Weeks , Elliot K. Fishman , Anne Marie Lennon","doi":"10.1016/j.pan.2024.09.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background/objectives</h3><div>Pancreatic cyst management can be distilled into three separate pathways – discharge, monitoring or surgery– based on the risk of malignant transformation. This study compares the performance of artificial intelligence (AI) models to clinical care for this task.</div></div><div><h3>Methods</h3><div>Two explainable boosting machine (EBM) models were developed and evaluated using clinical features only, or clinical features and cyst fluid molecular markers (CFMM) using a publicly available dataset, consisting of 850 cases (median age 64; 65 % female) with independent training (429 cases) and holdout test cohorts (421 cases). There were 137 cysts with no malignant potential, 114 malignant cysts, and 599 IPMNs and MCNs.</div></div><div><h3>Results</h3><div>The EBM and EBM with CFMM models had higher accuracy for identifying patients requiring monitoring (0.88 and 0.82) and surgery (0.66 and 0.82) respectively compared with current clinical care (0.62 and 0.58). For discharge, the EBM with CFMM model had a higher accuracy (0.91) than either the EBM model (0.84) or current clinical care (0.86). In the cohort of patients who underwent surgical resection, use of the EBM-CFMM model would have decreased the number of unnecessary surgeries by 59 % (n = 92), increased correct surgeries by 7.5 % (n = 11), identified patients who require monitoring by 122 % (n = 76), and increased the number of patients correctly classified for discharge by 138 % (n = 18) compared to clinical care.</div></div><div><h3>Conclusions</h3><div>EBM models had greater sensitivity and specificity for identifying the correct management compared with either clinical management or previous AI models. The model predictions are demonstrated to be interpretable by clinicians.</div></div>","PeriodicalId":19976,"journal":{"name":"Pancreatology","volume":"24 7","pages":"Pages 1182-1191"},"PeriodicalIF":2.8000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pancreatology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1424390324007300","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background/objectives
Pancreatic cyst management can be distilled into three separate pathways – discharge, monitoring or surgery– based on the risk of malignant transformation. This study compares the performance of artificial intelligence (AI) models to clinical care for this task.
Methods
Two explainable boosting machine (EBM) models were developed and evaluated using clinical features only, or clinical features and cyst fluid molecular markers (CFMM) using a publicly available dataset, consisting of 850 cases (median age 64; 65 % female) with independent training (429 cases) and holdout test cohorts (421 cases). There were 137 cysts with no malignant potential, 114 malignant cysts, and 599 IPMNs and MCNs.
Results
The EBM and EBM with CFMM models had higher accuracy for identifying patients requiring monitoring (0.88 and 0.82) and surgery (0.66 and 0.82) respectively compared with current clinical care (0.62 and 0.58). For discharge, the EBM with CFMM model had a higher accuracy (0.91) than either the EBM model (0.84) or current clinical care (0.86). In the cohort of patients who underwent surgical resection, use of the EBM-CFMM model would have decreased the number of unnecessary surgeries by 59 % (n = 92), increased correct surgeries by 7.5 % (n = 11), identified patients who require monitoring by 122 % (n = 76), and increased the number of patients correctly classified for discharge by 138 % (n = 18) compared to clinical care.
Conclusions
EBM models had greater sensitivity and specificity for identifying the correct management compared with either clinical management or previous AI models. The model predictions are demonstrated to be interpretable by clinicians.
期刊介绍:
Pancreatology is the official journal of the International Association of Pancreatology (IAP), the European Pancreatic Club (EPC) and several national societies and study groups around the world. Dedicated to the understanding and treatment of exocrine as well as endocrine pancreatic disease, this multidisciplinary periodical publishes original basic, translational and clinical pancreatic research from a range of fields including gastroenterology, oncology, surgery, pharmacology, cellular and molecular biology as well as endocrinology, immunology and epidemiology. Readers can expect to gain new insights into pancreatic physiology and into the pathogenesis, diagnosis, therapeutic approaches and prognosis of pancreatic diseases. The journal features original articles, case reports, consensus guidelines and topical, cutting edge reviews, thus representing a source of valuable, novel information for clinical and basic researchers alike.