Artificial Intelligence in Identifying Patients With Undiagnosed Nonalcoholic Steatohepatitis.

IF 2.3 Q2 ECONOMICS
Journal of Health Economics and Outcomes Research Pub Date : 2024-09-25 eCollection Date: 2024-01-01 DOI:10.36469/001c.123645
Onur Baser, Gabriela Samayoa, Nehir Yapar, Erdem Baser
{"title":"Artificial Intelligence in Identifying Patients With Undiagnosed Nonalcoholic Steatohepatitis.","authors":"Onur Baser, Gabriela Samayoa, Nehir Yapar, Erdem Baser","doi":"10.36469/001c.123645","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background:</b> Although increasing in prevalence, nonalcoholic steatohepatitis (NASH) is often undiagnosed in clinical practice. <b>Objective:</b> This study identified patients in the Veterans Affairs (VA) health system who likely had undiagnosed NASH using a machine learning algorithm. <b>Methods:</b> From a VA data set of 25 million adult enrollees, the study population was divided into NASH-positive, non-NASH, and at-risk cohorts. We performed a claims data analysis using a machine learning algorithm. To build our model, the study population was randomly divided into an 80% training subset and a 20% testing subset and tested and trained using a cross-validation technique. In addition to the baseline model, a gradient-boosted classification tree, naïve Bayes, and random forest model were created and compared using receiver operator characteristics, area under the curve, and accuracy. The best performing model was retrained on the full 80% training subset and applied to the 20% testing subset to calculate the performance metrics. <b>Results:</b> In total, 4 223 443 patients met the study inclusion criteria, of whom 4903 were positive for NASH and 35 528 were non-NASH patients. The remainder was in the at-risk patient cohort, of which 514 997 patients (12%) were identified as likely to have NASH. Age, obesity, and abnormal liver function tests were the top determinants in assigning NASH probability. <b>Conclusions:</b> Utilization of machine learning to predict NASH allows for wider recognition, timely intervention, and targeted treatments to improve or mitigate disease progression and could be used as an initial screening tool.</p>","PeriodicalId":16012,"journal":{"name":"Journal of Health Economics and Outcomes Research","volume":"11 2","pages":"86-94"},"PeriodicalIF":2.3000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11441708/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Health Economics and Outcomes Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36469/001c.123645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Although increasing in prevalence, nonalcoholic steatohepatitis (NASH) is often undiagnosed in clinical practice. Objective: This study identified patients in the Veterans Affairs (VA) health system who likely had undiagnosed NASH using a machine learning algorithm. Methods: From a VA data set of 25 million adult enrollees, the study population was divided into NASH-positive, non-NASH, and at-risk cohorts. We performed a claims data analysis using a machine learning algorithm. To build our model, the study population was randomly divided into an 80% training subset and a 20% testing subset and tested and trained using a cross-validation technique. In addition to the baseline model, a gradient-boosted classification tree, naïve Bayes, and random forest model were created and compared using receiver operator characteristics, area under the curve, and accuracy. The best performing model was retrained on the full 80% training subset and applied to the 20% testing subset to calculate the performance metrics. Results: In total, 4 223 443 patients met the study inclusion criteria, of whom 4903 were positive for NASH and 35 528 were non-NASH patients. The remainder was in the at-risk patient cohort, of which 514 997 patients (12%) were identified as likely to have NASH. Age, obesity, and abnormal liver function tests were the top determinants in assigning NASH probability. Conclusions: Utilization of machine learning to predict NASH allows for wider recognition, timely intervention, and targeted treatments to improve or mitigate disease progression and could be used as an initial screening tool.

人工智能识别未确诊的非酒精性脂肪性肝炎患者。
背景:尽管非酒精性脂肪性肝炎(NASH)的发病率越来越高,但在临床实践中却常常得不到诊断。研究目的本研究利用机器学习算法在退伍军人事务(VA)医疗系统中识别出可能患有未确诊的非酒精性脂肪性肝炎的患者。方法从退伍军人事务部 2,500 万成年参保者的数据集中,将研究人群分为 NASH 阳性、非 NASH 和高危人群。我们使用机器学习算法对索赔数据进行了分析。为了建立模型,我们将研究人群随机分为 80% 的训练子集和 20% 的测试子集,并使用交叉验证技术进行测试和训练。除了基线模型外,我们还创建了梯度提升分类树、天真贝叶斯和随机森林模型,并使用接受者操作者特征、曲线下面积和准确率进行了比较。在全部 80% 的训练子集中对表现最好的模型进行再训练,并将其应用到 20% 的测试子集中以计算性能指标。结果:共有 4 223 443 例患者符合研究纳入标准,其中 4903 例为 NASH 阳性,35 528 例为非 NASH 患者。其余为高危患者,其中 514 997 名患者(12%)被确定为可能患有 NASH。年龄、肥胖和肝功能检测异常是确定 NASH 概率的首要因素。结论:利用机器学习预测 NASH 可以更广泛地识别、及时干预和有针对性的治疗,以改善或缓解疾病进展,可用作初步筛查工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.00
自引率
0.00%
发文量
55
审稿时长
10 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信