Serologic Detection of Hepatocellular Carcinoma: Application of Machine Learning and Implications for Diagnostic Models.

IF 3.3 Q2 ONCOLOGY

JCO Clinical Cancer Informatics Pub Date : 2024-03-01 DOI:10.1200/CCI.23.00199

Philip J Johnson, Ehsan Bhatti, Hidenori Toyoda, Shan He

{"title":"Serologic Detection of Hepatocellular Carcinoma: Application of Machine Learning and Implications for Diagnostic Models.","authors":"Philip J Johnson, Ehsan Bhatti, Hidenori Toyoda, Shan He","doi":"10.1200/CCI.23.00199","DOIUrl":null,"url":null,"abstract":"Purpose: The gender, age, lens culinaris agglutinin-reactive fraction of alphafetoprotein, alphafetoprotein, des-gamma-carboxyprothrombin (GALAD) score is a biomarker-based statistical model for the serologic diagnosis of hepatocellular carcinoma (HCC) that has been developed and validated using the case-control approach with a view to early detection. Performance has, however, been suboptimal in the first prospective studies which better reflect the real-world situation. In this article, we report the application of machine learning to a large, prospectively accrued, HCC surveillance data set.Patients and methods: Models were built on a cohort of 3,473 patients with chronic liver disease within a rigorous surveillance program between 1998 and 2014, during which 459 patients with HCC were detected. Two random forest (RF) models were trained. The first RF model uses the same variables as the original GALAD model (GALAD-RF); the second is based on routinely available clinical and laboratory features (RF-practical). For comparison, we evaluated a logistic regression GALAD model trained on this longitudinal prospective data set (termed GALAD-Ogaki).Results: Models were evaluated using a repetitive cross-validation approach with the metrics averaged over 100 independent runs. As judged by area under the receiver operator curve (AUROC) and F1 score, the GALAD RF model significantly outperformed the original GALAD model. The RF-practical model also outperformed the original GALAD model in terms of both AUROC and F1 score, and both models outperformed the individual biomarkers. An online web application that implemented the GALAD-RF and RF-practical models is presented.Conclusion: RF-based models improve on the diagnostic performance of the original GALAD model in the setting of a standard HCC surveillance program. Further prospective validation studies are warranted using these models and could be expanded to offer prediction of risk of HCC development over defined periods of time.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2300199"},"PeriodicalIF":3.3000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI.23.00199","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: The gender, age, lens culinaris agglutinin-reactive fraction of alphafetoprotein, alphafetoprotein, des-gamma-carboxyprothrombin (GALAD) score is a biomarker-based statistical model for the serologic diagnosis of hepatocellular carcinoma (HCC) that has been developed and validated using the case-control approach with a view to early detection. Performance has, however, been suboptimal in the first prospective studies which better reflect the real-world situation. In this article, we report the application of machine learning to a large, prospectively accrued, HCC surveillance data set.

Patients and methods: Models were built on a cohort of 3,473 patients with chronic liver disease within a rigorous surveillance program between 1998 and 2014, during which 459 patients with HCC were detected. Two random forest (RF) models were trained. The first RF model uses the same variables as the original GALAD model (GALAD-RF); the second is based on routinely available clinical and laboratory features (RF-practical). For comparison, we evaluated a logistic regression GALAD model trained on this longitudinal prospective data set (termed GALAD-Ogaki).

Results: Models were evaluated using a repetitive cross-validation approach with the metrics averaged over 100 independent runs. As judged by area under the receiver operator curve (AUROC) and F1 score, the GALAD RF model significantly outperformed the original GALAD model. The RF-practical model also outperformed the original GALAD model in terms of both AUROC and F1 score, and both models outperformed the individual biomarkers. An online web application that implemented the GALAD-RF and RF-practical models is presented.

Conclusion: RF-based models improve on the diagnostic performance of the original GALAD model in the setting of a standard HCC surveillance program. Further prospective validation studies are warranted using these models and could be expanded to offer prediction of risk of HCC development over defined periods of time.

查看原文本刊更多论文

肝细胞癌的血清学检测：机器学习的应用及对诊断模型的影响。

目的：性别、年龄、α-甲胎蛋白、α-甲胎蛋白、去γ-羧基凝血酶原的晶状体凝集素反应分数（GALAD）评分是一种基于生物标志物的肝细胞癌（HCC）血清学诊断统计模型，该模型已通过病例对照方法开发并得到验证，以期实现早期检测。然而，在能更好地反映真实世界情况的首批前瞻性研究中，该模型的表现并不理想。在这篇文章中，我们报告了机器学习在大型前瞻性 HCC 监测数据集中的应用：在1998年至2014年期间，我们在一项严格的监测计划中对3473名慢性肝病患者的队列建立了模型，其中发现了459名HCC患者。训练了两个随机森林（RF）模型。第一个 RF 模型使用与原始 GALAD 模型相同的变量（GALAD-RF）；第二个模型基于常规可用的临床和实验室特征（RF-实用）。为了进行比较，我们评估了在该纵向前瞻性数据集上训练的逻辑回归 GALAD 模型（称为 GALAD-Ogaki）：我们采用重复交叉验证的方法对模型进行了评估，其指标是 100 次独立运行的平均值。根据接收者运算曲线下面积（AUROC）和 F1 分数判断，GALAD RF 模型明显优于原始 GALAD 模型。RF-实用模型在AUROC和F1得分方面也优于原始GALAD模型，两个模型在单个生物标记物方面的表现都优于原始GALAD模型。本文介绍了一个在线网络应用程序，该程序实现了 GALAD-RF 模型和 RF-practical 模型：结论：在标准 HCC 监测项目中，基于 RF 的模型提高了原始 GALAD 模型的诊断性能。有必要使用这些模型开展进一步的前瞻性验证研究，并可将其扩展至预测特定时期内的 HCC 发展风险。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

JCO Clinical Cancer Informatics ONCOLOGY-

CiteScore

6.20

自引率

4.80%

发文量

190