Tong Wu, Jianguo Yan, Feixiang Xiong, Xiaoli Liu, Yang Zhou, Xiaomin Ji, Peipei Meng, Yuyong Jiang, Yixin Hou
{"title":"基于机器学习的模型用于预测慢性乙型肝炎患者肝细胞癌的风险。","authors":"Tong Wu, Jianguo Yan, Feixiang Xiong, Xiaoli Liu, Yang Zhou, Xiaomin Ji, Peipei Meng, Yuyong Jiang, Yixin Hou","doi":"10.2147/JHC.S498463","DOIUrl":null,"url":null,"abstract":"<p><strong>Object: </strong>Currently, predictive models that effectively stratify the risk levels for hepatocellular carcinoma (HCC) are insufficient. Our study aimed to assess the 10-year cumulative risk of HCC among patients suffering from chronic hepatitis B (CHB) by employing an artificial neural network (ANN).</p><p><strong>Methods: </strong>This research involved 1717 patients admitted to Beijing Ditan Hospital of Capital Medical University and the People's Liberation Army Fifth Medical Center. The training group included 1309 individuals from Beijing Ditan Hospital of Capital Medical University, whereas the validation group contained 408 individuals from the People's Liberation Army Fifth Medical Center. By performing a univariate analysis, we pinpointed factors that had an independent impact on the development of HCC, which were subsequently employed to create the ANN model. To evaluate the ANN model, we analyzed its predictive accuracy, discriminative performance, and clinical net benefit through measures including the area under the receiver operating characteristic curve (AUC), concordance index (C-index), and calibration curves.</p><p><strong>Results: </strong>The cumulative incidence rates of HCC over a decade were observed to be 3.59% in the training cohort and 4.41% in the validation cohort. We incorporated nine distinct independent risk factors into the ANN model's development. Notably, in the training group, the area under the receiver operating characteristic (AUROC) curve for the ANN model was reported as 0.929 (95% CI 0.910-0.948), and the C-index was 0.917 (95% CI 0.907-0.927). These results were significantly superior to those of the mREACHE-B(0.700, 95% CI 0.639-0.761), mPAGE-B(0.800, 95% CI 0.757-0.844), HCC-RESCUE(0.787, 95% CI 0.732-0.837), CAMD(0.760, 95% CI 0.708-0.812), REAL-B(0.767, 95% CI 0.719-0.816), and PAGE-B(0.760, 95% CI 0.712-0.808) models (p < 0.001). The ANN model proficiently categorized patients into low-risk and high-risk groups based on their 10-year projections. In the training cohort, the positive predictive value (PPV) for the incidence of liver cancer in low-risk individuals was 92.5% (95% CI 0.921-0.939), whereas the negative predictive value (NPV) stood at 88.2% (95% CI 0.870-0.894). Among high-risk patients, the PPV reached 94.6% (95% CI 0.936-0.956) and the NPV was 90.2% (95% CI 0.897-0.917). These results were also confirmed in the independent validation cohort.</p><p><strong>Conclusion: </strong>The model utilizing artificial neural networks demonstrates strong performance in personalized predictions and could assist in assessing the likelihood of a 10-year risk of HCC in patients suffering from CHB.</p>","PeriodicalId":15906,"journal":{"name":"Journal of Hepatocellular Carcinoma","volume":"12 ","pages":"659-670"},"PeriodicalIF":4.2000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11974571/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine Learning-Based Model Used for Predicting the Risk of Hepatocellular Carcinoma in Patients with Chronic Hepatitis B.\",\"authors\":\"Tong Wu, Jianguo Yan, Feixiang Xiong, Xiaoli Liu, Yang Zhou, Xiaomin Ji, Peipei Meng, Yuyong Jiang, Yixin Hou\",\"doi\":\"10.2147/JHC.S498463\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Object: </strong>Currently, predictive models that effectively stratify the risk levels for hepatocellular carcinoma (HCC) are insufficient. Our study aimed to assess the 10-year cumulative risk of HCC among patients suffering from chronic hepatitis B (CHB) by employing an artificial neural network (ANN).</p><p><strong>Methods: </strong>This research involved 1717 patients admitted to Beijing Ditan Hospital of Capital Medical University and the People's Liberation Army Fifth Medical Center. The training group included 1309 individuals from Beijing Ditan Hospital of Capital Medical University, whereas the validation group contained 408 individuals from the People's Liberation Army Fifth Medical Center. By performing a univariate analysis, we pinpointed factors that had an independent impact on the development of HCC, which were subsequently employed to create the ANN model. To evaluate the ANN model, we analyzed its predictive accuracy, discriminative performance, and clinical net benefit through measures including the area under the receiver operating characteristic curve (AUC), concordance index (C-index), and calibration curves.</p><p><strong>Results: </strong>The cumulative incidence rates of HCC over a decade were observed to be 3.59% in the training cohort and 4.41% in the validation cohort. We incorporated nine distinct independent risk factors into the ANN model's development. Notably, in the training group, the area under the receiver operating characteristic (AUROC) curve for the ANN model was reported as 0.929 (95% CI 0.910-0.948), and the C-index was 0.917 (95% CI 0.907-0.927). These results were significantly superior to those of the mREACHE-B(0.700, 95% CI 0.639-0.761), mPAGE-B(0.800, 95% CI 0.757-0.844), HCC-RESCUE(0.787, 95% CI 0.732-0.837), CAMD(0.760, 95% CI 0.708-0.812), REAL-B(0.767, 95% CI 0.719-0.816), and PAGE-B(0.760, 95% CI 0.712-0.808) models (p < 0.001). The ANN model proficiently categorized patients into low-risk and high-risk groups based on their 10-year projections. In the training cohort, the positive predictive value (PPV) for the incidence of liver cancer in low-risk individuals was 92.5% (95% CI 0.921-0.939), whereas the negative predictive value (NPV) stood at 88.2% (95% CI 0.870-0.894). Among high-risk patients, the PPV reached 94.6% (95% CI 0.936-0.956) and the NPV was 90.2% (95% CI 0.897-0.917). These results were also confirmed in the independent validation cohort.</p><p><strong>Conclusion: </strong>The model utilizing artificial neural networks demonstrates strong performance in personalized predictions and could assist in assessing the likelihood of a 10-year risk of HCC in patients suffering from CHB.</p>\",\"PeriodicalId\":15906,\"journal\":{\"name\":\"Journal of Hepatocellular Carcinoma\",\"volume\":\"12 \",\"pages\":\"659-670\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11974571/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hepatocellular Carcinoma\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2147/JHC.S498463\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hepatocellular Carcinoma","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/JHC.S498463","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
目的:目前,有效划分肝细胞癌(HCC)危险等级的预测模型还不够。本研究旨在通过人工神经网络(ANN)评估慢性乙型肝炎(CHB)患者10年累积HCC风险。方法:选取首都医科大学附属北京地坛医院和解放军第五医疗中心收治的1717例患者为研究对象。训练组1309人来自首都医科大学附属北京地坛医院,验证组408人来自中国人民解放军第五医疗中心。通过进行单变量分析,我们确定了对HCC发展有独立影响的因素,这些因素随后被用于创建人工神经网络模型。为了评估人工神经网络模型,我们通过包括受试者工作特征曲线下面积(AUC)、一致性指数(C-index)和校准曲线等指标分析了其预测准确性、判别性能和临床净效益。结果:在培训组和验证组中,10年间HCC的累积发病率分别为3.59%和4.41%。我们将9个不同的独立风险因素纳入人工神经网络模型的开发中。值得注意的是,在训练组中,人工神经网络模型的接收者工作特征(AUROC)曲线下面积为0.929 (95% CI 0.910-0.948), c -指数为0.917 (95% CI 0.907-0.927)。这些结果显著优于mreach - b (0.700, 95% CI 0.639-0.761)、mPAGE-B(0.800, 95% CI 0.757-0.844)、HCC-RESCUE(0.787, 95% CI 0.732-0.837)、CAMD(0.760, 95% CI 0.708-0.812)、REAL-B(0.767, 95% CI 0.719-0.816)和PAGE-B(0.760, 95% CI 0.712-0.808)模型(p < 0.001)。人工神经网络模型根据10年预测熟练地将患者分为低风险组和高风险组。在培训队列中,低危人群肝癌发病率的阳性预测值(PPV)为92.5% (95% CI 0.921-0.939),阴性预测值(NPV)为88.2% (95% CI 0.870-0.894)。高危患者PPV为94.6% (95% CI 0.936-0.956), NPV为90.2% (95% CI 0.897-0.917)。这些结果在独立验证队列中也得到了证实。结论:利用人工神经网络的模型在个性化预测方面表现出色,可以帮助评估慢性乙型肝炎患者10年HCC风险的可能性。
Machine Learning-Based Model Used for Predicting the Risk of Hepatocellular Carcinoma in Patients with Chronic Hepatitis B.
Object: Currently, predictive models that effectively stratify the risk levels for hepatocellular carcinoma (HCC) are insufficient. Our study aimed to assess the 10-year cumulative risk of HCC among patients suffering from chronic hepatitis B (CHB) by employing an artificial neural network (ANN).
Methods: This research involved 1717 patients admitted to Beijing Ditan Hospital of Capital Medical University and the People's Liberation Army Fifth Medical Center. The training group included 1309 individuals from Beijing Ditan Hospital of Capital Medical University, whereas the validation group contained 408 individuals from the People's Liberation Army Fifth Medical Center. By performing a univariate analysis, we pinpointed factors that had an independent impact on the development of HCC, which were subsequently employed to create the ANN model. To evaluate the ANN model, we analyzed its predictive accuracy, discriminative performance, and clinical net benefit through measures including the area under the receiver operating characteristic curve (AUC), concordance index (C-index), and calibration curves.
Results: The cumulative incidence rates of HCC over a decade were observed to be 3.59% in the training cohort and 4.41% in the validation cohort. We incorporated nine distinct independent risk factors into the ANN model's development. Notably, in the training group, the area under the receiver operating characteristic (AUROC) curve for the ANN model was reported as 0.929 (95% CI 0.910-0.948), and the C-index was 0.917 (95% CI 0.907-0.927). These results were significantly superior to those of the mREACHE-B(0.700, 95% CI 0.639-0.761), mPAGE-B(0.800, 95% CI 0.757-0.844), HCC-RESCUE(0.787, 95% CI 0.732-0.837), CAMD(0.760, 95% CI 0.708-0.812), REAL-B(0.767, 95% CI 0.719-0.816), and PAGE-B(0.760, 95% CI 0.712-0.808) models (p < 0.001). The ANN model proficiently categorized patients into low-risk and high-risk groups based on their 10-year projections. In the training cohort, the positive predictive value (PPV) for the incidence of liver cancer in low-risk individuals was 92.5% (95% CI 0.921-0.939), whereas the negative predictive value (NPV) stood at 88.2% (95% CI 0.870-0.894). Among high-risk patients, the PPV reached 94.6% (95% CI 0.936-0.956) and the NPV was 90.2% (95% CI 0.897-0.917). These results were also confirmed in the independent validation cohort.
Conclusion: The model utilizing artificial neural networks demonstrates strong performance in personalized predictions and could assist in assessing the likelihood of a 10-year risk of HCC in patients suffering from CHB.