Machine learning model for predicting hepatitis C seroconversion in methadone maintenance patients in China.

BMJ public health Pub Date : 2025-08-28 eCollection Date: 2025-01-01 DOI:10.1136/bmjph-2024-002290
Xinyu Lu, Qing Yue, He Jing, Gangliang Zhong, Zhen Ning, Jiang Du, Min Zhao
{"title":"Machine learning model for predicting hepatitis C seroconversion in methadone maintenance patients in China.","authors":"Xinyu Lu, Qing Yue, He Jing, Gangliang Zhong, Zhen Ning, Jiang Du, Min Zhao","doi":"10.1136/bmjph-2024-002290","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Hepatitis C virus (HCV) infection is a substantial public health concern, particularly among individuals with opioid addiction. The methadone maintenance treatment (MMT) programmes serve as a harm reduction strategy to mitigate HIV disease spread, yet the risk of HCV infection remains high within these settings. Accurate risk prediction for HCV seroconversion is therefore crucial for improving patient outcomes.</p><p><strong>Methods: </strong>We collected data from 1547 individuals with opioid use disorder who entered the MMT programme from May 2005 to October 2023 in Shanghai, China, and 283 individuals from July 2006 to October 2023 in Mianyang, China, whose HCV infection status was monitored. Shanghai data were divided into training and validation sets in a 7:3 ratio, with 70% of the Shanghai samples used for model training and the remaining 30% reserved for internal validation. Additionally, the Mianyang dataset was employed as an independent external validation cohort to assess the model's generalisability. Four machine learning models were developed. We then validated the predictive performance of the model using C-index, receiver-operating characteristic curves, calibration plots and decision curve analysis.</p><p><strong>Results: </strong>13 predictive factors, including sex, age, ethnicity, education, occupation status, marriage status, living status, financial resource, drug use method in the past half year, injected drug last month, condom use, average methadone dosage and positive rate of drug urine tests, were all incorporated into the predictive model. The eXtreme Gradient Boosting (XGBoost) model exhibited superior performance in both discrimination and calibration compared with the other three models. Specifically, it achieved C-indices of 0.793 (95% CI: 0.771 to 0.813) in the training cohort, 0.744 (0.709 to 0.779) in the internal validation cohort and 0.756 (0.712 to 0.799) in the external validation cohort for predicting HCV seroconversion. A publicly accessible web tool was generated for the model.</p><p><strong>Conclusions: </strong>The developed XGBoost model has the potential to accurately predict individuals on MMT programmes at high risk of HCV seroconversion.</p>","PeriodicalId":101362,"journal":{"name":"BMJ public health","volume":"3 2","pages":"e002290"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12410665/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ public health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjph-2024-002290","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Hepatitis C virus (HCV) infection is a substantial public health concern, particularly among individuals with opioid addiction. The methadone maintenance treatment (MMT) programmes serve as a harm reduction strategy to mitigate HIV disease spread, yet the risk of HCV infection remains high within these settings. Accurate risk prediction for HCV seroconversion is therefore crucial for improving patient outcomes.

Methods: We collected data from 1547 individuals with opioid use disorder who entered the MMT programme from May 2005 to October 2023 in Shanghai, China, and 283 individuals from July 2006 to October 2023 in Mianyang, China, whose HCV infection status was monitored. Shanghai data were divided into training and validation sets in a 7:3 ratio, with 70% of the Shanghai samples used for model training and the remaining 30% reserved for internal validation. Additionally, the Mianyang dataset was employed as an independent external validation cohort to assess the model's generalisability. Four machine learning models were developed. We then validated the predictive performance of the model using C-index, receiver-operating characteristic curves, calibration plots and decision curve analysis.

Results: 13 predictive factors, including sex, age, ethnicity, education, occupation status, marriage status, living status, financial resource, drug use method in the past half year, injected drug last month, condom use, average methadone dosage and positive rate of drug urine tests, were all incorporated into the predictive model. The eXtreme Gradient Boosting (XGBoost) model exhibited superior performance in both discrimination and calibration compared with the other three models. Specifically, it achieved C-indices of 0.793 (95% CI: 0.771 to 0.813) in the training cohort, 0.744 (0.709 to 0.779) in the internal validation cohort and 0.756 (0.712 to 0.799) in the external validation cohort for predicting HCV seroconversion. A publicly accessible web tool was generated for the model.

Conclusions: The developed XGBoost model has the potential to accurately predict individuals on MMT programmes at high risk of HCV seroconversion.

预测中国美沙酮维持患者丙型肝炎血清转化的机器学习模型。
丙型肝炎病毒(HCV)感染是一个重大的公共卫生问题,特别是在阿片类药物成瘾者中。美沙酮维持治疗(MMT)规划作为一种减轻艾滋病毒疾病传播的危害战略,但在这些环境中丙型肝炎病毒感染的风险仍然很高。因此,HCV血清转化的准确风险预测对于改善患者预后至关重要。方法:我们收集了2005年5月至2023年10月在中国上海进入MMT计划的1547名阿片类药物使用障碍患者和2006年7月至2023年10月在中国绵阳监测HCV感染状况的283名患者的数据。上海数据以7:3的比例分为训练集和验证集,其中70%的上海样本用于模型训练,其余30%用于内部验证。此外,采用绵阳数据集作为独立的外部验证队列来评估模型的通用性。开发了四种机器学习模型。然后,我们使用c指数、接受者工作特征曲线、校准图和决策曲线分析验证了模型的预测性能。结果:将性别、年龄、民族、文化程度、职业状况、婚姻状况、生活状况、经济状况、近半年用药方式、上月注射药物、安全套使用情况、美沙酮平均剂量、药尿检测阳性率等13项预测因素纳入预测模型。与其他三种模型相比,极端梯度增强(XGBoost)模型在识别和校准方面都表现出优异的性能。具体来说,在预测HCV血清转化方面,训练组的c指数为0.793 (95% CI: 0.771 ~ 0.813),内部验证组为0.744(0.709 ~ 0.779),外部验证组为0.756(0.712 ~ 0.799)。为该模型生成了一个可公开访问的web工具。结论:开发的XGBoost模型具有准确预测MMT计划中HCV血清转化高风险个体的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信