Using Intelligent Screening Service Platform (ISSP) to improve the screening process of clinical trial subjects during COVID-19 pandemic: an experimental study

IF 1.3 3区 计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Bin Li, Runfang Guo, Huan Zhou, Yuanyuan Liu, Xiaolei Zhang, Qian Zhang
{"title":"Using Intelligent Screening Service Platform (ISSP) to improve the screening process of clinical trial subjects during COVID-19 pandemic: an experimental study","authors":"Bin Li, Runfang Guo, Huan Zhou, Yuanyuan Liu, Xiaolei Zhang, Qian Zhang","doi":"10.1162/dint_a_00253","DOIUrl":null,"url":null,"abstract":"\n Background: During the COVID-19 pandemic, clinical trial recruitment cannot be carried out due to travel restrictions, transmission risks and other factors, resulting in the stagnation of a large number of ongoing or upcoming clinical trials.\n Objective: An intelligent screening app was developed using artificial intelligence technology to rapidly pre-screen potential patients for phase I solid tumor drug clinical trials.\n Methods: A total of 429 screening process records were collected from 27 phase I solid tumor drug clinical trials at the First Affiliated Hospital of Bengbu Medical College from April 2018 to May 2021. Features of the experimental data were analyzed, and the collinearity (principal component analysis) and strong correlation (χ2 test) among features were eliminated. XGBoost, Random Forest, and Naive Bayes were used to sort the weight importance of features. Finally, the pre-screening models were constructed using classification machine learning algorithm, and the optimal model was selected.\n Results: Among the 429 screening records, 33 were data generated by repeated subject participation in different clinical trials, and of the remaining 396 screening records, 246 (62.12%) were screened successfully. The gold standard for subject screening success is the final judgment made by the principal investigator (PI) based on the clinical trial protocol. A Venn diagram was used to identify the important feature intersections of machine learning algorithms. After intersecting the top 15 characteristic variables of different feature screening models, 9 common variables were obtained: age, sex, distance from residence to the central institution, tumor histology, tumor stage, tumorectomy, the interval from diagnosis/postoperative to screening, chemotherapy, and ECOG (Eastern Cooperative Oncology Group, ECOG) score. To select the optimal subset, the 9 important feature variables were expanded to 12 and 15 feature subsets, and the performance of different feature subsets under different machine learning models was validated. The results showed that optimal performance, accuracy and practicability were achieved using XGBoost with the 12 feature subset. The final model could accurately predict the screening success rates in both internal (AUC =3D 0.895) and external (AUC =3D 0.796) validation, and has been transformed into a convenient tool to facilitate its application in the clinical settings. Subjects with a probability exceeding or equaling to the threshold in the final model had a higher probability to be successfully screened.\n Conclusion: Based on the optimal model, we created an online prediction calculator and visualization app -- ISSP (Intelligent Screening Service Platform), which can rapidly screen patients for phase I solid tumor drug clinical trials. ISSP can effectively solve the problem of space and time interval. On the mobile terminal, it realizes the matching between clinical trial projects and patients, and completes the rapid screening of clinical trial subjects, so as to obtain more clinical trial subjects. As an auxiliary tool, ISSP optimizes the screening process of clinical trials and provides more convenient services for clinical investigators and patients.","PeriodicalId":34023,"journal":{"name":"Data Intelligence","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1162/dint_a_00253","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: During the COVID-19 pandemic, clinical trial recruitment cannot be carried out due to travel restrictions, transmission risks and other factors, resulting in the stagnation of a large number of ongoing or upcoming clinical trials. Objective: An intelligent screening app was developed using artificial intelligence technology to rapidly pre-screen potential patients for phase I solid tumor drug clinical trials. Methods: A total of 429 screening process records were collected from 27 phase I solid tumor drug clinical trials at the First Affiliated Hospital of Bengbu Medical College from April 2018 to May 2021. Features of the experimental data were analyzed, and the collinearity (principal component analysis) and strong correlation (χ2 test) among features were eliminated. XGBoost, Random Forest, and Naive Bayes were used to sort the weight importance of features. Finally, the pre-screening models were constructed using classification machine learning algorithm, and the optimal model was selected. Results: Among the 429 screening records, 33 were data generated by repeated subject participation in different clinical trials, and of the remaining 396 screening records, 246 (62.12%) were screened successfully. The gold standard for subject screening success is the final judgment made by the principal investigator (PI) based on the clinical trial protocol. A Venn diagram was used to identify the important feature intersections of machine learning algorithms. After intersecting the top 15 characteristic variables of different feature screening models, 9 common variables were obtained: age, sex, distance from residence to the central institution, tumor histology, tumor stage, tumorectomy, the interval from diagnosis/postoperative to screening, chemotherapy, and ECOG (Eastern Cooperative Oncology Group, ECOG) score. To select the optimal subset, the 9 important feature variables were expanded to 12 and 15 feature subsets, and the performance of different feature subsets under different machine learning models was validated. The results showed that optimal performance, accuracy and practicability were achieved using XGBoost with the 12 feature subset. The final model could accurately predict the screening success rates in both internal (AUC =3D 0.895) and external (AUC =3D 0.796) validation, and has been transformed into a convenient tool to facilitate its application in the clinical settings. Subjects with a probability exceeding or equaling to the threshold in the final model had a higher probability to be successfully screened. Conclusion: Based on the optimal model, we created an online prediction calculator and visualization app -- ISSP (Intelligent Screening Service Platform), which can rapidly screen patients for phase I solid tumor drug clinical trials. ISSP can effectively solve the problem of space and time interval. On the mobile terminal, it realizes the matching between clinical trial projects and patients, and completes the rapid screening of clinical trial subjects, so as to obtain more clinical trial subjects. As an auxiliary tool, ISSP optimizes the screening process of clinical trials and provides more convenient services for clinical investigators and patients.
利用智能筛选服务平台(ISSP)改进 COVID-19 大流行期间临床试验受试者的筛选过程:一项实验研究
背景:在 COVID-19 大流行期间,由于旅行限制、传播风险等因素,临床试验招募工作无法进行,导致大量正在进行或即将进行的临床试验停滞不前。目标:利用人工智能技术开发了一款智能筛查应用程序,用于快速预筛I期实体瘤药物临床试验的潜在患者。方法:使用人工智能技术开发了一款智能筛选应用程序,用于快速预筛选 I 期实体肿瘤药物临床试验的潜在患者:收集了2018年4月至2021年5月蚌埠医学院第一附属医院27项I期实体瘤药物临床试验的筛选过程记录,共计429条。对实验数据进行特征分析,剔除特征间的共线性(主成分分析)和强相关性(χ2检验)。使用 XGBoost、随机森林和 Naive Bayes 对特征的权重重要性进行排序。最后,使用分类机器学习算法构建预筛选模型,并选出最优模型。结果在 429 条筛查记录中,有 33 条是受试者重复参与不同临床试验产生的数据,在剩余的 396 条筛查记录中,有 246 条(62.12%)筛查成功。受试者筛选成功与否的金标准是主要研究者(PI)根据临床试验方案做出的最终判断。维恩图用于确定机器学习算法的重要特征交叉点。将不同特征筛选模型的前 15 个特征变量交叉后,得到了 9 个共同变量:年龄、性别、居住地到中心机构的距离、肿瘤组织学、肿瘤分期、肿瘤切除术、从诊断/术后到筛选的时间间隔、化疗和 ECOG(东部合作肿瘤学组,Eastern Cooperative Oncology Group)评分。为了选择最佳子集,将 9 个重要特征变量扩展为 12 个和 15 个特征子集,并验证了不同特征子集在不同机器学习模型下的性能。结果表明,使用 12 个特征子集的 XGBoost 模型实现了最佳性能、准确性和实用性。最终模型在内部验证(AUC =3D 0.895)和外部验证(AUC =3D 0.796)中都能准确预测筛查成功率,并已转化为方便的工具,便于在临床中应用。在最终模型中,概率超过或等于阈值的受试者被筛查成功的概率更高。结论基于最优模型,我们创建了一个在线预测计算器和可视化应用程序--ISSP(智能筛查服务平台),它可以快速筛查实体瘤药物临床试验 I 期的患者。ISSP 能有效解决空间和时间间隔问题。在移动终端上,实现临床试验项目与患者的匹配,完成临床试验受试者的快速筛选,从而获得更多的临床试验受试者。作为辅助工具,ISSP 优化了临床试验筛选流程,为临床研究者和患者提供了更便捷的服务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Data Intelligence
Data Intelligence COMPUTER SCIENCE, INFORMATION SYSTEMS-
CiteScore
6.50
自引率
15.40%
发文量
40
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信