Machine Learning-Based Prediction of Helicobacter pylori Infection Study in Adults.

IF 3.1 4区 医学 Q1 Medicine
Min Liu, Shiyu Liu, Zhaolin Lu, Hu Chen, Yuling Xu, Xue Gong, Guangxia Chen
{"title":"Machine Learning-Based Prediction of Helicobacter pylori Infection Study in Adults.","authors":"Min Liu, Shiyu Liu, Zhaolin Lu, Hu Chen, Yuling Xu, Xue Gong, Guangxia Chen","doi":"10.12659/MSM.943666","DOIUrl":null,"url":null,"abstract":"<p><p>BACKGROUND Helicobacter pylori has a high infection rate worldwide, and epidemiological study of H. pylori is important. Artificial intelligence has been widely used in the field of medical research and has become a hotspot in recent years. This paper proposed a prediction model for H. pylori infection based on machine learning in adults. MATERIAL AND METHODS Adult patients were selected as research participants, and information on 30 factors was collected. The chi-square test, mutual information, ReliefF, and information gain were used to screen the feature factors and establish 2 subsets. We constructed an H. pylori infection prediction model based on XGBoost and optimized the model using a grid search by analyzing the correlation between features. The performance of the model was assessed by comparing its accuracy, recall, precision, F1 score, and AUC with those of 4 other classical machine learning methods. RESULTS The model performed better on the part B subset than on the part A subset. Compared with the other 4 machine learning methods, the model had the highest accuracy, recall, F1 score, and AUC. SHAP was used to evaluate the importance of features in the model. It was found that H. pylori infection of family members, living in rural areas, poor washing hands before meals and after using the toilet were risk factors for H. pylori infection. CONCLUSIONS The model proposed in this paper is superior to other models in predicting H. pylori infection and can provide a scientific basis for identifying the population susceptible to H. pylori and preventing H. pylori infection.</p>","PeriodicalId":48888,"journal":{"name":"Medical Science Monitor","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2024-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11168235/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical Science Monitor","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.12659/MSM.943666","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

BACKGROUND Helicobacter pylori has a high infection rate worldwide, and epidemiological study of H. pylori is important. Artificial intelligence has been widely used in the field of medical research and has become a hotspot in recent years. This paper proposed a prediction model for H. pylori infection based on machine learning in adults. MATERIAL AND METHODS Adult patients were selected as research participants, and information on 30 factors was collected. The chi-square test, mutual information, ReliefF, and information gain were used to screen the feature factors and establish 2 subsets. We constructed an H. pylori infection prediction model based on XGBoost and optimized the model using a grid search by analyzing the correlation between features. The performance of the model was assessed by comparing its accuracy, recall, precision, F1 score, and AUC with those of 4 other classical machine learning methods. RESULTS The model performed better on the part B subset than on the part A subset. Compared with the other 4 machine learning methods, the model had the highest accuracy, recall, F1 score, and AUC. SHAP was used to evaluate the importance of features in the model. It was found that H. pylori infection of family members, living in rural areas, poor washing hands before meals and after using the toilet were risk factors for H. pylori infection. CONCLUSIONS The model proposed in this paper is superior to other models in predicting H. pylori infection and can provide a scientific basis for identifying the population susceptible to H. pylori and preventing H. pylori infection.

基于机器学习的成人幽门螺旋杆菌感染预测研究。
背景幽门螺杆菌在全球的感染率很高,对幽门螺杆菌进行流行病学研究具有重要意义。人工智能已广泛应用于医学研究领域,成为近年来的研究热点。本文提出了一种基于机器学习的成人幽门螺杆菌感染预测模型。材料与方法 选择成年患者作为研究对象,收集 30 个因素的信息。利用卡方检验、互信息、ReliefF 和信息增益筛选特征因子,并建立了两个子集。我们构建了基于 XGBoost 的幽门螺杆菌感染预测模型,并通过分析特征之间的相关性,使用网格搜索对模型进行了优化。通过比较该模型与其他 4 种经典机器学习方法的准确率、召回率、精确度、F1 分数和 AUC,评估了该模型的性能。结果 该模型在 B 部分子集上的表现优于 A 部分子集。与其他 4 种机器学习方法相比,该模型具有最高的准确率、召回率、F1 分数和 AUC。利用 SHAP 评估了模型中特征的重要性。结果发现,家庭成员感染幽门螺杆菌、居住在农村地区、饭前便后洗手不彻底是幽门螺杆菌感染的风险因素。结论 本文提出的模型在预测幽门螺杆菌感染方面优于其他模型,可为确定幽门螺杆菌易感人群和预防幽门螺杆菌感染提供科学依据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Medical Science Monitor
Medical Science Monitor MEDICINE, RESEARCH & EXPERIMENTAL-
CiteScore
6.40
自引率
3.20%
发文量
514
审稿时长
3.0 months
期刊介绍: Medical Science Monitor (MSM) established in 1995 is an international, peer-reviewed scientific journal which publishes original articles in Clinical Medicine and related disciplines such as Epidemiology and Population Studies, Product Investigations, Development of Laboratory Techniques :: Diagnostics and Medical Technology which enable presentation of research or review works in overlapping areas of medicine and technology such us (but not limited to): medical diagnostics, medical imaging systems, computer simulation of health and disease processes, new medical devices, etc. Reviews and Special Reports - papers may be accepted on the basis that they provide a systematic, critical and up-to-date overview of literature pertaining to research or clinical topics. Meta-analyses are considered as reviews. A special attention will be paid to a teaching value of a review paper. Medical Science Monitor is internationally indexed in Thomson-Reuters Web of Science, Journals Citation Report (JCR), Science Citation Index Expanded (SCI), Index Medicus MEDLINE, PubMed, PMC, EMBASE/Excerpta Medica, Chemical Abstracts CAS and Index Copernicus.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信