影响早期胃癌发病率的因素:贝叶斯网络分析。

IF 2.5 3区 医学 Q2 GASTROENTEROLOGY & HEPATOLOGY
Ruiyu Li, Taiming Yang, Zi Dong, Yin Gao, Nan Li, Ting Song, Jinshu Sun, Ying Chen
{"title":"影响早期胃癌发病率的因素:贝叶斯网络分析。","authors":"Ruiyu Li, Taiming Yang, Zi Dong, Yin Gao, Nan Li, Ting Song, Jinshu Sun, Ying Chen","doi":"10.1186/s12876-025-03765-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>This study aims to establish a Bayesian network risk prediction model for gastric cancer using data mining methods. It explores both direct and indirect factors influencing the incidence of gastric cancer and reveals the interrelationships among these factors.</p><p><strong>Methods: </strong>Data were collected from early cancer screenings conducted at the People's Hospital of Lincang between 2022 and 2023. Initial variable selection was performed using Least Absolute Shrinkage and Selection Operator (Lasso) and Sliding Windows Sequential Forward Selection (SWSFS), and the screened variables and demographic characteristics features were used as variables for constructing the Bayesian network (BN) model. Subsequently, the performance of the models was evaluated, and the optimal model was selected for network mapping and Bayesian inference using the best model.</p><p><strong>Results: </strong>The incidence rate of gastric cancer in this region's high-risk population was determined to be 7.09%. The BN model constructed from the set of variables consisting of Lasso's selection variables and demographic characteristics had better performance. A total of 12 variables were incorporated into the BN model to form a network structure consisting of 13 nodes and 18 edges. The model shows that age, gender, ethnicity, current address, upper gastrointestinal symptoms (nausea, acid reflux, vomiting), alcohol consumption, smoking, SGIM gastritis, and family history are important risk factors for gastric cancer development.</p><p><strong>Conclusion: </strong>The Bayesian network model provides an intuitive framework for understanding the direct and indirect factors contributing to the early onset of gastric cancer, elucidating the interrelationships among these factors. Furthermore, the model demonstrates satisfactory predictive performance, which may facilitate the early detection of gastric cancer and enhance the levels of early diagnosis and treatment among high-risk populations.</p>","PeriodicalId":9129,"journal":{"name":"BMC Gastroenterology","volume":"25 1","pages":"194"},"PeriodicalIF":2.5000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11927266/pdf/","citationCount":"0","resultStr":"{\"title\":\"Factors influencing the incidence of early gastric cancer: a bayesian network analysis.\",\"authors\":\"Ruiyu Li, Taiming Yang, Zi Dong, Yin Gao, Nan Li, Ting Song, Jinshu Sun, Ying Chen\",\"doi\":\"10.1186/s12876-025-03765-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>This study aims to establish a Bayesian network risk prediction model for gastric cancer using data mining methods. It explores both direct and indirect factors influencing the incidence of gastric cancer and reveals the interrelationships among these factors.</p><p><strong>Methods: </strong>Data were collected from early cancer screenings conducted at the People's Hospital of Lincang between 2022 and 2023. Initial variable selection was performed using Least Absolute Shrinkage and Selection Operator (Lasso) and Sliding Windows Sequential Forward Selection (SWSFS), and the screened variables and demographic characteristics features were used as variables for constructing the Bayesian network (BN) model. Subsequently, the performance of the models was evaluated, and the optimal model was selected for network mapping and Bayesian inference using the best model.</p><p><strong>Results: </strong>The incidence rate of gastric cancer in this region's high-risk population was determined to be 7.09%. The BN model constructed from the set of variables consisting of Lasso's selection variables and demographic characteristics had better performance. A total of 12 variables were incorporated into the BN model to form a network structure consisting of 13 nodes and 18 edges. The model shows that age, gender, ethnicity, current address, upper gastrointestinal symptoms (nausea, acid reflux, vomiting), alcohol consumption, smoking, SGIM gastritis, and family history are important risk factors for gastric cancer development.</p><p><strong>Conclusion: </strong>The Bayesian network model provides an intuitive framework for understanding the direct and indirect factors contributing to the early onset of gastric cancer, elucidating the interrelationships among these factors. Furthermore, the model demonstrates satisfactory predictive performance, which may facilitate the early detection of gastric cancer and enhance the levels of early diagnosis and treatment among high-risk populations.</p>\",\"PeriodicalId\":9129,\"journal\":{\"name\":\"BMC Gastroenterology\",\"volume\":\"25 1\",\"pages\":\"194\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11927266/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Gastroenterology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12876-025-03765-7\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Gastroenterology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12876-025-03765-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:本研究旨在利用数据挖掘方法建立胃癌贝叶斯网络风险预测模型。探讨影响胃癌发生的直接因素和间接因素,揭示这些因素之间的相互关系。方法:收集2022 - 2023年在临沧市人民医院进行的早期癌症筛查数据。使用最小绝对收缩和选择算子(Lasso)和滑动窗口顺序前向选择(SWSFS)进行初始变量选择,并将筛选的变量和人口统计学特征作为构建贝叶斯网络(BN)模型的变量。随后,对模型的性能进行评价,选择最优模型进行网络映射和贝叶斯推理。结果:确定该地区高危人群胃癌发病率为7.09%。由Lasso选择变量和人口统计学特征组成的变量集构建的BN模型具有较好的性能。BN模型共纳入12个变量,形成13个节点18条边的网络结构。该模型显示,年龄、性别、种族、住址、上消化道症状(恶心、反酸、呕吐)、饮酒、吸烟、SGIM型胃炎和家族史是胃癌发生的重要危险因素。结论:贝叶斯网络模型为了解胃癌早期发病的直接和间接因素提供了一个直观的框架,阐明了这些因素之间的相互关系。此外,该模型具有良好的预测性能,有助于胃癌的早期发现,提高高危人群的早期诊断和治疗水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Factors influencing the incidence of early gastric cancer: a bayesian network analysis.

Background: This study aims to establish a Bayesian network risk prediction model for gastric cancer using data mining methods. It explores both direct and indirect factors influencing the incidence of gastric cancer and reveals the interrelationships among these factors.

Methods: Data were collected from early cancer screenings conducted at the People's Hospital of Lincang between 2022 and 2023. Initial variable selection was performed using Least Absolute Shrinkage and Selection Operator (Lasso) and Sliding Windows Sequential Forward Selection (SWSFS), and the screened variables and demographic characteristics features were used as variables for constructing the Bayesian network (BN) model. Subsequently, the performance of the models was evaluated, and the optimal model was selected for network mapping and Bayesian inference using the best model.

Results: The incidence rate of gastric cancer in this region's high-risk population was determined to be 7.09%. The BN model constructed from the set of variables consisting of Lasso's selection variables and demographic characteristics had better performance. A total of 12 variables were incorporated into the BN model to form a network structure consisting of 13 nodes and 18 edges. The model shows that age, gender, ethnicity, current address, upper gastrointestinal symptoms (nausea, acid reflux, vomiting), alcohol consumption, smoking, SGIM gastritis, and family history are important risk factors for gastric cancer development.

Conclusion: The Bayesian network model provides an intuitive framework for understanding the direct and indirect factors contributing to the early onset of gastric cancer, elucidating the interrelationships among these factors. Furthermore, the model demonstrates satisfactory predictive performance, which may facilitate the early detection of gastric cancer and enhance the levels of early diagnosis and treatment among high-risk populations.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Gastroenterology
BMC Gastroenterology 医学-胃肠肝病学
CiteScore
4.20
自引率
0.00%
发文量
465
审稿时长
6 months
期刊介绍: BMC Gastroenterology is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of gastrointestinal and hepatobiliary disorders, as well as related molecular genetics, pathophysiology, and epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信