Development of binary-based prediction models for colorectal polyps

Aaron Morelos-Gomez , Kohjiro Tokutake , Ken-ichi Hoshi , Akira Matsushima , Armando David Martinez-Iniesta , Michio Katouda , Syogo Tejima , Morinobu Endo
{"title":"Development of binary-based prediction models for colorectal polyps","authors":"Aaron Morelos-Gomez ,&nbsp;Kohjiro Tokutake ,&nbsp;Ken-ichi Hoshi ,&nbsp;Akira Matsushima ,&nbsp;Armando David Martinez-Iniesta ,&nbsp;Michio Katouda ,&nbsp;Syogo Tejima ,&nbsp;Morinobu Endo","doi":"10.1016/j.ibmed.2025.100236","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and aims</h3><div>Even though several colorectal cancer (CRC) screening strategies can lower CRC mortality, screening rates remain low. Removing polyps to achieve a clean colon is effective in preventing CRC. This study evaluated the possibility of using artificial intelligence to select features and threshold values required to construct an optimal screening model to prevent colorectal neoplasia.</div></div><div><h3>Methods</h3><div>The collected data consisted of medical check-ups, blood analysis, demographics, colonoscopy observations, and fecal immunochemical test (FIT). The data was divided according to sex and used to construct a screening model that converted each feature into a zero or a one based on a threshold value obtained through particle swarm optimization and the best group of features was selected by sequential combinations. Three optimization targets were evaluated: Mathew's correlation coefficient, the area under the curve, and the minimum between sensitivity and specificity.</div></div><div><h3>Results</h3><div>Using the minimum between sensitivity and specificity as an optimization target the obtained models yielded better overall prediction metrics. The optimization algorithm selected three features for women and ten features for men. The optimized models for both sexes agree that obesity is determinant for predicting polyps according to the selected features. In addition, both models outperform traditional FIT which is used for colorectal cancer screening.</div></div><div><h3>Conclusions</h3><div>The developed algorithm is effective in creating polyp screening models for men and women based on medical data with higher prediction metrics than FIT. In addition, the obtained threshold values and prediction probability can act as a guide for medical practitioners.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100236"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666521225000407","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background and aims

Even though several colorectal cancer (CRC) screening strategies can lower CRC mortality, screening rates remain low. Removing polyps to achieve a clean colon is effective in preventing CRC. This study evaluated the possibility of using artificial intelligence to select features and threshold values required to construct an optimal screening model to prevent colorectal neoplasia.

Methods

The collected data consisted of medical check-ups, blood analysis, demographics, colonoscopy observations, and fecal immunochemical test (FIT). The data was divided according to sex and used to construct a screening model that converted each feature into a zero or a one based on a threshold value obtained through particle swarm optimization and the best group of features was selected by sequential combinations. Three optimization targets were evaluated: Mathew's correlation coefficient, the area under the curve, and the minimum between sensitivity and specificity.

Results

Using the minimum between sensitivity and specificity as an optimization target the obtained models yielded better overall prediction metrics. The optimization algorithm selected three features for women and ten features for men. The optimized models for both sexes agree that obesity is determinant for predicting polyps according to the selected features. In addition, both models outperform traditional FIT which is used for colorectal cancer screening.

Conclusions

The developed algorithm is effective in creating polyp screening models for men and women based on medical data with higher prediction metrics than FIT. In addition, the obtained threshold values and prediction probability can act as a guide for medical practitioners.

Abstract Image

结直肠息肉二值预测模型的建立
背景与目的虽然有几种结直肠癌筛查策略可以降低结直肠癌死亡率,但筛查率仍然很低。切除息肉以清洁结肠是预防结直肠癌的有效方法。本研究评估了使用人工智能选择特征和阈值以构建预防结直肠肿瘤的最佳筛选模型的可能性。方法收集的资料包括体格检查、血液分析、人口统计学、结肠镜检查和粪便免疫化学试验(FIT)。将数据按性别划分,构建筛选模型,根据粒子群优化得到的阈值将每个特征转换为0或1,并通过顺序组合选择最佳特征组。对3个优化指标进行评价:马修相关系数、曲线下面积、敏感性与特异性之间的最小值。结果以敏感性和特异性之间的最小值为优化目标,所得模型具有较好的整体预测指标。优化算法为女性选择了3个特征,为男性选择了10个特征。针对两性的优化模型一致认为,根据选择的特征,肥胖是预测息肉的决定性因素。此外,这两种模型都优于用于结直肠癌筛查的传统FIT。结论该算法可有效地建立基于医学数据的男性和女性息肉筛查模型,预测指标高于FIT。此外,得到的阈值和预测概率可以作为医生的指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Intelligence-based medicine
Intelligence-based medicine Health Informatics
CiteScore
5.00
自引率
0.00%
发文量
0
审稿时长
187 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信