{"title":"Development of binary-based prediction models for colorectal polyps","authors":"Aaron Morelos-Gomez , Kohjiro Tokutake , Ken-ichi Hoshi , Akira Matsushima , Armando David Martinez-Iniesta , Michio Katouda , Syogo Tejima , Morinobu Endo","doi":"10.1016/j.ibmed.2025.100236","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and aims</h3><div>Even though several colorectal cancer (CRC) screening strategies can lower CRC mortality, screening rates remain low. Removing polyps to achieve a clean colon is effective in preventing CRC. This study evaluated the possibility of using artificial intelligence to select features and threshold values required to construct an optimal screening model to prevent colorectal neoplasia.</div></div><div><h3>Methods</h3><div>The collected data consisted of medical check-ups, blood analysis, demographics, colonoscopy observations, and fecal immunochemical test (FIT). The data was divided according to sex and used to construct a screening model that converted each feature into a zero or a one based on a threshold value obtained through particle swarm optimization and the best group of features was selected by sequential combinations. Three optimization targets were evaluated: Mathew's correlation coefficient, the area under the curve, and the minimum between sensitivity and specificity.</div></div><div><h3>Results</h3><div>Using the minimum between sensitivity and specificity as an optimization target the obtained models yielded better overall prediction metrics. The optimization algorithm selected three features for women and ten features for men. The optimized models for both sexes agree that obesity is determinant for predicting polyps according to the selected features. In addition, both models outperform traditional FIT which is used for colorectal cancer screening.</div></div><div><h3>Conclusions</h3><div>The developed algorithm is effective in creating polyp screening models for men and women based on medical data with higher prediction metrics than FIT. In addition, the obtained threshold values and prediction probability can act as a guide for medical practitioners.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100236"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666521225000407","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background and aims
Even though several colorectal cancer (CRC) screening strategies can lower CRC mortality, screening rates remain low. Removing polyps to achieve a clean colon is effective in preventing CRC. This study evaluated the possibility of using artificial intelligence to select features and threshold values required to construct an optimal screening model to prevent colorectal neoplasia.
Methods
The collected data consisted of medical check-ups, blood analysis, demographics, colonoscopy observations, and fecal immunochemical test (FIT). The data was divided according to sex and used to construct a screening model that converted each feature into a zero or a one based on a threshold value obtained through particle swarm optimization and the best group of features was selected by sequential combinations. Three optimization targets were evaluated: Mathew's correlation coefficient, the area under the curve, and the minimum between sensitivity and specificity.
Results
Using the minimum between sensitivity and specificity as an optimization target the obtained models yielded better overall prediction metrics. The optimization algorithm selected three features for women and ten features for men. The optimized models for both sexes agree that obesity is determinant for predicting polyps according to the selected features. In addition, both models outperform traditional FIT which is used for colorectal cancer screening.
Conclusions
The developed algorithm is effective in creating polyp screening models for men and women based on medical data with higher prediction metrics than FIT. In addition, the obtained threshold values and prediction probability can act as a guide for medical practitioners.