Naim Abu-Freha , Zaid Afawi , Miar Yousef , Walid Alamor , Noor Sanalla , Simon Esbit , Malik Yousef
{"title":"A machine learning approach to differentiate stage IV from stage I colorectal cancer","authors":"Naim Abu-Freha , Zaid Afawi , Miar Yousef , Walid Alamor , Noor Sanalla , Simon Esbit , Malik Yousef","doi":"10.1016/j.compbiomed.2025.110179","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and aim</h3><div>The stage at which Colorectal cancer (CRC) diagnosed is a crucial prognostic factor. Our study proposed a novel approach to aid in the diagnosis of stage IV CRC by utilizing supervised machine learning, analyzing clinical history, and laboratory values, comparing them with those of stage I CRC.</div></div><div><h3>Methods</h3><div>We conducted a respective study using patients diagnosed with stage I (n = 433) and stage IV CRC (n = 457). We employed supervised machine learning using random forest. The decision tree is used to visualize the model to identify key clinical and laboratory factors that differentiate between stage IV and stage I CRC.</div></div><div><h3>Results</h3><div>The decision tree classifier revealed that symptoms combined with laboratory values were critical predictors of stage IV CRC. Change in bowel habits was predictive for stage IV CRC among 14 of 22 patients (63 %). Weight loss, constipation, and abdominal pain in combination with different levels of carcinoembryonic antigen (CEA) were predictors for stage IV CRC. A CEA level higher than 260 was indicative for stage IV CRC in all observed patients (61 out of 61 patients). Additionally, a lower CEA level, in combination with hemoglobin, white blood cell count, and platelet count, also predicted stage IV CRC.</div></div><div><h3>Conclusions</h3><div>By applying a machine learning based approach, we identified symptoms and laboratory values (CEA, hemoglobin, white blood cell count, and platelet count), as crucial predictors for stage IV CRC diagnosis. This method holds potential for facilitating the diagnosis of stage IV CRC in clinical practice, even before imaging tests are conducted.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"191 ","pages":"Article 110179"},"PeriodicalIF":7.0000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S001048252500530X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and aim
The stage at which Colorectal cancer (CRC) diagnosed is a crucial prognostic factor. Our study proposed a novel approach to aid in the diagnosis of stage IV CRC by utilizing supervised machine learning, analyzing clinical history, and laboratory values, comparing them with those of stage I CRC.
Methods
We conducted a respective study using patients diagnosed with stage I (n = 433) and stage IV CRC (n = 457). We employed supervised machine learning using random forest. The decision tree is used to visualize the model to identify key clinical and laboratory factors that differentiate between stage IV and stage I CRC.
Results
The decision tree classifier revealed that symptoms combined with laboratory values were critical predictors of stage IV CRC. Change in bowel habits was predictive for stage IV CRC among 14 of 22 patients (63 %). Weight loss, constipation, and abdominal pain in combination with different levels of carcinoembryonic antigen (CEA) were predictors for stage IV CRC. A CEA level higher than 260 was indicative for stage IV CRC in all observed patients (61 out of 61 patients). Additionally, a lower CEA level, in combination with hemoglobin, white blood cell count, and platelet count, also predicted stage IV CRC.
Conclusions
By applying a machine learning based approach, we identified symptoms and laboratory values (CEA, hemoglobin, white blood cell count, and platelet count), as crucial predictors for stage IV CRC diagnosis. This method holds potential for facilitating the diagnosis of stage IV CRC in clinical practice, even before imaging tests are conducted.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.