Sisi Feng, Manli Zhou, Zixin Huang, Xiaomin Xiao, Baiyun Zhong
{"title":"A machine learning-based prediction model for colorectal liver metastasis.","authors":"Sisi Feng, Manli Zhou, Zixin Huang, Xiaomin Xiao, Baiyun Zhong","doi":"10.1007/s10238-025-01699-8","DOIUrl":null,"url":null,"abstract":"<p><p>Colorectal liver metastasis (CRLM) is a primary factor contributing to poor prognosis and metastasis in colorectal cancer (CRC) patients. This study aims to develop and validate a machine learning (ML)-based risk prediction model using conventional clinical data to forecast the occurrence of CRLM. This retrospective study analyzed the clinical data of 865 CRC patients between January 2018 and September 2024. Patients were categorized into non-CRLM and CRLM groups. The least absolute shrinkage and selection operator regression was employed to identify key clinical variables, and five ML algorithms were utilized to develop prediction models. The optimal model was selected based on performance metrics including the receiver operating characteristic curve, precision-recall curve, decision curve analysis, and calibration curve, which collectively evaluated both the predictive accuracy and clinical utility of the model. Among the five ML algorithms evaluated, Random forest demonstrated the best performance. Leveraging the Random forest algorithm, we developed the CRLM-Lab6 prediction model, which incorporates six features: LDH, CA199, ALT, CEA, TBIL, and AGR. This model exhibits robust predictive performance, achieving an area under the curve of 0.94, a sensitivity of 0.88, and a specificity of 0.93. To enhance its practical utility, the model has been integrated into an accessible web application. This study developed a novel risk prediction model by integrating ML algorithms with conventional laboratory test data to evaluate the likelihood of CRLM occurrence. The model demonstrates excellent predictive performance and has significant clinical application potential.</p>","PeriodicalId":10337,"journal":{"name":"Clinical and Experimental Medicine","volume":"25 1","pages":"156"},"PeriodicalIF":3.2000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12069471/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Experimental Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10238-025-01699-8","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Colorectal liver metastasis (CRLM) is a primary factor contributing to poor prognosis and metastasis in colorectal cancer (CRC) patients. This study aims to develop and validate a machine learning (ML)-based risk prediction model using conventional clinical data to forecast the occurrence of CRLM. This retrospective study analyzed the clinical data of 865 CRC patients between January 2018 and September 2024. Patients were categorized into non-CRLM and CRLM groups. The least absolute shrinkage and selection operator regression was employed to identify key clinical variables, and five ML algorithms were utilized to develop prediction models. The optimal model was selected based on performance metrics including the receiver operating characteristic curve, precision-recall curve, decision curve analysis, and calibration curve, which collectively evaluated both the predictive accuracy and clinical utility of the model. Among the five ML algorithms evaluated, Random forest demonstrated the best performance. Leveraging the Random forest algorithm, we developed the CRLM-Lab6 prediction model, which incorporates six features: LDH, CA199, ALT, CEA, TBIL, and AGR. This model exhibits robust predictive performance, achieving an area under the curve of 0.94, a sensitivity of 0.88, and a specificity of 0.93. To enhance its practical utility, the model has been integrated into an accessible web application. This study developed a novel risk prediction model by integrating ML algorithms with conventional laboratory test data to evaluate the likelihood of CRLM occurrence. The model demonstrates excellent predictive performance and has significant clinical application potential.
期刊介绍:
Clinical and Experimental Medicine (CEM) is a multidisciplinary journal that aims to be a forum of scientific excellence and information exchange in relation to the basic and clinical features of the following fields: hematology, onco-hematology, oncology, virology, immunology, and rheumatology. The journal publishes reviews and editorials, experimental and preclinical studies, translational research, prospectively designed clinical trials, and epidemiological studies. Papers containing new clinical or experimental data that are likely to contribute to changes in clinical practice or the way in which a disease is thought about will be given priority due to their immediate importance. Case reports will be accepted on an exceptional basis only, and their submission is discouraged. The major criteria for publication are clarity, scientific soundness, and advances in knowledge. In compliance with the overwhelmingly prevailing request by the international scientific community, and with respect for eco-compatibility issues, CEM is now published exclusively online.