Transparency in Externally Validated Models: A systematic review of machine learning vs. logistic regression for predicting colorectal anastomotic leakage
Sara Ben Hmido , Houssam Abder Rahim , Erik W. Ingwersen , George Burchell , Roel Hompes , Donald van der Peet , Eric Sonneveld , Geert Kazemier , Freek Daams
{"title":"Transparency in Externally Validated Models: A systematic review of machine learning vs. logistic regression for predicting colorectal anastomotic leakage","authors":"Sara Ben Hmido , Houssam Abder Rahim , Erik W. Ingwersen , George Burchell , Roel Hompes , Donald van der Peet , Eric Sonneveld , Geert Kazemier , Freek Daams","doi":"10.1016/j.ejso.2025.110367","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Colorectal resection carries a 2.8 %–30 % risk of anastomotic leakage. Machine learning can estimate risks and guide decisions, but clinical implementation remains inadequate due to transparency issues. This review assesses the performance and transparency of machine learning models compared to logistic regression.</div></div><div><h3>Methods</h3><div>A systematic review followed PRISMA guidelines. Medline, Embase, Web of Science, and Cochrane databases were searched for studies using Logistic Regression or Machine Learning with external validation for colorectal anastomotic leakage prediction. Data were extracted using CHARMS, risk of bias assessed with PROBAST, and transparency with TRIPOD + AI.</div></div><div><h3>Results</h3><div>Ten studies were included. Machine learning models were validated on smaller cohorts than logistic regression. Transparency scores ranged from 29 % to 63 %, averaging 45 % for logistic regression and 43 % for machine learning. Reporting of missing data was inconsistent, and external validation was limited. Most studies had a high risk of bias due to small sample sizes and low event counts.</div></div><div><h3>Conclusion</h3><div>In comparison to Logistic regression studies, machine learning studies are limited by small cohorts, low outcome numbers, and a lower level of transparency. Future research should prioritise transparency, adhere to TRIPOD + AI standards, and develop LR and ML models in parallel using the same datasets while ensuring separate models for colon and rectal surgery. Currently, these models are not yet suitable for clinical implementation; more robust and transparent models must be developed based on these recommendations before they can be applied in clinical practice.</div></div>","PeriodicalId":11522,"journal":{"name":"Ejso","volume":"51 10","pages":"Article 110367"},"PeriodicalIF":2.9000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ejso","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0748798325007954","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
Colorectal resection carries a 2.8 %–30 % risk of anastomotic leakage. Machine learning can estimate risks and guide decisions, but clinical implementation remains inadequate due to transparency issues. This review assesses the performance and transparency of machine learning models compared to logistic regression.
Methods
A systematic review followed PRISMA guidelines. Medline, Embase, Web of Science, and Cochrane databases were searched for studies using Logistic Regression or Machine Learning with external validation for colorectal anastomotic leakage prediction. Data were extracted using CHARMS, risk of bias assessed with PROBAST, and transparency with TRIPOD + AI.
Results
Ten studies were included. Machine learning models were validated on smaller cohorts than logistic regression. Transparency scores ranged from 29 % to 63 %, averaging 45 % for logistic regression and 43 % for machine learning. Reporting of missing data was inconsistent, and external validation was limited. Most studies had a high risk of bias due to small sample sizes and low event counts.
Conclusion
In comparison to Logistic regression studies, machine learning studies are limited by small cohorts, low outcome numbers, and a lower level of transparency. Future research should prioritise transparency, adhere to TRIPOD + AI standards, and develop LR and ML models in parallel using the same datasets while ensuring separate models for colon and rectal surgery. Currently, these models are not yet suitable for clinical implementation; more robust and transparent models must be developed based on these recommendations before they can be applied in clinical practice.
期刊介绍:
JSO - European Journal of Surgical Oncology ("the Journal of Cancer Surgery") is the Official Journal of the European Society of Surgical Oncology and BASO ~ the Association for Cancer Surgery.
The EJSO aims to advance surgical oncology research and practice through the publication of original research articles, review articles, editorials, debates and correspondence.