Jakob Woerner , Yonghyun Nam , Sang-Hyuk Jung , Manu Shivakumar , Matthew Lee , Eun Kyung Choe , Min Jung Kim , Rumi Shin , Seung-Bum Ryoo , Seung-Yong Jeong , Kyu Joo Park , Sung Chan Park , Dae Kyung Sohn , Jae Hwan Oh , Dokyoon Kim , Ji Won Park
{"title":"利用自动机器学习从临床特征和风险组预测结肠癌预后:一项回顾性队列研究","authors":"Jakob Woerner , Yonghyun Nam , Sang-Hyuk Jung , Manu Shivakumar , Matthew Lee , Eun Kyung Choe , Min Jung Kim , Rumi Shin , Seung-Bum Ryoo , Seung-Yong Jeong , Kyu Joo Park , Sung Chan Park , Dae Kyung Sohn , Jae Hwan Oh , Dokyoon Kim , Ji Won Park","doi":"10.1016/j.ejso.2025.110194","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Predicting colon cancer recurrence is crucial for determining the need for adjuvant therapy after curative resection. However, clinical decisions often rely on limited features, even when a large amount of data is available.</div></div><div><h3>Methods</h3><div>We assessed the clinical utility of automated machine learning (AutoML) models to predict the prognosis of colon cancer patients from a tertiary hospital using clinical features, pathologic characteristics, and blood markers. We also compared these AutoML models to manually trained and tuned models and evaluated survival predictions.</div></div><div><h3>Results</h3><div>We found comparable performance between linear and ensemble models, and the predicted prognosis was significantly associated with overall survival and disease-free survival outcomes. Interpretable machine learning models identified T and N staging as important features and highlighted the prognostic immune and nutritional index (PINI) as a meaningful biomarker. The XGBoost model predicted prognosis with an AUC of 0.798 in an independent test set from a different hospital, demonstrating the model's interoperability. Additionally, the model was able to distinguish stage IIA patients that would benefit from adjuvant chemotherapy, a complex and difficult decision for clinicians. We also showed that simplified models generally maintained predictive accuracy, and that the automated approach was equally predictive as manually curated models.</div></div><div><h3>Conclusion</h3><div>With extensive validation through multiple test sets and internal cross-validation, this work underscores the potential of AutoML in identifying survival-related signatures in colon cancer from routinely collected data, providing clinicians with valuable insights for personalized treatment strategies.</div></div>","PeriodicalId":11522,"journal":{"name":"Ejso","volume":"51 9","pages":"Article 110194"},"PeriodicalIF":2.9000,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging automated machine learning to predict colon cancer prognosis from clinical features and risk groups: a retrospective cohort study\",\"authors\":\"Jakob Woerner , Yonghyun Nam , Sang-Hyuk Jung , Manu Shivakumar , Matthew Lee , Eun Kyung Choe , Min Jung Kim , Rumi Shin , Seung-Bum Ryoo , Seung-Yong Jeong , Kyu Joo Park , Sung Chan Park , Dae Kyung Sohn , Jae Hwan Oh , Dokyoon Kim , Ji Won Park\",\"doi\":\"10.1016/j.ejso.2025.110194\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Predicting colon cancer recurrence is crucial for determining the need for adjuvant therapy after curative resection. However, clinical decisions often rely on limited features, even when a large amount of data is available.</div></div><div><h3>Methods</h3><div>We assessed the clinical utility of automated machine learning (AutoML) models to predict the prognosis of colon cancer patients from a tertiary hospital using clinical features, pathologic characteristics, and blood markers. We also compared these AutoML models to manually trained and tuned models and evaluated survival predictions.</div></div><div><h3>Results</h3><div>We found comparable performance between linear and ensemble models, and the predicted prognosis was significantly associated with overall survival and disease-free survival outcomes. Interpretable machine learning models identified T and N staging as important features and highlighted the prognostic immune and nutritional index (PINI) as a meaningful biomarker. The XGBoost model predicted prognosis with an AUC of 0.798 in an independent test set from a different hospital, demonstrating the model's interoperability. Additionally, the model was able to distinguish stage IIA patients that would benefit from adjuvant chemotherapy, a complex and difficult decision for clinicians. We also showed that simplified models generally maintained predictive accuracy, and that the automated approach was equally predictive as manually curated models.</div></div><div><h3>Conclusion</h3><div>With extensive validation through multiple test sets and internal cross-validation, this work underscores the potential of AutoML in identifying survival-related signatures in colon cancer from routinely collected data, providing clinicians with valuable insights for personalized treatment strategies.</div></div>\",\"PeriodicalId\":11522,\"journal\":{\"name\":\"Ejso\",\"volume\":\"51 9\",\"pages\":\"Article 110194\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-05-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ejso\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0748798325006225\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ejso","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0748798325006225","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
Leveraging automated machine learning to predict colon cancer prognosis from clinical features and risk groups: a retrospective cohort study
Background
Predicting colon cancer recurrence is crucial for determining the need for adjuvant therapy after curative resection. However, clinical decisions often rely on limited features, even when a large amount of data is available.
Methods
We assessed the clinical utility of automated machine learning (AutoML) models to predict the prognosis of colon cancer patients from a tertiary hospital using clinical features, pathologic characteristics, and blood markers. We also compared these AutoML models to manually trained and tuned models and evaluated survival predictions.
Results
We found comparable performance between linear and ensemble models, and the predicted prognosis was significantly associated with overall survival and disease-free survival outcomes. Interpretable machine learning models identified T and N staging as important features and highlighted the prognostic immune and nutritional index (PINI) as a meaningful biomarker. The XGBoost model predicted prognosis with an AUC of 0.798 in an independent test set from a different hospital, demonstrating the model's interoperability. Additionally, the model was able to distinguish stage IIA patients that would benefit from adjuvant chemotherapy, a complex and difficult decision for clinicians. We also showed that simplified models generally maintained predictive accuracy, and that the automated approach was equally predictive as manually curated models.
Conclusion
With extensive validation through multiple test sets and internal cross-validation, this work underscores the potential of AutoML in identifying survival-related signatures in colon cancer from routinely collected data, providing clinicians with valuable insights for personalized treatment strategies.
期刊介绍:
JSO - European Journal of Surgical Oncology ("the Journal of Cancer Surgery") is the Official Journal of the European Society of Surgical Oncology and BASO ~ the Association for Cancer Surgery.
The EJSO aims to advance surgical oncology research and practice through the publication of original research articles, review articles, editorials, debates and correspondence.