Development and comparison of machine-learning models for predicting prolonged postoperative length of stay in lung cancer patients following video-assisted thoracoscopic surgery
Guolong Zhang , Xuanhui Liu , Yuning Hu , Qinchi Luo , Liang Ruan , Hongxia Xie , Yingchun Zeng
DOI: 10.1016/j.apjon.2024.100493
Published: 2024-04-22
Full text: https://www.sciencedirect.com/science/article/pii/S2347562524001136
Citations: 0
Abstract
Objective
This study aimed to develop machine-learning models for predicting prolonged postoperative length of stay (PPOLOS) in lung cancer patients undergoing video-assisted thoracoscopic surgery (VATS), and to provide insights that can support clinical decision-making.
Methods
This retrospective cohort study analyzed a dataset of lung cancer patients who underwent VATS, identifying 25 numerical features and 45 textual features. Three machine-learning classifiers were developed: XGBoost, random forest, and a neural network. Model performance was evaluated using accuracy (ACC) and the area under the receiver operating characteristic curve, and variable importance was assessed using the feature importance parameter of the random forest model.
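The two evaluation metrics named above can be illustrated with a minimal sketch. It is not the study's code: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration. The AUC is computed with the pairwise-ranking definition (the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case), which is equivalent to the area under the ROC curve.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_prob: Sequence[float], threshold: float = 0.5) -> float:
    """Fraction of cases whose thresholded prediction matches the label.

    The 0.5 threshold is an assumption; the abstract does not state one.
    """
    correct = sum(int(p >= threshold) == y for y, p in zip(y_true, y_prob))
    return correct / len(y_true)


def roc_auc(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    A tie between a positive and a negative score counts as one half.
    """
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = prolonged stay, 0 = otherwise.
labels = [0, 0, 1, 0, 1, 1, 0, 0]
scores = [0.10, 0.40, 0.35, 0.20, 0.80, 0.70, 0.55, 0.05]

print(f"ACC = {accuracy(labels, scores):.3f}")
print(f"AUC = {roc_auc(labels, scores):.3f}")
```

The pairwise loop is quadratic in the number of cases, which is fine for a demonstration; on a cohort of several thousand patients a rank-based or library implementation would be the more practical choice.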
Results
Of the 6767 lung cancer patients, 1481 (21.9%) had a postoperative length of stay of more than 4 days. The majority were male (4111, 60.8%), married (6246, 92.3%), and diagnosed with adenocarcinoma (4145, 61.3%). The random forest classifier showed the best predictive performance of the three models, with an area under the curve (AUC) of 0.792 and an ACC of 0.804. The calibration plot showed that all three classifiers closely followed the ideal calibration line, indicating good calibration. The five most important features were surgical duration (0.116), age (0.066), creatinine (0.062), hemoglobin (0.058), and total protein (0.054).
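Two of the quantities in these results lend themselves to a short worked sketch, given here as a reader's aid rather than as the authors' analysis. The first part recomputes the PPOLOS rate from the reported counts and derives the accuracy of a rule that always predicts "not prolonged", a common reference point when reading ACC on an imbalanced outcome. The second part shows how the points of a calibration plot are usually obtained: predictions are grouped into probability bins, and the mean predicted probability in each bin is compared with the observed event rate. The helper name, the equal-width binning, the bin count, and the toy data are assumptions.

```python
from typing import List, Sequence, Tuple

# Counts taken from the abstract.
n_patients = 6767
n_prolonged = 1481

prevalence = n_prolonged / n_patients      # share with a stay of more than 4 days
majority_baseline_acc = 1.0 - prevalence   # accuracy of always predicting "not prolonged"

print(f"PPOLOS rate: {prevalence:.3f}")
print(f"Majority-class accuracy: {majority_baseline_acc:.3f}")


def calibration_bins(
    y_true: Sequence[int], y_prob: Sequence[float], n_bins: int = 5
) -> List[Tuple[float, float, int]]:
    """Return (mean predicted probability, observed event rate, count) per non-empty bin."""
    sums = [0.0] * n_bins
    events = [0] * n_bins
    counts = [0] * n_bins
    for y, p in zip(y_true, y_prob):
        # Equal-width bins on [0, 1]; a probability of exactly 1.0 goes in the last bin.
        idx = min(int(p * n_bins), n_bins - 1)
        sums[idx] += p
        events[idx] += y
        counts[idx] += 1
    return [
        (sums[i] / counts[i], events[i] / counts[i], counts[i])
        for i in range(n_bins)
        if counts[i] > 0
    ]


# Hypothetical toy predictions, not study data.
toy_labels = [0, 0, 0, 0, 1, 0, 1, 1, 1, 1]
toy_probs = [0.1, 0.1, 0.1, 0.1, 0.1, 0.5, 0.5, 0.9, 0.9, 0.9]

for mean_pred, observed, count in calibration_bins(toy_labels, toy_probs):
    print(f"predicted {mean_pred:.2f}  observed {observed:.2f}  n={count}")
```

A well-calibrated model yields bins whose observed rate is close to the mean predicted probability, which is what "close to the ideal calibration line" means on the plot.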
Conclusions
This study developed and evaluated three machine-learning models for predicting PPOLOS in lung cancer patients undergoing VATS. The random forest model predicted PPOLOS most accurately. These findings can help identify key determinants and inform targeted interventions to shorten the length of stay among lung cancer patients after VATS, which may in turn help optimize the allocation of healthcare resources.