Development and comparison of machine-learning models for predicting prolonged postoperative length of stay in lung cancer patients following video-assisted thoracoscopic surgery

IF 2.8 3区医学 Q1 NURSING

Asia-Pacific Journal of Oncology Nursing Pub Date : 2024-04-22 DOI:10.1016/j.apjon.2024.100493

Guolong Zhang , Xuanhui Liu , Yuning Hu , Qinchi Luo , Liang Ruan , Hongxia Xie , Yingchun Zeng

{"title":"Development and comparison of machine-learning models for predicting prolonged postoperative length of stay in lung cancer patients following video-assisted thoracoscopic surgery","authors":"Guolong Zhang , Xuanhui Liu , Yuning Hu , Qinchi Luo , Liang Ruan , Hongxia Xie , Yingchun Zeng","doi":"10.1016/j.apjon.2024.100493","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>This study aimed to develop models for predicting prolonged postoperative length of stay (PPOLOS) in lung cancer patients undergoing video-assisted thoracoscopic surgery (VATS) by utilizing machine-learning techniques. These models aim to offer valuable insights for clinical decision-making.</p></div><div><h3>Methods</h3><p>This retrospective cohort study analyzed a dataset of lung cancer patients who underwent VATS, identifying 25 numerical features and 45 textual features. Three classification machine-learning models were developed: XGBoost, random forest, and neural network. The performance of these models was evaluated based on accuracy (ACC) and area under the receiver operating characteristic curve, whereas the importance of variables was assessed using the feature importance parameter from the random forest model.</p></div><div><h3>Results</h3><p>Of the 6767 lung cancer patients, 1481 patients (21.9%) experienced a postoperative length of stay of > 4 days. The majority were male (4111, 60.8%), married (6246, 92.3%), and diagnosed with adenocarcinoma (4145, 61.3%). The Random Forest classifier exhibited superior prediction performance with an area under the curve (AUC) of 0.792 and ACC of 0.804. The calibration plot revealed that all three classifiers were in close alignment with the ideal calibration line, indicating high calibration reliability. The five most critical features identified were the following: surgical duration (0.116), age (0.066), creatinine (0.062), hemoglobin (0.058), and total protein (0.054).</p></div><div><h3>Conclusions</h3><p>This study developed and evaluated three machine-learning models for predicting PPOLOS in lung cancer patients undergoing VATS. The findings revealed that the Random Forest model is most accurately predicting the PPOLOS. Findings of this study enable the identification of crucial determinants and the formulation of targeted interventions to shorten the length of stay among lung cancer patients after VATS, which contribute to optimize the allocation of healthcare resources.</p></div>","PeriodicalId":8569,"journal":{"name":"Asia-Pacific Journal of Oncology Nursing","volume":"11 6","pages":"Article 100493"},"PeriodicalIF":2.8000,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2347562524001136/pdfft?md5=70d08ea2cb66b8116ae9b2a1af0e75a8&pid=1-s2.0-S2347562524001136-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asia-Pacific Journal of Oncology Nursing","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2347562524001136","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NURSING","Score":null,"Total":0}

引用次数: 0

Abstract

Objective

This study aimed to develop models for predicting prolonged postoperative length of stay (PPOLOS) in lung cancer patients undergoing video-assisted thoracoscopic surgery (VATS) by utilizing machine-learning techniques. These models aim to offer valuable insights for clinical decision-making.

Methods

This retrospective cohort study analyzed a dataset of lung cancer patients who underwent VATS, identifying 25 numerical features and 45 textual features. Three classification machine-learning models were developed: XGBoost, random forest, and neural network. The performance of these models was evaluated based on accuracy (ACC) and area under the receiver operating characteristic curve, whereas the importance of variables was assessed using the feature importance parameter from the random forest model.

Results

Of the 6767 lung cancer patients, 1481 patients (21.9%) experienced a postoperative length of stay of > 4 days. The majority were male (4111, 60.8%), married (6246, 92.3%), and diagnosed with adenocarcinoma (4145, 61.3%). The Random Forest classifier exhibited superior prediction performance with an area under the curve (AUC) of 0.792 and ACC of 0.804. The calibration plot revealed that all three classifiers were in close alignment with the ideal calibration line, indicating high calibration reliability. The five most critical features identified were the following: surgical duration (0.116), age (0.066), creatinine (0.062), hemoglobin (0.058), and total protein (0.054).

Conclusions

This study developed and evaluated three machine-learning models for predicting PPOLOS in lung cancer patients undergoing VATS. The findings revealed that the Random Forest model is most accurately predicting the PPOLOS. Findings of this study enable the identification of crucial determinants and the formulation of targeted interventions to shorten the length of stay among lung cancer patients after VATS, which contribute to optimize the allocation of healthcare resources.

查看原文本刊更多论文

用于预测肺癌患者视频辅助胸腔镜手术后住院时间延长的机器学习模型的开发与比较

目的本研究旨在利用机器学习技术开发预测接受视频辅助胸腔镜手术（VATS）的肺癌患者术后住院时间延长（PPOLOS）的模型。这项回顾性队列研究分析了接受 VATS 手术的肺癌患者数据集，确定了 25 个数字特征和 45 个文字特征。开发了三种分类机器学习模型：XGBoost、随机森林和神经网络。根据准确率（ACC）和接收者操作特征曲线下面积评估了这些模型的性能，并使用随机森林模型中的特征重要性参数评估了变量的重要性。大多数患者为男性（4111 人，占 60.8%）、已婚（6246 人，占 92.3%）和腺癌（4145 人，占 61.3%）。随机森林分类器的预测性能优越，曲线下面积（AUC）为 0.792，ACC 为 0.804。校准图显示，所有三个分类器都与理想校准线接近，表明校准可靠性很高。确定的五个最关键特征如下：手术时间（0.116）、年龄（0.066）、肌酐（0.062）、血红蛋白（0.058）和总蛋白（0.054）。研究结果表明，随机森林模型对 PPOLOS 的预测最为准确。这项研究的结果有助于确定关键的决定因素并制定有针对性的干预措施，以缩短肺癌患者接受 VATS 术后的住院时间，从而有助于优化医疗资源的分配。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊