Development and comparison of machine-learning models for predicting prolonged postoperative length of stay in lung cancer patients following video-assisted thoracoscopic surgery

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS
Guolong Zhang , Xuanhui Liu , Yuning Hu , Qinchi Luo , Liang Ruan , Hongxia Xie , Yingchun Zeng
{"title":"Development and comparison of machine-learning models for predicting prolonged postoperative length of stay in lung cancer patients following video-assisted thoracoscopic surgery","authors":"Guolong Zhang ,&nbsp;Xuanhui Liu ,&nbsp;Yuning Hu ,&nbsp;Qinchi Luo ,&nbsp;Liang Ruan ,&nbsp;Hongxia Xie ,&nbsp;Yingchun Zeng","doi":"10.1016/j.apjon.2024.100493","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>This study aimed to develop models for predicting prolonged postoperative length of stay (PPOLOS) in lung cancer patients undergoing video-assisted thoracoscopic surgery (VATS) by utilizing machine-learning techniques. These models aim to offer valuable insights for clinical decision-making.</p></div><div><h3>Methods</h3><p>This retrospective cohort study analyzed a dataset of lung cancer patients who underwent VATS, identifying 25 numerical features and 45 textual features. Three classification machine-learning models were developed: XGBoost, random forest, and neural network. The performance of these models was evaluated based on accuracy (ACC) and area under the receiver operating characteristic curve, whereas the importance of variables was assessed using the feature importance parameter from the random forest model.</p></div><div><h3>Results</h3><p>Of the 6767 lung cancer patients, 1481 patients (21.9%) experienced a postoperative length of stay of &gt; 4 days. The majority were male (4111, 60.8%), married (6246, 92.3%), and diagnosed with adenocarcinoma (4145, 61.3%). The Random Forest classifier exhibited superior prediction performance with an area under the curve (AUC) of 0.792 and ACC of 0.804. The calibration plot revealed that all three classifiers were in close alignment with the ideal calibration line, indicating high calibration reliability. The five most critical features identified were the following: surgical duration (0.116), age (0.066), creatinine (0.062), hemoglobin (0.058), and total protein (0.054).</p></div><div><h3>Conclusions</h3><p>This study developed and evaluated three machine-learning models for predicting PPOLOS in lung cancer patients undergoing VATS. The findings revealed that the Random Forest model is most accurately predicting the PPOLOS. Findings of this study enable the identification of crucial determinants and the formulation of targeted interventions to shorten the length of stay among lung cancer patients after VATS, which contribute to optimize the allocation of healthcare resources.</p></div>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2347562524001136/pdfft?md5=70d08ea2cb66b8116ae9b2a1af0e75a8&pid=1-s2.0-S2347562524001136-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2347562524001136","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Objective

This study aimed to develop models for predicting prolonged postoperative length of stay (PPOLOS) in lung cancer patients undergoing video-assisted thoracoscopic surgery (VATS) by utilizing machine-learning techniques. These models aim to offer valuable insights for clinical decision-making.

Methods

This retrospective cohort study analyzed a dataset of lung cancer patients who underwent VATS, identifying 25 numerical features and 45 textual features. Three classification machine-learning models were developed: XGBoost, random forest, and neural network. The performance of these models was evaluated based on accuracy (ACC) and area under the receiver operating characteristic curve, whereas the importance of variables was assessed using the feature importance parameter from the random forest model.

Results

Of the 6767 lung cancer patients, 1481 patients (21.9%) experienced a postoperative length of stay of > 4 days. The majority were male (4111, 60.8%), married (6246, 92.3%), and diagnosed with adenocarcinoma (4145, 61.3%). The Random Forest classifier exhibited superior prediction performance with an area under the curve (AUC) of 0.792 and ACC of 0.804. The calibration plot revealed that all three classifiers were in close alignment with the ideal calibration line, indicating high calibration reliability. The five most critical features identified were the following: surgical duration (0.116), age (0.066), creatinine (0.062), hemoglobin (0.058), and total protein (0.054).

Conclusions

This study developed and evaluated three machine-learning models for predicting PPOLOS in lung cancer patients undergoing VATS. The findings revealed that the Random Forest model is most accurately predicting the PPOLOS. Findings of this study enable the identification of crucial determinants and the formulation of targeted interventions to shorten the length of stay among lung cancer patients after VATS, which contribute to optimize the allocation of healthcare resources.

用于预测肺癌患者视频辅助胸腔镜手术后住院时间延长的机器学习模型的开发与比较
目的本研究旨在利用机器学习技术开发预测接受视频辅助胸腔镜手术(VATS)的肺癌患者术后住院时间延长(PPOLOS)的模型。这项回顾性队列研究分析了接受 VATS 手术的肺癌患者数据集,确定了 25 个数字特征和 45 个文字特征。开发了三种分类机器学习模型:XGBoost、随机森林和神经网络。根据准确率(ACC)和接收者操作特征曲线下面积评估了这些模型的性能,并使用随机森林模型中的特征重要性参数评估了变量的重要性。大多数患者为男性(4111 人,占 60.8%)、已婚(6246 人,占 92.3%)和腺癌(4145 人,占 61.3%)。随机森林分类器的预测性能优越,曲线下面积(AUC)为 0.792,ACC 为 0.804。校准图显示,所有三个分类器都与理想校准线接近,表明校准可靠性很高。确定的五个最关键特征如下:手术时间(0.116)、年龄(0.066)、肌酐(0.062)、血红蛋白(0.058)和总蛋白(0.054)。研究结果表明,随机森林模型对 PPOLOS 的预测最为准确。这项研究的结果有助于确定关键的决定因素并制定有针对性的干预措施,以缩短肺癌患者接受 VATS 术后的住院时间,从而有助于优化医疗资源的分配。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信